WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 : 

C12N 15/12, C07K 14/47, 16/18, C12Q 
1/68 



A2 



(11) International Publication Number: 
(43) International Publication Date: 



WO 99/33982 

8 July 1999 (08.07.99) 



(21) International Application Number: PCT/US98/27610 

(22) International Filing Date: 22 December 1998 (22.12.98) 



(30) Priority Data: 

60/068,755 
60/080,664 
60/105,234 
60/105,877 
09/217,471 



23 December 1997 (23.12.97) US 

3 April 1998 (03.04.98) US 

21 October 1998 (21 .10.98) US 

27 October 1998 (27.10.98) US 

21 December 1998 (21 .12.98) US 



(71) Applicants: CHIRON CORPORATION [US/US]; 4560 Horton 

Street - R440, Emeryville, CA 94608 (US). HYSEQ INC. 
[US/US]; 675 Almanor Avenue, Sunnyvale, CA 94086 (US). 

(72) Inventors: WILLIAMS, Lewis, T.; 3 Miroflores, Tiburon, CA 

94920 (US). ESCOBEDO, Jaime; 1470 Lavoma Road, 
Alamo, CA 94507 (US). INNIS, Michael, A.; 315 Con- 
stance Place, Moraga, CA 94556 (US). GARCIA, Pablo, 
Dominguez; 882 Chenery Street, San Francisco, CA 94131 
(US). SUDDUTH-KLINGER, Julie; 280 Lexington Road, 
Kensington, CA 94707 (US). REINHARD, Christoph; 1633 
Clinton Avenue, Alameda, CA 94501 (US). GIESE, Klaus; 
Chausseetrabe 92, D-101 15 Berlin (DE). RANDAZZO, Fil- 
ippo; Apt. 403, 690 Chestnut Street, San Francisco, CA 
94133 (US). KENNEDY, Giulia, C; 360 Castenada Av- 



enue, San Francisco, CA 941 16 (US). POT, David; 1565 5th 
Avenue #102, San Francisco, CA 94112 (US). KASSAM, 
Altaf; 2659 Harold Street, Oakland, CA 94602 (US). LAM- 
SON, George; 232 Sandringham Drive, Moraga, CA 94556 
(US). DRMANAC, Radoje; 850 East Greenwich Place, Palo 
Alto, CA 94303 (US). CRKVENJAKOV, Radomir, 762 
Haverhill Drive, Sunnyvale, CA 94068 (US). DICKSON, 
Mark; 1411 Gabilan Drive #B, Hollister, CA 95025 (US). 
DRMANAC, Snezana; 850 East Greenwich Place, Palo 
Alto, CA 94303 (US). LABAT, Ivan; 140 Acalanes Drive, 
Sunnyvale, CA 94086 (US). LESHKOWITZ, Dena; 678 
Durshire Way, Sunnyvale, CA 94087 (US). KITA, David; 
899 Bounty Drive, Foster City, CA 94404 (US). GARCIA, 
Veronica; Apartment 412, 396 Ano Nuevo, Sunnyvale, CA 
94086 (US). JONES, Lee, William; 396 Ano Nuevo #412, 
Sunnyvale, CA 94086 (US). STACHE-CRAIN, Birgit; 345 
South Mary Avenue, Sunnyvale, CA 94086 (US). 

(74) Agent: BLACKBURN, Robert, P.; Chiron Corporation, P.O. 
Box 8097, Emeryville, CA 94662-8097 (US). 



(81) Designated States: AL, AM, AT, AU, AZ, BA, BB, BG, BR, 
BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, FI, GB, GD, 
GE, GH, GM, HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, 
KR, KZ, LC, LK, LR, LS, LT, LU, LV, MD, MG, MK, 
MN, MW, MX, NO, NZ, PL, PT, RO, RU, SD, SE, SG, 
SI, SK, SL, TJ, TM, TR, TT, UA, UG, UZ, VN, YU, ZW, 
ARIPO patent (GH, GM, KE, LS, MW, SD, SZ, UG, ZW), 
Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
European patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, 
GB, GR, IE, IT, LU, MC, NL, PT, SE), OAPI patent (BF, 
BJ, CF, CG, CI, CM, GA, GN, GW, ML, MR, NE, SN, 
TD, TO). 



Published 

Without international search report and to be republished 
upon receipt of that report. 



(54) Title: HUMAN GENES AND GENE EXPRESSION PRODUCTS I 
(57) Abstract 

This invention relates to novel human polynucleotides and variants thereof, their encoded polypeptides and variants thereof, to genes 
corresponding to these polynucleotides and to proteins expressed by the genes. The invention also relates to diagnostic and therapeutic 
agents employing such novel human polynucleotides, their corresponding genes or gene products, e.g., these genes and proteins, including 
probes, antisense constructs, and antibodies. 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


FI 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


France 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 
Barbados 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BB 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


uz 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


ZVV 


Zimbabwe 


CI 


Cote d'lvoire 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


PT 


Portugal 






cu 


Cuba 


KZ 


Kazaksfan 


RO 


Romania 






cz 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






DE 


Germany 


LI 


Liechtenstein 


SD 


Sudan 






DK 


Denmark 


LK 


Sri Lanka 


SE 


Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Singapore 







WO 99/33982 PCT/US98/27610 
NOVEL HUMAN GENES AND GENE EXPRESSION PRODUCTS I 
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60/105,234, filed October 21, 1998, each of which applications are incorporated herein by 
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10 Field of the Invention 

The present invention relates to novel polynucleotides, particularly to novel 
polynucleotides of human origin that are expressed in a selected cell type, are differentially 
expressed in one cell type relative to another cell type (e.g., in cancerous cells, or in cells of a 
specific tissue origin) and/or share homology to polynucleotides encoding a gene product 

15 having an identified functional domain and/or activity. 

Background of the Invention 

Identification of novel polynucleotides, particularly those that encode an expressed 
gene product, is important in the advancement of drug discovery, diagnostic technologies, 

20 and the understanding of the progression and nature of complex diseases such as cancer. 
Identification of genes expressed in different cell types isolated from sources that differ in 
disease state or stage, developmental stage, exposure to various environmental factors, the 
tissue of origin, the species from which the tissue was isolated, and the like is key to 
identifying the genetic factors that are responsible for the phenotypes associated with these 

25 various differences 

This invention provides novel human polynucleotides, the polypeptides encoded by 
these polynucleotides, and the genes and proteins corresponding to these novel 
polynucleotides. 



30 Summary of the Invention 

This invention relates to novel human polynucleotides and variants thereof, their 
encoded polypeptides and variants thereof, to genes corresponding to these polynucleotides 

1 
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and to proteins expressed by the genes. The invention also relates to diagnostic and 

therapeutic agents employing such novel human polynucleotides, their corresponding genes 

or gene products, e.g., these genes and proteins, including probes, antisense constructs, and 

antibodies. 

5 Accordingly, in one embodiment, the present invention features a library of 

polynucleotides, the library comprising the sequence information of at least one of SEQ ID 
NOS: 1 -844. In related aspects, the invention features a library provided on a nucleic acid 
array, or in a computer-readable format. 

In one embodiment, the library is comprises a differentially expressed polynucleotide 

10 comprising a sequence selected from the group consisting of SEQ ID NOS:9, 39, 42, 52, 62, 
74, 1 19, 172, 317, and 379. In specific related embodiments, the library comprises: 1) a 
polynucleotide that is differentially expressed in a human breast cancer cell, where the 
polynucleotide comprises a sequence selected from the group consisting of SEQ ID NOS: 4, 
9, 39, 42, 52, 62, 65, 66, 68, 74, 81, 114, 123, 144, 130, 157, 162, 172, 178, 183, 202, 214, 

15 219, 223, 258, 298, 317, 338, 379, 384, 386, and 388; 2) a polynucleotide differentially 
expressed in a human colon cancer cell, where the polynucleotide comprises a sequence 
selected from the group consisting of SEQ ID NOS: 1, 39, 52, 97, 1 19, 134, 172, 176, 241, 
288, 317, 357, 362, and 374; or 3) a polynucleotide differentially expressed in a human lung 
cancer cell, where the polynucleotide comprises a sequence selected from the group 

20 consisting of SEQ ID NOS: 9, 34, 42, 62, 74, 106, 119, 135, 154, 160,260,308,323,349, 
361, 369, 371, 379, 395, 381, and 400. 

In another aspect, the invention features an isolated polynucleotide 
comprising a nucleotide sequence having at least 90% sequence identity to an identifying 
sequence of SEQ ID NOS: 1-844 or a degenerate variant thereof. In related aspects, the 

25 invention features recombinant host cells and vectors comprising the polynucleotides of the 
invention, as well as isolated polypeptides encoded by the polynucleotides of the invention 
and antibodies that specifically bind such polypeptides. 

In one embodiment, the invention features an isolated polynucleotide comprising a 
sequence encoding a polypeptide of a protein family selected from the group consisting of: 

30 4 transmembrane segments integral membrane proteins, 7 transmembrane receptors, 

ATPases associated with various cellular activities (AAA), eukaryotic aspartyl proteases, 
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GATA family of transcription factors, G-protein alpha subunit, phorbol esters/diacylglycerol 

binding proteins, protein kinase, protein phosphatase 2C, protein tyrosine phosphatase, 

trypsin, wnt family of developmental signaling proteins, and WW/rsp5/WWP domain 

containing proteins. In a specific related embodiment, the invention features a 

5 polynucleotide comprising a sequence of one of SEQ ID NOS: 24, 41, 101, 157, 291, 305, 

315, 341, 63, 1 16, 134, 136, 151, 384, 404, 308, 213, 367, 188, 251, 202, 315, 367, 397, 256, 

382, 169, 23, 291, 324, 330, 341, 353, 188, 379 , and 395. 

In another embodiment, the invention features a polynucleotide comprising a 

sequence encoding a polypeptide having a functional domain selected from the group 

10 consisting of: Ank repeat, basic region plus leucine zipper transcription factors, 

bromodomain, EF-hand, SH3 domain, WD domain/G-beta repeats, zinc finger (C2H2 type), 
zinc finger (CCHC class), and zinc-binding metalloprotease domain. In a specific related 
embodiment, the invention features a polynucleotide comprising a sequence of one of SEQ 
ID NOS: 1 16, 251, 374, 97, 136, 242, 379, 306, 386, 18, 335, 61, 306, 386, 322, 306, and 

15 395. 

In another aspect, the invention features a method of detecting differentially 
expressed genes correlated with a cancerous state of a mammalian cell, where the method 
comprises the step of detecting at least one differentially expressed gene product in a test 
sample derived from a cell suspected of being cancerous, where the gene product is encoded 

20 by a gene corresponding to a sequence of at least one of SEQ ID NOS:4, 9, 39, 42, 52, 62, 
65, 66, 68, 74, 81, 1 14, 123, 144, 130, 157, 162, 172, 178, 183, 202, 214, 219, 223, 258, 298, 
317, 338, 379, 384, 386, 388, 1, 39, 52, 97, 119, 134, 172, 176, 241, 288, 317, 357, 362, 374, 
9, 34, 42, 62, 74, 106, 1 19, 135, 154, 160, 260, 308, 323, 349, 361, 369, 371, 379, 395, 381, 
and 400. Detection of the differentially expressed gene product is correlated with a 

25 cancerous state of the cell from which the test sample was derived. In one embodiment, the 
detecting is by hybridization of the test sample to a reference array, wherein the reference 
array comprises an identifying sequence of at least one of SEQ ID NOS: 1-844. 

In one embodiment of the method of the invention, the cell is a breast tissue derived 
cell, and the differentially expressed gene product is encoded by a gene corresponding to a 

30 sequence of at least one of SEQ ID NOS: 4, 9, 39, 42, 52, 62, 65, 66, 68, 74, 81, 1 14, 123, 
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144, 130, 157, 162, 172, 178, 183, 202,214,219,223, 258,298,317,338,379,384, 386, 

and 388. 

In another embodiment of the method of the invention, the cell is a colon tissue 
derived cell, and differentially expressed gene product is encoded by a gene corresponding to 
a sequence of at least one of SEQ ID NOS: 1, 39, 52, 97, 119, 134, 172, 176, 241, 288, 317, 
357, 362, and 374. 

In yet another embodiment of the method of the invention, the cell is a lung tissue 
derived cell, and differentially expressed gene product is encoded by a gene corresponding to 
asequence of at least one of SEQ IDNOS:9, 34, 42,62,74, 106, 119, 135, 154, 160, 260, 
308, 323, 349, 361, 369, 371, 379, 395, 381, and 400. 

Other aspects and embodiments of the invention will be readily apparent to the 
ordinarily skilled artisan upon reading the description provided herein. 

Detailed Description of the Invention 

The invention relates to polynucleotides comprising the disclosed nucleotide 
sequences, to full length cDNA, mRNA and genes corresponding to these sequences, and to 
polypeptides and proteins encoded by these polynucleotides and genes. 

Also included are polynucleotides that encode polypeptides and proteins encoded by 
the polynucleotides of the Sequence Listing. The various polynucleotides that can encode 
these polypeptides and proteins differ because of the degeneracy of the genetic code, in that 
most amino acids are encoded by more than one triplet codon. The identity of such codons 
is well-known in this art, and this information can be used for the construction of the 
polynucleotides within the scope of the invention. 

Polynucleotides encoding polypeptides and proteins that are variants of the 
polypeptides and proteins encoded by the polynucleotides and related cDNA and genes are 
also within the scope of the invention. The variants differ from wild type protein in having 
one or more amino acid substitutions that either enhance, add, or diminish a biological 
activity of the wild type protein. Once the amino acid change is selected, a polynucleotide 
encoding that variant is constructed according to the invention. 

The following detailed description describes the polynucleotide compositions 
encompassed by the invention, methods for obtaining cDNA or genomic DNA encoding a 
full-length gene product, expression of these polynucleotides and genes, identification of 
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structural motifs of the polynucleotides and genes, identification of the function of a gene 

product encoded by a gene corresponding to a polynucleotide of the invention, use of the 

provided polynucleotides as probes and in mapping and in tissue profiling, use of the 

corresponding polypeptides and other gene products to raise antibodies, and use of the 

5 polynucleotides and their encoded gene products for therapeutic and diagnostic purposes. 

I. Polynucleotide Compositions 

The scope of the invention with respect to polynucleotide compositions includes, but is not 
necessarily limited to, polynucleotides having a sequence set forth in any one of SEQ ID 

1 0 NOS: 1 -844; polynucleotides obtained from the biological materials described herein or other 
biological sources (particularly human sources) by hybridization under stringent conditions 
(particularly conditions of high stringency); genes corresponding to the provided 
polynucleotides; variants of the provided polynucleotides and their corresponding genes, 
particularly those variants that retain a biological activity of the encoded gene product (e.g., 

1 5 a biological activity ascribed to a gene product corresponding to the provided 

polynucleotides as a result of the assignment of the gene product to a protein family(ies) 
and/or identification of a functional domain present in the gene product). Other nucleic acid 
compositions contemplated by and within the scope of the present invention will be readily 
apparent to one of ordinary skill in the art when provided with the disclosure here. 

20 The invention features polynucleotides that are expressed in cells of human tissue, 

specifically human colon, breast, and/or lung tissue. Novel nucleic acid compositions of the 
invention of particular interest comprise a sequence set forth in any one of SEQ ID NOS:l- 
844 or an identifying sequence thereof. An "identifying sequence" is a contiguous sequence 
of residues at least about 1 0 nt to about 20 nt in length, usually at least about 50 nt to about 

25 100 nt in length, that uniquely identifies a polynucleotide sequence, e.g., exhibits less than 
90%, usually less than about 80% to about 85% sequence identity to any contiguous 
nucleotide sequence of more than about 20 nt. Thus, the subject novel nucleic acid 
compositions include full length cDNAs or mRNAs that encompass an identifying sequence 
of contiguous nucleotides from any one of SEQ ID NOS: 1-844. 

30 The polynucleotides of the invention also include polynucleotides having sequence 

similarity or sequence identity. Nucleic acids having sequence similarity are detected by 
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hybridization under low stringency conditions, for example, at 50°C and 10XSSC (0.9 M 

saline/0.09 M sodium citrate) and remain bound when subjected to washing at 55°C in 

1XSSC. Sequence identity can be determined by hybridization under stringent conditions, 

for example, at 50°C or higher and 0.1XSSC (9 mM saline/0.9 mM sodium citrate), 

5 Hybridization methods and conditions are well known in the art, see, e.g., U.S. Patent No. 

5,707,829. Nucleic acids that are substantially identical to the provided polynucleotide 

sequences, e.g. allelic variants, genetically altered versions of the gene, etc., bind to the 

provided polynucleotide sequences (SEQ ID NOS: 1-844) under stringent hybridization 

conditions. By using probes, particularly labeled probes of DNA sequences, one can isolate 

10 homologous or related genes. The source of homologous genes can be any species, e.g. 

primate species, particularly human; rodents, such as rats and mice, canines, felines, bovines, 
ovines, equines, yeast, nematodes, etc. 

Preferably, hybridization is performed using at least 15 contiguous nucleotides of at 
least one of SEQ ID NOS: 1-844. That is, when at least 15 contiguous nucleotides of one of 

1 5 the disclosed SEQ ID NOs. is used as a probe, the probe will preferentially hybridize with a 
gene or mRNA (of the biological material) comprising the complementary sequence, 
allowing the identification and retrieval of the nucleic acids of the biological material that 
uniquely hybridize to the selected probe. Probes from more than one SEQ ID NO. will 
hybridize with the same gene or mRNA if the cDNA from which they were derived 

20 corresponds to one mRNA. Probes of more than 15 nucleotides can be used, but 15 
nucleotides represents enough sequence for unique identification. 

The polynucleotides of the invention also include naturally occurring variants of the 
nucleotide sequences (e.g., degenerate variants, allelic variants, etc.). Variants of the 
polynucleotides of the invention are identified by hybridization of putative variants with 

25 nucleotide sequences disclosed herein, preferably by hybridization under stringent conditions 
For example, by using appropriate wash conditions, variants of the polynucleotides of the 
invention can be identified where the allelic variant exhibits at most about 25-30% base pair 
mismatches relative to the selected polynucleotide probe. In general, allelic variants contain 
15-25% base pair mismatches, and can contain as little as even 5-15%, or 2-5%, or 1-2% 

30 base pair mismatches, as well as a single base-pair mismatch. 
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The invention also encompasses homologs corresponding to the polynucleotides of 

SEQ ID NOS: 1-844, where the source of homologous genes can be any mammalian species, 

e.g., primate species, particularly human; rodents, such as rats, canines, felines, bovines, 

ovines, equines, yeast, nematodes, etc. Between mammalian species, e.g., human and 

5 mouse, homologs have substantial sequence similarity, e.g., at least 75% sequence identity, 

usually at least 90%, more usually at least 95% between nucleotide sequences. Sequence 

similarity is calculated based on a reference sequence, which may be a subset of a larger 

sequence, such as a conserved motif, coding region, flanking region, etc. A reference 

sequence will usually be at least about 18 contiguous nt long, more usually at least about 30 

10 nt long, and may extend to the complete sequence that is being compared. Algorithms for 
sequence analysis are known in the art, such as BLAST, described in Altschul et aL, J. Mol 
Biol. (1990) 275:403-10. 

In general, variants of the invention have a sequence identity greater than at least 
about 65%, preferably at least about 75%, more preferably at least about 85%, and can be 

1 5 greater than at least about 90% or more as determined by the Smith- Waterman homology 
search algorithm as implemented in MPSRCH program (Oxford Molecular). For the 
purposes of this invention, a preferred method of calculating percent identity is the Smith- 
Waterman algorithm, using the following. Global DNA sequence identity must be greater 
than 65% as determined by the Smith- Waterman homology search algorithm as implemented 

20 in MPSRCH program (Oxford Molecular) using an affine gap search with the following 
search parameters: gap open penalty, 12; and gap extension penalty, 1. 

The subject nucleic acids can be cDNAs or genomic DNAs, as well as fragments 
thereof, particularly fragments that encode a biologically active gene product and/or are 
useful in the methods disclosed herein (e.g., in diagnosis, as a unique identifier of a 

25 differentially expressed gene of interest, etc.). The term "cDNA" as used herein is intended 
to include all nucleic acids that share the arrangement of sequence elements found in native 
mature mRNA species, where sequence elements are exons and 3 and 5 non-coding 
regions. Normally mRNA species have contiguous exons, with the intervening introns, 
when present, being removed by nuclear RNA splicing, to create a continuous open reading 

30 frame encoding a polypeptide of the invention. 
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A genomic sequence of interest comprises the nucleic acid present between the 

initiation codon and the stop codon, as defined in the listed sequences, including all of the 

introns that are normally present in a native chromosome. It can further include the 3 and 5 

untranslated regions found in the mature mRNA. It can further include specific 

5 transcriptional and translational regulatory sequences, such as promoters, enhancers, etc., 

including about 1 kb, but possibly more, of flanking genomic DNA at either the 5 and 3 end 

of the transcribed region. The genomic DNA can be isolated as a fragment of 1 00 kbp or 

smaller; and substantially free of flanking chromosomal sequence. The genomic DNA 

flanking the coding region, either 3 and 5 , or internal regulatory sequences as sometimes 

1 0 found in introns, contains sequences required for proper tissue, stage-specific, or disease- 
state specific expression. 

The nucleic acid compositions of the subject invention can encode all or a part of the 
subject differentially expressed polypeptides. Double or single stranded fragments can be 
obtained from the DNA sequence by chemically synthesizing oligonucleotides in accordance 

1 5 with conventional methods, by restriction enzyme digestion, by PCR amplification, etc. 
Isolated polynucleotides and polynucleotide fragments of the invention comprise at least 
about 10, about 15, about 20, about 35, about 50, about 100, about 150 to about 200, about 
250 to about 300, or about 350 contiguous nucleotides selected from the polynucleotide 
sequences as shown in SEQ ID NOS: 1-844. For the most part, fragments will be of at least 

20 1 5 nt, usually at least 1 8 nt or 25 nt, and up to at least about 50 contiguous nt in length or 
more. In a preferred embodiment, the polynucleotide molecules comprise a contiguous 
sequence of at least twelve nucleotides selected from the group consisting of the 
polynucleotides shown in SEQ ID NOS: 1-844. 

Probes specific to the polynucleotides of the invention can be generated using the 

25 polynucleotide sequences disclosed in SEQ ID NOS: 1 -844. The probes are preferably at 
least about 12, 15, 16, 18, 20, 22, 24, or 25 nucleotide fragment of a corresponding 
contiguous sequence of SEQ ID NOS: 1-844, and can be less than 2, 1, 0.5, 0.1, or 0.05 kb in 
length. The probes can be synthesized chemically or can be generated from longer 
polynucleotides using restriction enzymes. The probes can be labeled, for example, with a 

30 radioactive, biotinylated, or fluorescent tag. Preferably, probes are designed based upon an 
identifying sequence of a polynucleotide of one of SEQ ID NOS: 1-844. More preferably, 
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probes are designed based on a contiguous sequence of one of the subject polynucleotides 

that remain unmasked following application of a masking program for masking low 

complexity (e.g., XBLAST) to the sequence., Le. y one would select an unmasked region, as 

indicated by the polynucleotides outside the poly-n stretches of the masked sequence 

5 produced by the masking program. 

The polynucleotides of the subject invention are isolated and obtained in substantial 

purity, generally as other than an intact chromosome. Usually, the polynucleotides, either as 

DNA or RNA, will be obtained substantially free of other naturally-occurring nucleic acid 

sequences, generally being at least about 50%, usually at least about 90% pure and are 

1 0 typically "recombinant", e.g. , flanked by one or more nucleotides with which it is not 
normally associated on a naturally occurring chromosome. 

The polynucleotides of the invention can be provided as a linear molecule or within a 
circular molecule. They can be provided within autonomously replicating molecules 
(vectors) or within molecules without replication sequences. They can be regulated by their 

1 5 own or by other regulatory sequences, as is known in the art. The polynucleotides of the 
invention can be introduced into suitable host cells using a variety of techniques which are 
available in the art, such as transferrin polycation-mediated DNA transfer, transfection with 
naked or encapsulated nucleic acids, liposome-mediated DNA transfer, intracellular 
transportation of DNA-coated latex beads, protoplast fusion, viral infection, electroporation, 

20 gene gun, calcium phosphate-mediated transfection, and the like. 

The subject nucleic acid compositions can be used to, for example, produce 
polypeptides, as probes for the detection of mRNA of the invention in biological samples 
{e.g., extracts of human cells) to generate additional copies of the polynucleotides, to 
generate ribozymes or antisense oligonucleotides, and as single stranded DNA probes or as 

25 triple-strand forming oligonucleotides. The probes described herein can be used to, for 
example, determine the presence or absence of the polynucleotide sequences as shown in 
SEQ ID NOS: 1 -844 or variants thereof in a sample. These and other uses are described in 
more detail below. 
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Use of Polynucleotides to Obtain Full-Length cDNA and Full-Length Human Gene and 

Promoter Region 

Full-length cDNA molecules comprising the disclosed polynucleotides are obtained 
as follows. A polynucleotide having a sequence of one of SEQ ID NOS:l-844, or a portion 
5 thereof comprising at least 12, 1 5, 18, or 20 nucleotides, is used as a hybridization probe to 
detect hybridizing members of a cDNA library using probe design methods, cloning 
methods, and clone selection techniques such as those described in U.S. Patent No. 
5,654,173. Libraries of cDNA are made from selected tissues, such as normal or tumor 
tissue, or from tissues of a mammal treated with, for example, a pharmaceutical agent. 

10 Preferably, the tissue is the same as the tissue from which the polynucleotides of the 
invention were isolated, as both the polynucleotides described herein and the cDNA 
represent expressed genes. Most preferably, the cDNA library is made from the biological 
material described herein in the Examples. Alternatively, many cDNA libraries are available 
commercially. (Sambrook et al. 9 Molecular Cloning: A Laboratory Manual, 2nd Ed, (1 989) 

15 Cold Spring Harbor Press, Cold Spring Harbor, NY). The choice of cell type for library 

construction can be made after the identity of the protein encoded by the gene corresponding 
to the polynucleotide of the invention is known. This will indicate which tissue and cell 
types are likely to express the related gene, and thus represent a suitable source for the 
mRNA for generating the cDNA. Where the provided polynucleotides are isolated from 

20 cDNA libraries, the libraries are prepared from mRNA of human colon cells, more 

preferably, human colon cancer cells, even more preferably, from a highly metastatic colon 
cell, Kml2L4-A. 

Techniques for producing and probing nucleic acid sequence libraries are described, 
for example, in Sambrook et al, Molecular Cloning: A Laboratory Manual 2nd Ed, (1989) 
25 Cold Spring Harbor Press, Cold Spring Harbor, NY. The cDNA can be prepared by using 
primers based on sequence from SEQ ID NOS: 1-844. In one embodiment, the cDNA library 
can be made from only poly-adenylated mRNA. Thus, poly-T primers can be used to 
prepare cDNA from the mRNA. 

Members of the library that are larger than the provided polynucleotides, and 
30 preferably that encompass the complete coding sequence of the native message, are obtained. 
In order to confirm that the entire cDNA has been obtained, RNA protection experiments 

10 
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are performed as follows. Hybridization of a full-length cDNA to an mRNA will protect the 

RNA from RNase degradation. If the cDNA is not full length, then the portions of the 

mRNA that are not hybridized will be subject to RNase degradation. This is assayed, as is 

known in the art, by changes in electrophoretic mobility on polyacrylamide gels, or by 

5 detection of released monoribonucleotides. Sambrook et a/., Molecular Cloning: A 

Laboratory Manual, 2nd Ed y {\ 989) Cold Spring Harbor Press, Cold Spring Harbor, NY. In 

order to obtain additional sequences 5' to the end of a partial cDNA, 5' RACE (PCR 

Protocols: A Guide to Methods and Applications, (1990) Academic Press, Inc.) is performed. 

Genomic DNA is isolated using the provided polynucleotides in a manner similar to 

1 0 the isolation of full-length cDNAs. Briefly, the provided polynucleotides, or portions 

thereof, are used as probes to libraries of genomic DNA. Preferably, the library is obtained 
from the cell type that was used to generate the polynucleotides of the invention, but this is 
not essential. Most preferably, the genomic DNA is obtained from the biological material 
described herein in the Examples. Such libraries can be in vectors suitable for carrying large 

15 segments of a genome, such as PI or YAC, as described in detail in Sambrook et al % 9.4- 
9.30. In addition, genomic sequences can be isolated from human BAC libraries, which are 
commercially available from Research Genetics, Inc., Huntville, Alabama, USA, for 
example. In order to obtain additional 5 1 or 3' sequences, chromosome walking is performed, 
as described in Sambrook et al. 9 such that adjacent and overlapping fragments of genomic 

20 DNA are isolated. These are mapped and pieced together, as is known in the art, using 
restriction digestion enzymes and DNA ligase. 

Using the polynucleotide sequences of the invention, corresponding full-length genes 
can be isolated using both classical and PCR methods to construct and probe cDNA libraries. 
Using either method, Northern blots, preferably, are performed on a number of cell types to 

25 determine which cell lines express the gene of interest at the highest level. Classical 

methods of constructing cDNA libraries are taught in Sambrook et ai, supra. With these 
methods, cDNA can be produced from mRNA and inserted into viral or expression vectors. 
Typically, libraries of mRNA comprising poly(A) tails can be produced with poly(T) 
primers. Similarly, cDNA libraries can be produced using the instant sequences as primers. 

30 PCR methods are used to amplify the members of a cDNA library that comprise the 

desired insert. In this case, the desired insert will contain sequence from the full length 

11 
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cDNA that corresponds to the instant polynucleotides. Such PCR methods include gene 

trapping and RACE methods. Gene trapping entails inserting a member of a cDNA library 

into a vector. The vector then is denatured to produce single stranded molecules. Next, a 

substrate-bound probe, such a biotinylated oligo, is used to trap cDNA inserts of interest. 

5 Biotinylated probes can be linked to an avidin-bound solid substrate. PCR methods can be 

used to amplify the trapped cDNA. To trap sequences corresponding to the full length 

genes, the labeled probe sequence is based on the polynucleotide sequences of the invention. 

Random primers or primers specific to the library vector can be used to amplify the trapped 

cDNA. Such gene trapping techniques are described in Gruber et al, WO 95/04745 and 

10 Gruber et ai, U.S. Pat. No. 5,500,356. Kits are commercially available to perform gene 
trapping experiments from, for example, Life Technologies, Gaithersburg, Maryland, USA. 

"Rapid amplification of cDNA ends," or RACE, is a PCR method of amplifying 
cDNAs from a number of different RNAs. The cDNAs are ligated to an oligonucleotide 
linker, and amplified by PCR using two primers. One primer is based on sequence from the 

15 instant polynucleotides, for which full length sequence is desired, and a second primer 
comprises sequence that hybridizes to the oligonucleotide linker to amplify the cDNA. A 
description of this methods is reported in WO 97/191 10. In preferred embodiments of 
RACE, a common primer is designed to anneal to an arbitrary adaptor sequence ligated to 
cDNA ends (Apte and Siebert, Biotechniques (1993) 75:890-893; Edwards et aL, Nuc. Acids 

20 Res. (1 99 1 ) 79:5227-5232). When a single gene-specific RACE primer is paired with the 
common primer, preferential amplification of sequences between the single gene specific 
primer and the common primer occurs. Commercial cDNA pools modified for use in RACE 
are available. 

Another PCR-based method generates full-length cDNA library with anchored ends 
25 without needing specific knowledge of the cDNA sequence. The method uses lock-docking 
primers (I- VI), where one primer, poly TV (I-III) locks over the polyA tail of eukaryotic 
mRNA producing first strand synthesis and a second primer, polyGH (IV-VI) locks onto the 
polyC tail added by terminal deoxynucleotidyl transferase (TdT). This method is described 
in WO 96/40998. 

30 The promoter region of a gene generally is located 5' to the initiation site for RNA 

polymerase II. Hundreds of promoter regions contain the "TATA" box, a sequence such as 

12 
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TATTA or TATAA, which is sensitive to mutations. The promoter region can be obtained 

by performing 5' RACE using a primer from the coding region of the gene. Alternatively, 

the cDNA can be used as a probe for the genomic sequence, and the region 5' to the coding 

region is identified by "walking up." If the gene is highly expressed or differentially 

5 expressed, the promoter from the gene can be of use in a regulatory construct for a 

heterologous gene. 

Once the full-length cDNA or gene is obtained, DNA encoding variants can be 
prepared by site-directed mutagenesis, described in detail in Sambrook et aL, 15.3-15.63. 
The choice of codon or nucleotide to be replaced can be based on disclosure herein on 

10 optional changes in amino acids to achieve altered protein structure and/or function. 

As an alternative method to obtaining DNA or RNA from a biological material, 
nucleic acid comprising nucleotides having the sequence of one or more polynucleotides of 
the invention can be synthesized. Thus, the invention encompasses nucleic acid molecules 
ranging in length from 15 nucleotides (corresponding to at least 15 contiguous nucleotides of 

1 5 one of SEQ ID NOS: 1 -844) up to a maximum length suitable for one or more biological 
manipulations, including replication and expression, of the nucleic acid molecule. The 
invention includes but is not limited to (a) nucleic acid having the size of a full gene, and 
comprising at least one of SEQ ID NOS: 1-844; (b) the nucleic acid of (a) also comprising at 
least one additional gene, operably linked to permit expression of a fusion protein; (c) an 

20 expression vector comprising (a) or (b); (d) a plasmid comprising (a) or (b) ; and (e) a 
recombinant viral particle comprising (a) or (b). Once provided with the polynucleotides 
disclosed herein, construction or preparation of (a) - (e) are well within the skill in the art. 

The sequence of a nucleic acid comprising at least 15 contiguous nucleotides of at 
least any one of SEQ ID NOS: 1-844, preferably the entire sequence of at least any one of 

25 SEQ ID NOS: 1-844, is not limited and can be any sequence of A, T, G, and/or C (for DNA) 
and A, U, G, and/or C (for RNA) or modified bases thereof, including inosine and 
pseudouridine. The choice of sequence will depend on the desired function and can be 
dictated by coding regions desired, the intron-like regions desired, and the regulatory regions 
desired. Where the entire sequence of any one of SEQ ID NOS: 1-844 is within the nucleic 

30 acid, the nucleic acid obtained is referred to herein as a polynucleotide comprising the 
sequence of any one of SEQ ID NOS: 1-844. 
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II. Expression of Polypeptide Encoded bv Full-Length cDNA or Full-Length Gene 

The provided polynucleotide (e.g., a polynucleotide having a sequence of one of SEQ 
ID NOS: 1-844), the corresponding cDNA, or the full-length gene is used to express a partial 
5 or complete gene product. 

Constructs of polynucleotides having sequences of SEQ ID NOS: 1-844 can be 
generated synthetically. Alternatively, single-step assembly of a gene and entire plasmid 
from large numbers of oligodeoxyribonucleotides is described by, e.g., Stemmer et aL, Gene 
(Amsterdam) (1995) 164(l):49-53. In this method, assembly PCR (the synthesis of long 

10 DNA sequences from large numbers of oligodeoxyribonucleotides (oligos)) is described. 
The method is derived from DNA shuffling (Stemmer, Nature (1994) 570:389-391), and 
does not rely on DNA ligase, but instead relies on DNA polymerase to build increasingly 
longer DNA fragments during the assembly process. For example, a 1.1 -kb fragment 
containing the TEM-1 beta-lactamase-encoding gene (bla) can be assembled in a single 

15 reaction from a total of 56 oligos, each 40 nucleotides (nt) in length. The synthetic gene can 
be PCR amplified and cloned in a vector containing the tetracycline-resistance gene (Tc-R) 
as the sole selectable marker. Without relying on ampicillin (Ap) selection, 76% of the Tc-R 
colonies were Ap-R, making this approach a general method for the rapid and cost-effective 
synthesis of any gene. 

20 Appropriate polynucleotide constructs are purified using standard recombinant DNA 

techniques as described in, for example, Sambrook et aL, Molecular Cloning: A Laboratory 
Manual 2nd Ed, (1989) Cold Spring Harbor Press, Cold Spring Harbor, NY, and under 
current regulations described in United States Dept. of HHS, National Institute of Health 
(NIH) Guidelines for Recombinant DNA Research. The gene product encoded by a 

25 polynucleotide of the invention is expressed in any expression system, including, for 

example, bacterial, yeast, insect, amphibian and mammalian systems. Suitable vectors and 
host cells are described in U.S. Patent No. 5,654,173. 

Bacteria. Expression systems in bacteria include those described in Chang et aL, 
Nature (1978) 275:61 5; Goeddel et aL, Nature (1979) 257:544; Goeddel et aL, Nucleic Acids 

30 Res. (1980) 5:4057; EP 0 036,776; U.S. Patent No. 4,551,433; DeBoer et aL, Proc. Natl. 
Acad. Sci. (USA) (1983) 50:21-25; and Siebenlist et aL, Cell (1980) 20:269. 
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Yeast. Expression systems in yeast include those described in Hinnen et al, Proc. 

Natl. Acad. Sci. (USA) (1978) 75:1929; Ito et al.,J. Bacteriol. (1983) 755:163; Kurtz et al, 

Mol. Cell. Biol. (1986) 5:142; Kunze et al.,J. Basic Microbiol. (1985) 25:141; Gleeson et 

al.,J. Gen. Microbiol. (1986) 752:3459; Roggenkamp et al, Mol. Gen. Genet. (1986) 

5 202:302; Das et al., J. Bacteriol. (1984) 755:1 165; De Louvencourt et al, J. Bacteriol. 

(1983) 154:111; Van den Berg et al, Bio/Technology (1990) 5:135; Kunze et al, J. Basic 

Microbiol. (1985) 25:141; Cregg et al, Mol. Cell. Biol. (1985) 5:3376; U.S. Patent Nos. 

4,837,148 and 4,929,555; Beach and Nurse, Nature (1981) 300:106; Davidow et al, Curr. 

Genet. (1985) 70:380; Gaillardin et al, Curr. Genet. (1985) 70:49; Ballance et al, Biochem. 

10 Biophys. Res. Commun. (1983) 772:284-289; Tilburn etal, Gene (1983) 25:205-221; Yelton 
et al, Proc. Natl. Acad. Sci. (USA) (1984) 57:1470-1474; Kelly and Hynes, EMBOJ. (1985) 
4:415419; EP 0 244,234; and WO 91/00357. 

Insect Cells. Expression of heterologous genes in insects is accomplished as 
described in U.S. Patent No. 4,745,051; Friesen et al, "The Regulation of Baculovirus Gene 

1 5 Expression", in: The Molecular Biology Of Baculoviruses ( 1 986) (W. Doerfler, ed.); EP 0 
127,839; EP 0 155,476; and Vlak et al, J. Gen. Virol. (1988) 69:165-116; Miller et al, Ann. 
Rev. Microbiol. (1988) 42:177; Carbonell et al, Gene (1988) 75:409; Maeda et al, Nature 
(1985) 575:592-594; Lebacq-Verheyden et al., Mol. Cell. Biol (1988) 5:3129; Smith et al, 
Proc. Natl. Acad. Sci. (USA) (1985) 52:8844; Miyajima et al, Gene (1987) 55:273; and 

20 Martin et al, DNA (1988) 7:99. Numerous baculoviral strains and variants and 

corresponding permissive insect host cells from hosts are described in Luckow et al, 
Bio/Technology (1988) 5:47-55, Miller et al, Generic Engineering (1986) 5:277-279, and 
Maeda et al, Nature (1985) 575:592-594. 

Mammalian Cells. Mammalian expression is accomplished as described in Dijkema 

25 et al, EMBO J. (1985) 4:761, Gorman et al, Proc. Natl. Acad. Sci. (USA) (1982) 7P:6777, 
Boshart et al, Cell (1985) 47:521 and U.S. Patent No. 4,399,216. Other features of 
mammalian expression are facilitated as described in Ham and Wallace, Meth. Enz. (1979) 
55:44, Barnes and Sato, Anal. Biochem. (1980) 702:255, U.S. Patent Nos. 4,767,704, 
4,657,866, 4,927,762, 4,560,655, WO 90/103430, WO 87/00195, and U.S. RE 30,985. 

30 Polynucleotide molecules comprising a polynucleotide sequence provided herein 

propagated by placing the molecule in a vector. Viral and non-viral vectors are used, 
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including plasmids. The choice of plasmid will depend on the type of cell in which 

propagation is desired and the purpose of propagation. Certain vectors are useful for 

amplifying and making large amounts of the desired DNA sequence. Other vectors are 

suitable for expression in cells in culture. Still other vectors are suitable for transfer and 

5 expression in cells in a whole animal or person. The choice of appropriate vector is well 

within the skill of the art. Many such vectors are available commercially. The partial or 

full-length polynucleotide is inserted into a vector typically by means of DNA ligase 

attachment to a cleaved restriction enzyme site in the vector. Alternatively, the desired 

nucleotide sequence can be inserted by homologous recombination in vivo. Typically this is 

10 accomplished by attaching regions of homology to the vector on the flanks of the desired 
nucleotide sequence. Regions of homology are added by ligation of oligonucleotides, or by 
polymerase chain reaction using primers comprising both the region of homology and a 
portion of the desired nucleotide sequence, for example. 

The polynucleotides set forth in SEQ ID NOS: 1-844 or their corresponding full- 

15 length polynucleotides are linked to regulatory sequences as appropriate to obtain the desired 
expression properties. These can include promoters (attached either at the 5 1 end of the sense 
strand or at the 3' end of the antisense strand), enhancers, terminators, operators, repressors, 
and inducers. The promoters can be regulated or constitutive. In some situations it may be 
desirable to use conditionally active promoters, such as tissue-specific or developmental 

20 stage-specific promoters. These are linked to the desired nucleotide sequence using the 
techniques described above for linkage to vectors. Any techniques known in the art can be 
used. 

When any of the above host cells, or other appropriate host cells or organisms, are 
used to replicate and/or express the polynucleotides or nucleic acids of the invention, the 
25 resulting replicated nucleic acid, RNA, expressed protein or polypeptide, is within the scope 
of the invention as a product of the host cell or organism. The product is recovered by any 
appropriate means known in the art. 

Once the gene corresponding to a selected polynucleotide is identified, its expression 
can be regulated in the cell to which the gene is native. For example, an endogenous gene of 
30 a cell can be regulated by an exogenous regulatory sequence as disclosed in U.S. Patent No. 
5,641,670. 



WO 99/33982 PCT/US98/27610 

HI. Identification of Functional and Structural Motifs of Novel Genes 

A. Screening Polynucleotide Sequences and Amino Acid Sequences Against 
Publicly Available Databases 
5 Translations of the nucleotide sequence of the provided polynucleotides, cDNAs or 

full genes can be aligned with individual known sequences. Similarity with individual 
sequences can be used to determine the activity of the polypeptides encoded by the 
polynucleotides of the invention. For example, sequences that show similarity with a 
chemokine sequence can exhibit chemokine activities. Also, sequences exhibiting similarity 

10 with more than one individual sequence can exhibit activities that are characteristic of either 
or both individual sequences. 

The full length sequences and fragments of the polynucleotide sequences of the 
nearest neighbors can be used as probes and primers to identify and isolate the full length 
sequence corresponding to provided polynucleotides. The nearest neighbors can indicate a 

1 5 tissue or cell type to be used to construct a library for the full-length sequences 
corresponding to the provided polynucleotides.. 

Typically, a selected polynucleotide is translated in all six frames to determine the 
best alignment with the individual sequences. The sequences disclosed herein in the 
Sequence Listing are in a 5' to 3' orientation and translation in three frames can be sufficient 

20 (with a few specific exceptions as described in the Examples). These amino acid sequences 
are referred to, generally, as query sequences, which will be aligned with the individual 
sequences. Databases with individual sequences are described in "Computer Methods for 
Macromolecular Sequence Analysis" Methods in Enzymology (1996) 266 9 Doolittle, 
Academic Press, Inc., a division of Harcourt Brace & Co., San Diego, California, USA. 

25 Databases include Genbank, EMBL, and DNA Database of Japan (DDBJ). 

Query and individual sequences can be aligned using the methods and computer 
programs described above, and include BLAST, available over the world wide web at 
http://ww.ncbi.nlm.nih.gov/BLAST/ . Another alignment algorithm is Fasta, available in the 
Genetics Computing Group (GCG) package, Madison, Wisconsin, USA, a wholly owned 

30 subsidiary of Oxford Molecular Group, Inc. Other techniques for alignment are described in 
Doolittle, supra. Preferably, an alignment program that permits gaps in the sequence is 
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utilized to align the sequences. The Smith- Waterman is one type of algorithm that permits 

gaps in sequence alignments. See Meth Mol Biol (1997) 70: 173-187. Also, the GAP 

program using the Needleman and Wunsch alignment method can be utilized to align 

sequences. An alternative search strategy uses MPSRCH software, which runs on a 

5 MASPAR computer. MPSRCH uses a Smith- Waterman algorithm to score sequences on a 

massively parallel computer. This approach improves ability to identify sequences that are 

distantly related matches, and is especially tolerant of small gaps and nucleotide sequence 

errors. Amino acid sequences encoded by the provided polynucleotides can be used to 

search both protein and DNA databases. 

10 Results of individual and query sequence alignments can be divided into three 

categories, high similarity, weak similarity, and no similarity. Individual alignment results 
ranging from high similarity to weak similarity provide a basis for determining polypeptide 
activity and/or structure. Parameters for categorizing individual results include: percentage 
of the alignment region length where the strongest alignment is found, percent sequence 

1 5 identity, and p value. 

The percentage of the alignment region length is calculated by counting the number 
of residues of the individual sequence found in the region of strongest alignment, e.g., 
contiguous region of the individual sequence that contains the greatest number of residues 
that are identical to the residues of the corresponding region of the aligned query sequence. 

20 This number is divided by the total residue length of the query sequence to calculate a 

percentage. For example, a query sequence of 20 amino acid residues might be aligned with 
a 20 amino acid region of an individual sequence. The individual sequence might be 
identical to amino acid residues 5, 9-15, and 17-19 of the query sequence. The region of 
strongest alignment is thus the region stretching from residue 9-19, an 1 1 amino acid stretch. 

25 The percentage of the alignment region length is: 1 1 (length of the region of strongest 
alignment) divided by (query sequence length) 20 or 55%. 

Percent sequence identity is calculated by counting the number of amino acid 
matches between the query and individual sequence and dividing total number of matches by 
the number of residues of the individual sequences found in the region of strongest 

30 alignment. Thus, the percent identity in the example above would be 10 matches divided by 
1 1 amino acids, or approximately, 90.9% 
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P value is the probability that the alignment was produced by chance. For a single 

alignment, the p value can be calculated according to Karlin et al 9 Proc. Natl. Acad. Set 

(1990) 57:2264 and Karlin et al, Proc. Natl Acad. Sci. (1993) 90. The p value of multiple 

alignments using the same query sequence can be calculated using an heuristic approach 

5 described in Altschul et al 9 Nat. Genet. (1994) 6:1 19. Alignment programs such as BLAST 

program can calculate the p value. 

Another factor to consider for determining identity or similarity is the location of the 
similarity or identity. Strong local alignment can indicate similarity even if the length of 
alignment is short. Sequence identity scattered throughout the length of the query sequence 
10 also can indicate a similarity between the query and profile sequences. The boundaries of 
the region where the sequences align can be determined according to Doolittle, supra; 
BLAST or FAST programs; or by determining the area where sequence identity is highest. 

High Similarity. In general, in alignment results considered to be of high similarity, 
the percent of the alignment region length is typically at least about 55% of total length 
15 query sequence; more typically, at least about 58%; even more typically; at least about 60% 
of the total residue length of the query sequence. Usually, percent length of the alignment 
region can be as much as about 62%; more usually, as much as about 64%; even more 
usually, as much as about 66%. Further, for high similarity, the region of alignment, 
typically, exhibits at least about 75% of sequence identity; more typically, at least about 
20 78%; even more typically; at least about 80% sequence identity. Usually, percent sequence 
identity can be as much as about 82%; more usually, as much as about 84%; even more 
usually, as much as about 86%. 

The p value is used in conjunction with these methods. If high similarity is found, 
the query sequence is considered to have high similarity with a profile sequence when the p 
25 value is less than or equal to about 10 2 ; more usually; less than or equal to about 10 3 ; even 
more usually; less than or equal to about 10" 4 . More typically, the p value is no more than 
about 10* 5 ; more typically; no more than or equal to about 10' 10 ; even more typically; no 
more than or equal to about 10" 15 for the query sequence to be considered high similarity. 

Weak Similarity. In general, where alignment results considered to be of weak 
30 similarity, there is no minimum percent length of the alignment region nor minimum length 
of alignment. A better showing of weak similarity is considered when the region of 
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alignment is, typically, at least about 15 amino acid residues in length; more typically, at 

least about 20; even more typically; at least about 25 amino acid residues in length. Usually, 

length of the alignment region can be as much as about 30 amino acid residues; more 

usually, as much as about 40; even more usually, as much as about 60 amino acid residues. 

5 Further, for weak similarity, the region of alignment, typically, exhibits at least about 35% of 

sequence identity; more typically, at least about 40%; even more typically; at least about 

45% sequence identity. Usually, percent sequence identity can be as much as about 50%; 

more usually, as much as about 55%; even more usually, as much as about 60%. 

If low similarity is found, the query sequence is considered to have weak similarity 

1 0 with a profile sequence when the p value is usually less than or equal to about 1 0* 2 ; more 
usually; less than or equal to about 10" 3 ; even more usually; less than or equal to about 10^. 
More typically, the p value is no more than about 10" 5 ; more usually; no more than or equal 
to about 10* 10 ; even more usually; no more than or equal to about 10" 15 for the query sequence 
to be considered weak similarity. 

15 Similarity Determined by Sequence Identity Alone. Sequence identity alone can be 

used to determine similarity of a query sequence to an individual sequence and can indicate 
the activity of the sequence. Such an alignment, preferably, permits gaps to align sequences. 
Typically, the query sequence is related to the profile sequence if the sequence identity over 
the entire query sequence is at least about 15%; more typically, at least about 20%; even 

20 more typically, at least about 25%; even more typically, at least about 50%. Sequence 

identity alone as a measure of similarity is most useful when the query sequence is usually, 
at least 80 residues in length; more usually, 90 residues; even more usually, at least 95 amino 
acid residues in length. More typically, similarity can be concluded based on sequence 
identity alone when the query sequence is preferably 100 residues in length; more preferably, 

25 120 residues in length; even more preferably, 150 amino acid residues in length. 

Determining Activity from Alignments with Profile and Multiple Aligned Sequences. 
Translations of the provided polynucleotides can be aligned with amino acid profiles that 
define either protein families or common motifs. Also, translations of the provided 
polynucleotides can be aligned to multiple sequence alignments (MSA) comprising the 

30 polypeptide sequences of members of protein families or motifs. Similarity or identity with 
profile sequences or MS As can be used to determine the activity of the gene products (e.g., 
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polypeptides) encoded by the provided polynucleotides or corresponding cDNA or genes. 

For example, sequences that show an identity or similarity with a chemokine profile or MSA 

can exhibit chemokine activities. 

Profiles can designed manually by (1) creating an MSA, which is an alignment of the 

5 amino acid sequence of members that belong to the family and (2) constructing a statistical 

representation of the alignment. Such methods are described, for example, in Birney et aL, 

Nucl Acid Res. (1996) 24(14): 2730-2739. MSAs of some protein families and motifs are 

publicly available. For example, http://genome.wustl.edu/Pfam/ includes MSAs of 547 

different families and motifs. These MSAs are described also in Sonnhammer et aL, 

10 Proteins (1997) 28: 405-420. Other sources over the world wide web include the site at 
http://www.embl-heidelberg.de/argos/ali/ali.htm 1 : alternatively, a message can be sent to 
ALI@.EMBL-HEIDELBERG.DE for the information. A brief description of these MSAs is 
reported in Pascarella et aL, Prot. Eng. (1996) 9(3):249-25\. Techniques for building 
profiles from MSAs are described in Sonnhammer et aL, supra; Bimey et aL, supra; and 

15 "Computer Methods for Macromolecular Sequence Analysis," Methods in Enzymology 

(1996) 266, Doolittle, Academic Press, Inc., a division of Harcourt Brace & Co., San Diego, 
California, USA. 

Similarity between a query sequence and a protein family or motif can be determined 
by (a) comparing the query sequence against the profile and/or (b) aligning the query 
20 sequence with the members of the family or motif. Typically, a program such as Searchwise 
is used to compare the query sequence to the statistical representation of the multiple 
alignment, also known as a profile. The program is described in Birney et aL, supra. Other 
techniques to compare the sequence and profile are described in Sonnhammer et aL, supra 
and Doolittle, supra. 

25 Next, methods described by Feng et aL, J. Mol. Evol (1987) 25:351 and Higgins et 

aL, CABIOS (1989) 5:151 can be used align the query sequence with the members of a 
family or motif, also known as a MSA. Computer programs, such as PILEUP, can be used. 
See Feng et aL, infra. In general, the following factors are used to determine if a similarity 
between a query sequence and a profile or MSA exists: (1) number of conserved residues 

30 found in the query sequence, (2) percentage of conserved residues found in the query 
sequence, (3) number of frameshifts, and (4) spacing between conserved residues. 
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Some alignment programs that both translate and align sequences can make any 

number of frameshifts when translating the nucleotide sequence to produce the best 

alignment. The fewer frameshifts needed to produce an alignment, the stronger the 

similarity or identity between the query and profile or MSAs. For example, a weak 

5 similarity resulting from no frameshifts can be a better indication of activity or structure of a 

query sequence, than a strong similarity resulting from two frameshifts. Preferably, three or 

fewer frameshifts are found in an alignment; more preferably two or fewer frameshifts; even 

more preferably, one or fewer frameshifts; even more preferably, no frameshifts are found in 

an alignment of query and profile or MSAs. 

1 0 Conserved residues are those amino acids found at a particular position in all or some 

of the family or motif members. For example, most chemokines contain four conserved 
cysteines. Alternatively, a position is considered conserved if only a certain class of amino 
acids is found in a particular position in all or some of the family members. For example, 
the N-terminal position can contain a positively charged amino acid, such as lysine, arginine, 

1 5 or histidine. 

Typically, a residue of a polypeptide is conserved when a class of amino acids or a 

single amino acid is found at a particular position in at least about 40% of all class members; 

more typically, at least about 50%; even more typically, at least about 60% of the members. 

Usually, a residue is conserved when a class or single amino acid is found in at least about 
20 70% of the members of a family or motif; more usually, at least about 80%; even more 

usually, at least about 90%; even more usually, at least about 95%. 

A residue is considered conserved when three unrelated amino acids are found at a 

particular position in the some or all of the members; more usually, two unrelated amino 

acids. These residues are conserved when the unrelated amino acids are found at particular 
25 positions in at least about 40% of all class member; more typically, at least about 50%; even 

more typically, at least about 60% of the members. Usually, a residue is conserved when a 

class or single amino acid is found in at least about 70% of the members of a family or motif; 

more usually, at least about 80%; even more usually, at least about 90%; even more usually, 

at least about 95%. 

30 A query sequence has similarity to a profile or MSA when the query sequence 

comprises at least about 25% of the conserved residues of the profile or MSA; more usually, 
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at least about 30%; even more usually; at least about 40%. Typically, the query sequence 

has a stronger similarity to a profile sequence or MSA when the query sequence comprises at 

least about 45% of the conserved residues of the profile or MSA; more typically, at least 

about 50%; even more typically; at least about 55%. 

5 B. Screening Polynucleotide and Amino Acid Sequences Against Protein 

Profiles 

The identify and function of the gene that correlates to a polynucleotide described 
herein can be determined by screening the polynucleotides or their corresponding amino acid 
sequences against profiles of protein families. Such profiles focus on common structural 
10 motifs among proteins of each family. Publicly available profiles are described above in 
Section IVA. Additional or alternative profiles are described below. 

In comparing a novel polynucleotide with known sequences, several alignment tools 
are available. Examples include PileUp, which creates a multiple sequence alignment, and is 
described in Feng et al, J. Mol Evol (1987) 25:351. Another method, GAP, uses the 
15 alignment method of Needleman et al, J. Mol Biol (1970) 45:443. GAP is best suited for 
global alignment of sequences. A third method, BestFit, functions by inserting gaps to 
maximize the number of matches using the local homology algorithm of Smith et al , Adv. 
Appl Math (1981) 2:482. Exemplary protein profiles are provided below and in the 
examples. 

20 Chemokines. Chemokines are a family of proteins that have been implicated in 

lymphocyte trafficking, inflammatory diseases, angiogenesis, hematopoiesis, and viral 
infection. See, for example, Rollins, Blood (1997) 90(5J:9O9-928, and Wells et al, 1 Leuk 
Biol (1997) 67:545-550. U.S. Patent No. 5,605,817 discloses DNA encoding a chemokine 
expressed in fetal spleen. U.S. Patent No. 5,656,724 discloses chemokine-like proteins and 

25 methods of use. U.S. Patent No. 5,602,008 discloses DNA encoding a chemokine expressed 
by liver. 

Chemokine mutants are polypeptides having an amino acid sequence that possesses 
at least one amino acid substitution, addition, or deletion as compared to native chemokines. 
Fragments possess the same amino acid sequence of the native chemokines; mutants can 
30 lack the amino and/or carboxyl terminal sequences. Fusions are mutants, fragments, or 

native chemokines that also include amino and/or carboxyl terminal amino acid extensions. 
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The number or type of the amino acid changes is not critical, nor is the length or 

number of the amino acid deletions, or amino acid extensions that are incorporated in the 

chemokines as compared to the native chemokine amino acid sequences. A polynucleotide 

encoding one of these variant polypeptides will retain at least about 80% amino acid identity 

5 with at least one known chemokine. Preferably, these polypeptides will retain at least about 

85% amino acid sequence identity, more preferably, at least about 90%; even more 

preferably, at least about 95%. In addition, the variants exhibit at least 80%; preferably 

about 90%; more preferably about 95% of at least one activity exhibited by a native 

chemokine, which includes immunological, biological, receptor binding, and signal 

1 0 transduction functions. 

Assays for chemotaxis relating to neutrophils are described in Walz et al, Biochem. 

Biophys. Res. Commun. (1987) 149:755, Yoshimura et al, Proc. Natl. Acad. Sci. (USA) 

(1987) 54:9233, and Schroder et al, J. Immunol (1987) 759:3474; to lymphocytes, Larsen et 
al 9 Science (1989) 243:1464, Carr et aL, Proa Natl Acad. Set (USA) (1994) P7:3652; to 

15 tumor-infiltrating lymphocytes, Liao et al, J. Exp. Med (1995). 752:1301; to hematopoietic 
progenitors, Aiuti et al, J. Exp. Med. (1997) 755:1 1 1; to monocytes, Valente et al, Biochem. 

(1988) 27:4162; and to natural killer cells, Loetscher et aL, J. Immunol. (1996) 756:322, and 
Allavena et al, Eur. J. Immunol (1994) 24:3233. 

Assays for determining the biological activity of attracting eosinophils are described 
20 in Dahinden et al, J. Exp. Med (1994) 77P:751, Weber et al, J. Immunol (1995) 754:4166, 
and Noso et al, Biochem. Biophys. Res. Commun. (1 994) 200: 1470; for attracting dendritic 
cells, Sozzani et al, J. Immunol (1995) 755:3292; for attracting basophils, in Dahinden et 
al, J. Exp. Med (1994) 779:751, Alam et al, J. Immunol (1994) 752:1298, Alam et al, J. 
Exp. Med. (1992) 776:781; and for activating neutrophils, Maghazaci et al, Eur. J. Immunol 
25 (1996) 26:315, and Taubef al, J. Immunol (1995)755:3877. Native chemokines can act as 
mitogens for fibroblasts, assayed as described in Mullenbach et al, J. Biol Chem. (1986) 
267:719. 

Native chemokines exhibit binding activity with a number of receptors. Description 
of such receptors and assays to detect binding are described in, for example, Murphy et al, 
30 Science (1991) 255:1280; Combadiere et al, J. Biol Chem. (1995) 270:29671; Daugherty et 
al, J. Exp. Med (1996) 755:2349; Samson et al, Biochem. (1996) 55:3362; Raport et al, J. 
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Biol Chem. (1996) 277:17161; Combadiere etal,J. Leukoc. Biol (1996) 50:147; Babae/ 

al, J. Biol Chem. (1997) 25:14893; Yosida et al, J. Biol Chem. (1997) 272:13803; 

Arvannitakis et al, Nature (1997) 555:347, and other assays are known in the art. 

Assays for kinase activation of chemokines are described by Yen et aL, J. Leukoc. 

5 Biol (1997) 57:529; Dubois et al, J. Immunol (1996) 755:1356; Turner et al, J. Immunol 

(1995) 755:2437. Assays for inhibition of angiogenesis or cell proliferation are described in 
Maione et al 9 Science (1990) 247:11. Glycosaminoglycan production can be induced by 
native chemokines, assayed as described in Castor et al, Proc. Natl Acad. Sci. (USA) (1983) 
80:165. Chemokine-mediated histamine release from basophils is assayed as described in 

10 Dahinden etaL, J. Exp. Med. (1989) 770:1787; and White et al, Immunol Lett. (1989) 
22:151. Heparin binding is described in Luster et al, J. Exp. Med. (1995) 752:219. 

Chemokines can possess dimerization activity, which can be assayed according to 
Burrows etaL, Biochem. (1994) 55:12741; and Zhang etal t Mol Cell Biol (1995) 75:4851. 
Native chemokines can play a role in the inflammatory response of viruses. This activity 

1 5 can be assayed as described in Bleul et al, Nature (1996) 552:829; and Oberlin et al, Nature 

(1996) 552:833. Exocytosis of monocytes can be promoted by native chemokines. The 
assay for such activity is described in Uguccioni et al, Eur. J. Immunol (1995) 25:64. 
Native chemokines also can inhibit hematopoietic stem cell proliferation. The method for 
testing for such activity is reported in Graham et al, Nature (1990) 344:442. 

20 Death Domain Proteins. Several protein families contain death domain motifs 

(Feinstein and Kimchi, TIBS Letters (1995) 20:242). Some death domain containing 
proteins are implicated in cytotoxic intracellular signaling (Cleveland et al, Cell (1995) 
57:479, Pan et al, Science (1997) 276: 111; Duan et al, Nature (1997) 555:86-89, and 
Chinnaiyan et al, Science (1996) 274:990). U.S. Patent No. 5,563,039 describes a protein 

25 homologous to TRADD (Tumor Necrosis Factor Receptor- 1 Associated Death Domain 
containing protein), and modifications of the active domain of TRADD that retain the 
functional characteristics of the protein, as well as apoptosis assays for testing the function of 
such death domain containing proteins. U.S. Patent No. 5,658,883 discloses biologically 
active TGF-B1 peptides. U.S. Patent No. 5,674,734 discloses RIP, which contains a C- 

30 terminal death domain and an N-terminal kinase domain. 
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Leukemia Inhibitory Factor fLIFV An LIF profile is constructed from sequences of 

leukemia inhibitor factor, CT-1 (cardiotrophin-1), CNTF (ciliary neurotrophic factor), OSM 

(oncostatin M), and IL-6 (interleukin-6). This profile encompasses a family of secreted 

cytokines that have pleiotropic effects on many cell types including hepatocytes, osteoclasts, 

5 neuronal cells and cardiac myocytes, and can be used to detect additional genes encoding 

such proteins. These molecules are all structurally related and share a common co-receptor 

gpl30 which mediates intracellular signal transduction by cytoplasmic tyrosine kinases such 

as src. 

Novel proteins related to this family are also likely to be secreted, to activate gpl30 

1 0 and to function in the development of a variety of cell types. Thus new members of this 
family would be candidates to be developed as growth or survival factors for the cell types 
that they stimulate. For more details on this family of cytokines, see Pennica et al 9 Cytokine 
and Growth Factor Reviews (1996) 7:81-91. U.S. Patent No. 5,420,247 discloses LIF 
receptor and fusion proteins. U.S. Patent No. 5,443,825 discloses human LIF. 

15 Angiopoietin. Angiopoietin- 1 is a secreted ligand of the TIE-2 tyrosine kinase; it 

functions as an angiogenic factor critical for normal vascular development. Angiopoietin-2 is 
a natural antagonist of angiopoietin-1 and thus functions as an anti-angiogenic factor. These 
two proteins are structurally similar and activate the same receptor (Folkman et ai, Cell 
(1996) 57:1 153, and Davis et al 9 Cell (1996) 57:1161). The angiopoietin molecules are 

20 composed of two domains: a coiled-coil region and a region related to fibrinogen. The 
fibrinogen domain is found in many molecules including ficolin and tesascin, and is well 
defined structurally with many members. 

Receptor Protein-Tvrosine Kinases. Receptor Protein-Tyrosine Kinases or RPTKs 
are described in Lindberg, Annu. Rev. Cell Biol (1994) 70:251-337. 

25 Growth Factors: (Epidermal Growth Factor) EGF and (Fibroblast Growth Factor) 

FGF. For a discussion of growth factor superfamilies, see Growth Factors: A Practical 
Approach, (Appendix Al) (1993) McKay and Leigh, Oxford University Press, NY, 237-243. 
U.S. Patent No. 4,444,760 discloses acidic brain fibroblast growth factor, which is active in 
the promotion of cell division and wound healing. U.S. Patent No. 5,439,818 discloses DNA 

30 encoding human recombinant basic fibroblast growth factor, which is active in wound 
healing. U.S. Patent No. 5,604,293 discloses recombinant human basic fibroblast growth 
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factor, which is useful for wound healing. U.S. Patent No. 5,410,832 discloses brain-derived 

and recombinant acidic fibroblast growth factor, which act as mitogens for mesoderm and 

neuroectoderm-derived cells in culture, and promote wound healing in soft tissue, 

cartilaginous tissue and musculoskeletal tissue. U.S. Patent No. 5,387,673 discloses 

5 biologically active fragments of FGF. 

Proteins of the TNF Family. A profile derived from the TNF family is created by 

aligning sequences of the following TNF family members: nerve growth factor (NGF), 

lymphotoxin, Fas ligand, tumor necrosis factor (TNFa), CD40 ligand, TRAIL, ox40 ligand, 

4- IBB ligand, CD27 ligand, and CD30 ligand. The profile is designed to identify sequences 

1 0 of proteins that constitute new members or homologues of this family of proteins. U.S. 
Patent No. 5,606,023 discloses mutant TNF proteins; U.S. Patent No. 5,597,899 and U.S. 
Patent No. 5,486,463 disclose TNF muteins; and U.S. Patent No. 5,652,353 discloses DNA 
encoding TNFa muteins. 

Members of the TNF family of proteins have been show in vitro to multimerize, as 

15 described in Burrows et al, Biochem. (1994) 53:12741 and Zhang et al, Mol Cell Biol 
(1995) 75:4851 and bind receptors as described in Browning et al, J. Immunol (1994) 
747:1230, Androlewicz et al, J, Biol C7*em.(1992) 257:2542, and Crowe et al, Science 
(1994) 264:707. 

In vivo, TNFs proteolytically cleave a target protein as described in Kriegel et al, 
20 Cell (1988) 55:45 and Mohler et al, Nature (1994) 370:218 and demonstrate cell 

proliferation and differentiation activity. T-cell or thymocyte proliferation is assayed as 
described in Armitage et aL, Eur. 1 Immunol (1992) 22:447; Current Protocols in 
Immunology, ed. J.E. Coligan et al, 3.1-3.19; Takai et al, J. Immunol (1986) 737:3494- 
3500, Bertagnoli et al, J. Immunol (1990) 145:1706, Bertagnoli et al, J. Immunol (1991) 
25 733:327, Bertagnoli et al, J. Immunol (1992) 149:377%, and Bowman et al, J. Immunol 
(1994) 752:1756. B cell proliferation and Ig secretion are assayed as described in 
Maliszewski, J. Immunol (1990) 744:3028, and Assays for B Cell Function: In Vitro 
Antibody Production . Mond and Brunswick, Current Protocols in Immunol., Coligan Ed vol 
1 pp 3.8.1-3.8.16, John Wiley and Sons, Toronto 1994, Kehrl etal, Science (1987) 235:1 144 
30 and Boussiotis et al , PNAS USA (1 994) 91 :7007. Other in vivo activities include 

upregulation of cell surface antigens, upregulation of costimulatory molecules, and cellular 
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aggregation/adhesion as described in Barrett et al,J. Immunol (1991) 746:1722; Bjorck et 

al, Eur. J. Immunol (1993) 25:1771; Clark et al, Annu Rev. Immunol (1991) P:97; 
Ranheim et al, J. Exp. Med. (1994) 777:925; Yellin, J. Immunol. (1994) 755:666; and Gruss 
etal, Blood (1994) 54:2305. 
5 Proliferation and differentiation of hematopoietic and lymphopoietic cells has also 

been shown in vivo for TNFs, using assays for embryonic differentiation and hematopoiesis 
as described in Johansson et al, Cellular Biology (1995) 75:141, Keller et al, Mol Cell 
Biol (1993) 75:473, McClanahan et al, Blood (1993) 57:2903 and using assays to detect 
stem cell survival and differentiation as described in Culture of Hematopoietic Cells, 

10 Freshney et al eds, pp 1-21, 23-29, 139-162, 163-179, and 265-268, Wiley-Liss, Inc., New 
York, NY, 1994, and Hirajama et al, PNAS USA (1992) 5P:5907. 

In vivo activities of TNFs also include lymphocyte survival and apoptosis, assayed as 
described in Darzynkewicz et al, Cytometry (1992) 75:795; Gorczca et al, Leukemia (1993) 
7:659; Itoh et al, Cell (1991) 66:233; Zacharduk, J. Immunol (1990) 745:4037; Zamai et 

15 al, Cytometry (1993) 74:891; and Gorczyca et al, Int'U. Oncol (1992) 7:639. Some 
members of the TNF family are cleaved from the cell surface; others remain membrane 
bound. The three-dimensional structure of TNF is discussed in Sprang and Eck, Tumor 
Necrosis Factors; supra. 

TNF proteins include a transmembrane domain. The protein is cleaved into a shorter 

20 soluble version, as described in Kriegler et al, Cell (1988) 55:45, Perez et al, Cell (1990) 
55:251, and Shaw et al, Cell (1986) 45:659. The transmembrane domain is between amino 
acid 46 and 77 and the cytoplasmic domain is between position 1 and 45 on the human form 
of TNFoc. The 3-dimensional motifs of TNF include a sandwich of two pleated p sheets. 
Each sheet is composed of anti-parallel P strands. P strands facing each other on opposite 

25 sites of the sandwich are connected by short polypeptide loops, as described in Van Ostade et 
al, Protein Engineering (1994) 7(I):5, and Sprang et al, Tumor Necrosis Factors; supra. 
Residues of the TNF family proteins that are involved in the p sheet secondary structure 
have been identified as described in Van Ostade et al, Protein Eng. (1994) 7(1):5, and 
Sprang et al, supra. 

30 TNF receptors are disclosed in U.S. Patent No. 5,395,760. A profile derived from the 

TNF receptor family is created by aligning sequences of the TNF receptor family, including 
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Apol/Fas, TNFR I and II, death receptor 3 (DR3), CD40, ox40, CD27, and CD30. Thus, the 

profile is designed to identify from the polynucleotides of the invention sequences of 

proteins that constitute new members or homologues of this family of proteins. 

Tumor necrosis factor receptors exist in two forms in humans: p55 TNFR and p75 

5 TNFR, both of which provide intracellular signals upon binding with a ligand. The 

extracellular domains of these receptor proteins are cysteine rich. The receptors can remain 

membrane bound, although some forms of the receptors are cleaved forming soluble 

receptors. The regulation, diagnostic, prognostic, and therapeutic value of soluble TNF 

receptors is discussed in Aderka, Cytokine and Growth Factor Reviews, (1996) 7(3) :23l. 

10 PDGF Family. U.S. Patent No. 5,326,695 discloses platelet derived growth factor 

agonists; bioactive portions of PDGF-B are used as agonists. U.S. Patent No. 4,845,075 
discloses biologically active B-chain homodimers, and also includes variants and derivatives 
of the PDGF-B chain. U.S. Patent No. 5,128,321 discloses PDGF analogs and methods of 
use. Proteins having the same bioactivity as PDGF are disclosed, including A and B chain 

15 proteins. 

Kinase (Including MKK) Family. U.S. Patent No. 5,650,501 discloses 
serine/threonine kinase, associated with mitotic and meiotic cell division; the protein has a 
kinase domain in its N-terminal and 3 PEST regions in the C-terminus. U.S. Patent No. 
5,605,825 discloses human PAK65, a serine protein kinase. 

20 The foregoing discussion provides a few examples of the protein profiles that can be 

compared with the polynucleotides of the invention. One skilled in the art can use these and 
other protein profiles to identify the genes that correlate with the provided polynucleotides. 
C. Identification of Secreted & Membrane-Bound Polypeptides 
Both secreted and membrane-bound polypeptides of the present invention are of 

25 particular interest. For example, levels of secreted polypeptides can be assayed in body 

fluids that are convenient, such as blood, urine, prostatic fluid and semen. Membrane-bound 
polypeptides are useful for constructing vaccine antigens or inducing an immune response. 
Such antigens would comprise all or part of the extracellular region of the membrane-bound 
polypeptides. Because both secreted and membrane-bound polypeptides comprise a 

30 fragment of contiguous hydrophobic amino acids, hydrophobicity predicting algorithms can 
be used to identify such polypeptides. 
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A signal sequence is usually encoded by both secreted and membrane-bound 

polypeptide genes to direct a polypeptide to the surface of the cell. The signal sequence 

usually comprises a stretch of hydrophobic residues. Such signal sequences can fold into 

helical structures. Membrane-bound polypeptides typically comprise at least one 

5 transmembrane region that possesses a stretch of hydrophobic amino acids that can 

transverse the membrane. Some transmembrane regions also exhibit a helical structure. 

Hydrophobic fragments within a polypeptide can be identified by using computer 

algorithms. Such algorithms include Hopp & Woods A Proc. Natl Acad. Sci. USA (1981) 

75:3824-3828; Kyte & Doolittle, J. Mol Biol. (1982) 157: 105-132; and RAOAR algorithm, 

10 Degli Esposti et al y Eur. 1 Biochem. (1990) 190: 207-219. 

Another method of identifying secreted and membrane-bound polypeptides is to 
translate the polynucleotides of the invention in all six frames and determine if at least 8 
contiguous hydrophobic amino acids are present. Those translated polypeptides with at least 
8; more typically, 10; even more typically, 12 contiguous hydrophobic amino acids are 

1 5 considered to be either a putative secreted or membrane bound polypeptide. Hydrophobic 
amino acids include alanine, glycine, histidine, isoleucine, leucine, lysine, methionine, 
phenylalanine, proline, threonine, tryptophan, tyrosine, and valine. 

IV. Identification of the Function of an Expression Product of a Full-Length Gene 

20 Corresponding to a Polynucleotide 

Ribozymes, antisense constructs, and dominant negative mutants can be used to 
determine function of the expression product of a gene corresponding to a polynucleotide 
provided herein. These methods and compositions are particularly useful where the provided 
novel polynucleotide exhibits no significant or substantial homology to a sequence encoding 

25 a gene of known function. Antisense molecules and ribozymes can be constructed from 
synthetic polynucleotides. Typically, the phosphoramidite method of oligonucleotide 
synthesis is used. See Beaucage et aL, Tet. Lett. (1981) 22:1859 and U.S. Patent No. 
4,668,777. Automated devices for synthesis are available to create oligonucleotides using 
this chemistry. Examples of such devices include Biosearch 8600, Models 392 and 394 by 

30 Applied Biosystems, a division of Perkin-Elmer Corp., Foster City, California, USA; and 
Expedite by Perceptive Biosystems, Framingham, Massachusetts, USA. Synthetic RNA, 
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phosphate analog oligonucleotides, and chemically derivatized oligonucleotides can also be 

produced, and can be covalently attached to other molecules. RNA oligonucleotides can be 

synthesized, for example, using RNA phosphoramidites. This method can be performed on 

an automated synthesizer, such as Applied Biosystems, Models 392 and 394, Foster City, 

5 California, USA. See Applied Biosystems User Bulletin 53 and Ogilvie et aL, Pure & 

Applied Chem. (1987) 59:325. 

Phosphorothioate oligonucleotides can also be synthesized for antisense construction. 
A sulfurizing reagent, such as tetraethylthimam disulfide (TETD) in acetonitrile can be used 
to convert the internucleotide cyanoethyl phosphite to the phosphorothioate triester within 15 
10 minutes at room temperature. TETD replaces the iodine reagent, while all other reagents 
used for standard phosphoramidite chemistry remain the same. Such a synthesis method can 
be automated using Models 392 and 394 by Applied Biosystems, for example. 

Oligonucleotides of up to 200 nucleotides can be synthesized, more typically, 100 
nucleotides, more typically 50 nucleotides; even more typically 30 to 40 nucleotides. These 
1 5 synthetic fragments can be annealed and ligated together to construct larger fragments. See, 
for example, Sambrook et al. 9 supra. 

A. Ribozvmes 

Trans-cleaving catalytic RNAs (ribozymes) are RNA molecules possessing 
endoribonuclease activity. Ribozymes are specifically designed for a particular target, and 

20 the target message must contain a specific nucleotide sequence. They are engineered to 
cleave any RNA species site-specifically in the background of cellular RNA. The cleavage 
event renders the mRNA unstable and prevents protein expression. Importantly, ribozymes 
can be used to inhibit expression of a gene of unknown function for the purpose of 
determining its function in an in vitro or in vivo context, by detecting the phenotypic effect. 

25 One commonly used ribozyme motif is the hammerhead, for which the substrate 

sequence requirements are minimal. Design of the hammerhead ribozyme is disclosed in 
Usman et aL, Current Opin. Struct. Biol (1996) 5:527. Usman also discusses the 
therapeutic uses of ribozymes. Ribozymes can also be prepared and used as described in 
Long et al t FASEB J. (1993) 7:25; Symons, Ann. Rev. Biochem. (1992) 57:641 ; Perrotta et 

30 al., Biochem. (1992) 57:16; Ojwang etal, Proc. Natl Acad. Set (USA) (1992) SP:10802; 
and U.S. Patent No. 5,254,678. Ribozyme cleavage of HIV-I RNA is described in U.S. 
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Patent No. 5,144,019; methods of cleaving RNA using ribozymes is described in U.S. 

Patent No. 5,1 16,742; and methods for increasing the specificity of ribozymes are described 

in U.S. Patent No. 5,225,337 and Koizumi et aL, Nucleic Acid Res. (1989) 77:7059. 

Preparation and use of ribozyme fragments in a hammerhead structure are also described by 

5 Koizumi et aL, Nucleic Acids Res. (1 989) 1 7:7059. Preparation and use of ribozyme 

fragments in a hairpin structure are described by Chowrira and Burke, Nucleic Acids Res. 

(1992) 20:2835. Ribozymes can also be made by rolling transcription as described in 

Daubendiek and Kool, Nat. Biotechnol (1997) 15(3):273. 

The hybridizing region of the ribozyme can be modified or can be prepared as a 

10 branched structure as described in Horn and Urdea, Nucleic Acids Res. (1989) 1 7:6959. The 
basic structure of the ribozymes can also be chemically altered in ways familiar to those 
skilled in the art, and chemically synthesized ribozymes can be administered as synthetic 
oligonucleotide derivatives modified by monomelic units. In a therapeutic context, liposome 
mediated delivery of ribozymes improves cellular uptake, as described in Birikh et al. y Eur. 

15 J. Biochem. (1997) 245:1. 

Using the polynucleotide sequences of the invention and methods known in the art, 
ribozymes are designed to specifically bind and cut the corresponding mRNA species. 
Ribozymes thus provide a means to inhibit the expression of any of the proteins encoded by 
the disclosed polynucleotides or their full-length genes. The full-length gene need not be 

20 known in order to design and use specific inhibitory ribozymes. In the case of a 

polynucleotide or full-length cDNA of unknown function, ribozymes corresponding to that 
nucleotide sequence can be tested in vitro for efficacy in cleaving the target transcript. 
Those ribozymes that effect cleavage in vitro are further tested in vivo. The ribozyme can 
also be used to generate an animal model for a disease, as described in Birikh et aL, supra. 

25 An effective ribozyme is used to determine the function of the gene of interest by blocking 
its transcription and detecting a change in the cell. Where the gene is found to be a mediator 
in a disease, an effective ribozyme is designed and delivered in a gene therapy for blocking 
transcription and expression of the gene. 

Therapeutic and functional genomic applications of ribozymes proceed beginning 

30 with knowledge of a portion of the coding sequence of the gene to be inhibited. Thus, for 
many genes, a partial polynucleotide sequence provides adequate sequence for constructing 
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an effective ribozyme. A target cleavage site is selected in the target sequence, and a 

ribozyme is constructed based on the 5' and 3' nucleotide sequences that flank the cleavage 

site. Retroviral vectors are engineered to express monomeric and multimeric hammerhead 

ribozymes targeting the mRNA of the target coding sequence. These monomeric and 

5 multimeric ribozymes are tested in vitro for an ability to cleave the target mRNA. A cell line 

is stably transduced with the retroviral vectors expressing the ribozymes, and the 

transduction is confirmed by Northern blot analysis and reverse-transcription polymerase 

chain reaction (RT-PCR). The cells are screened for inactivation of the target mRNA by 

such indicators as reduction of expression of disease markers or reduction of the gene 

1 0 product of the target mRNA. 

B. Antisense 

Antisense nucleic acids are designed to specifically bind to RNA, resulting in the 
formation of RNA-DNA or RNA-RNA hybrids, with an arrest of DNA replication, reverse 
transcription or messenger RNA translation. Antisense polynucleotides based on a selected 

1 5 polynucleotide sequence can interfere with expression of the corresponding gene. Antisense 
polynucleotides are typically generated within the cell by expression from antisense 
constructs that contain the antisense strand as the transcribed strand. Antisense 
polynucleotides based on the disclosed polynucleotides will bind and/or interfere with the 
translation of mRNA comprising a sequence complementary to the antisense polynucleotide. 

20 The expression products of control cells and cells treated with the antisense construct are 
compared to detect the protein product of the gene corresponding to the polynucleotide upon 
which the antisense construct is based. The protein is isolated and identified using routine 
biochemical methods. 

One rationale for using antisense methods to determine the function of the gene 

25 corresponding to a disclosed polynucleotide is the biological activity of antisense 

therapeutics. Antisense therapy for a variety of cancers is in clinical phase and has been 
discussed extensively in the literature. Reed reviewed antisense therapy directed at the Bcl-2 
gene in tumors; gene transfer-mediated overexpression of Bcl-2 in tumor cell lines conferred 
resistance to many types of cancer drugs. (Reed, J.C., N.C.L (1997)59:988). The potential 

30 for clinical development of antisense inhibitors of ras is discussed by Cowsert, L.M., Anti- 
Cancer Drug Design (1997) 72:359. Additional important antisense targets include 
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leukemia (Geurtz, A.M., Anti-Cancer Drug Design (1997) 72:341); human C-ref kinase 

(Monia, B.P., Anti-Cancer Drug Design (1997) 72:327); and protein kinase C (McGraw et 

aL, Anti-Cancer Drug Design (1997) 72:315. 

Given the extensive background literature and clinical experience in antisense 

5 therapy, one skilled in the art can use selected polynucleotides of the invention as additional 

potential therapeutics. The choice of polynucleotide can be narrowed by first testing them 

for binding to "hot spot" regions of the genome of cancerous cells. If a polynucleotide is 

identified as binding to a "hot spot", testing the polynucleotide as an antisense compound in 

the corresponding cancer cells clearly is warranted. 

10 Ogunbiyi et aL, Gastroenterology (1997) 113(3):76l describe prognostic use of 

allelic loss in colon cancer; Barks et aL, Genes, Chromosomes, and Cancer (1997) 19(4)\21% 
describe increased chromosome copy number detected by FISH in malignant melanoma; 
Nishizake et aL, Genes, Chromosomes, and Cancer (1997) 19(4):261 describe genetic 
alterations in primary breast cancer and their metastases and direct comparison using 

1 5 modified comparative genome hybridization; and Elo et aL, Cancer Research (1997) 

57(16):3356 disclose that loss of heterozygosity at 16z24.1-q24.2 is significantly associated 
with metastatic and aggressive behavior of prostate cancer. 
C. Dominant Negative Mutations 

As an alternative method for identifying function of the gene corresponding to a 
20 polynucleotide disclosed herein, dominant negative mutations are readily generated for 

corresponding proteins that are active as homomultimers. A mutant polypeptide will interact 
with wild-type polypeptides (made from the other allele) and form a non-functional 
multimer. Thus, a mutation is in a substrate-binding domain, a catalytic domain, or a 
cellular localization domain. Preferably, the mutant polypeptide will be overproduced. 
25 Point mutations are made that have such an effect. In addition, fusion of different 

polypeptides of various lengths to the terminus of a protein can yield dominant negative 
mutants. General strategies are available for making dominant negative mutants (see, e.g, 
Herskowitz, Nature (1987) 329:219). Such techniques can be used to create loss of function 
mutations, which are useful for determining protein function. 

30 



34 



WO 99/33982 PCT/US98/2761 0 

V. Construction of Polypeptides of the Invention and Variants Thereof 

The polypeptides of the invention include those encoded by the disclosed 

polynucleotides. These polypeptides can also be encoded by nucleic acids that, by virtue of 

the degeneracy of the genetic code, are not identical in sequence to the disclosed 

5 polynucleotides. Thus, the invention includes within its scope a polypeptide encoded by a 

polynucleotide having the sequence of any one of SEQ ID NOS: 1-844 or a variant thereof. 

In general, the term "polypeptide" as used herein refers to both the full length 

polypeptide encoded by the recited polynucleotide, the polypeptide encoded by the gene 

represented by the recited polynucleotide, as well as portions or fragments thereof. 

10 "Polypeptides" also includes variants of the naturally occurring proteins, where such 

variants are homologous or substantially similar to the naturally occurring protein, and can 
be of an origin of the same or different species as the naturally occurring protein (e.g., 
human, murine, or some other species that naturally expresses the recited polypeptide, 
usually a mammalian species). In general, variant polypeptides have a sequence that has at 

1 5 least about 80%, usually at least about 90%, and more usually at least about 98% sequence 
identity with a differentially expressed polypeptide of the invention, as measured by BLAST 
using the parameters described above. The variant polypeptides can be naturally or non- 
naturally glycosylated, i.e., the polypeptide has a glycosylation pattern that differs from the 
glycosylation pattern found in the corresponding naturally occurring protein. 

20 The invention also encompasses homologs of the disclosed polypeptides (or 

fragments thereof) where the homologs are isolated from other species, i.e. other animal or 
plant species, where such homologs, usually mammalian species, e.g. rodents, such as mice, 
rats; domestic animals, e.g., horse, cow, dog, cat; and humans. By homolog is meant a 
polypeptide having at least about 35%, usually at least about 40% and more usually at least 

25 about 60% amino acid sequence identity a particular differentially expressed protein as 
identified above, where sequence identity is determined using the BLAST algorithm, with 
the parameters described supra. 

In general, the polypeptides of the subject invention are provided in a non-naturally 
occurring environment, e.g. are separated from their naturally occurring environment. In 

30 certain embodiments, the subject protein is present in a composition that is enriched for the 
protein as compared to a control. As such, purified polypeptide is provided, where by 



35 



WO 99/33982 PCT/US98/2761 0 

purified is meant that the protein is present in a composition that is substantially free of non- 

differentially expressed polypeptides, where by substantially free is meant that less than 

90%, usually less than 60% and more usually less than 50% of the composition is made up 

of non-differentially expressed polypeptides. 

5 Also within the scope of the invention are variants; variants of polypeptides include 

mutants, fragments, and fusions. Mutants can include amino acid substitutions, additions or 

deletions. The amino acid substitutions can be conservative amino acid substitutions or 

substitutions to eliminate non-essential amino acids, such as to alter a glycosylation site, a 

phosphorylation site or an acetylation site, or to minimize misfolding by substitution or 

10 deletion of one or more cysteine residues that are not necessary for function. Conservative 
amino acid substitutions are those that preserve the general charge, 
hydrophobicity/hydrophilicity, and/or steric bulk of the amino acid substituted. For 
example, substitutions between the following groups are conservative: Gly/Ala, Val/Ile/Leu, 
Asp/Glu, Lys/Arg, Asn/Gln, Ser/Cys, Thr, and Phe/Trp/Tyr. 

1 5 Variants can be designed so as to retain biological activity of a particular region of 

the protein (e.g., a functional domain and/or, where the polypeptide is a member of a protein 
family, a region associated with a consensus sequence). In a non-limiting example, Osawa et 
al,Biochem. Mol Int. (1994) 54:1003, discusses the actin binding region of a protein from 
several different species. The actin binding regions of the these species are considered 

20 homologous based on the fact that they have amino acids that fall within "homologous 

residue groups." Homologous residues are judged according to the following groups (using 
single letter amino acid designations): STAG; ILVMF; HRK; DEQN; and FYW. For 
example, and S, a T, an A or a G can be in a position and the function (in this case actin 
binding) is retained. 

25 Additional guidance on amino acid substitution is available from studies of protein 

evolution. Go et al, Int. J. Peptide Protein Res. (1980) 75:21 1, classified amino acid residue 
sites as interior or exterior depending on their accessibility. More frequent substitution on 
exterior sites was confirmed to be general in eight sets of homologous protein families 
regardless of their biological functions and the presence or absence of a prosthetic group. 

30 Virtually all types of amino acid residues had higher mutabilities on the exterior than in the 
interior. No correlation between mutability and polarity was observed of amino acid 
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residues in the interior and exterior, respectively. Amino acid residues were classified into 

one of three groups depending on their polarity: polar (Arg, Lys, His, Gin, Asn, Asp, and 

Glu); weak polar (Ala, Pro, Gly, Thr, and Ser), and nonpolar (Cys, Val, Met, He, Leu, Phe, 

Tyr, and Trp). Amino acid replacements during protein evolution were very conservative: 

5 88% and 76% of them in the interior or exterior, respectively, were within the same group of 

the three. Inter-group replacements are such that weak polar residues are replaced more 

often by nonpolar residues in the interior and more often by polar residues on the exterior. 

Additional guidance for production of polypeptide variants is provided in Querol et 

aL, Prot. Eng. (1996) 9:265, which provides general rules for amino acid substitutions to 

10 enhance protein thermostability. New glycosylation sites can be introduced as discussed in 
Olsen and Thomsen, J. Gen. Microbiol (1991) 737:579. An additional disulfide bridge can 
be introduced, as discussed by Perry and Wetzel, Science (1984) 226:555; Pantoliano et aL, 
Biochemistry (1987) 26:2077; Matsumura et aL, Nature (1989) 342:291; Nishikawa et aL, 
Protein Eng. (1990) 3:443; Takagi et aL, J. Biol Chem. (1990) 265:6874; Clarke et aL, 

15 Biochemistry (1993) 32:4322; and Wakarchuk et aL, Protein Eng. (1994) 7:1379. Metal 
binding sites can be introduced, according to Toma et aL, Biochemistry (1991) 30:97, and 
Haezerbrouck et aL, Protein Eng. (1993) 6:643. Substitutions with prolines in loops can be 
made according to Masul et aL, Appl. Env. Microbiol. (1994) 60:3579; and Hardy et aL, 
FEBSLett. 377:89. 

20 Cysteine-depleted muteins are considered variants within the scope of the invention. 

These variants can be constructed according to methods disclosed in U.S. Patent No. 
4,959,314, which discloses substitution of cysteines with other amino acids, and methods 
for assaying biological activity and effect of the substitution. Such methods are suitable for 
proteins according to this invention that have cysteine residues suitable for such 

25 substitutions, for example to eliminate disulfide bond formation. 

Variants also include fragments of the polypeptides disclosed herein, particularly 
biologically active fragments and/or fragments corresponding to functional domains. 
Fragments of interest will typically be at least about 10 aa to at least about 15 aa in length, 
usually at least about 50 aa in length, and can be as long as 300 aa in length or longer, but 

30 will usually not exceed about 1 000 aa in length, where the fragment will have a stretch of 
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amino acids that is identical to a polypeptide encoded by a polynucleotide having a 

sequence of any SEQ ID NOS: 1-844, or a homolog thereof. 

The protein variants described herein are encoded by polynucleotides that are within 

the scope of the invention. The genetic code can be used to select the appropriate codons to 

5 construct the corresponding variants. 

VI. Computer-Related Embodiments 

In general, a library of polynucleotides is a collection of sequence information, which 
information is provided in either biochemical form (e.g., as a collection of polynucleotide 

1 0 molecules), or in electronic form (e.g. , as a collection of polynucleotide sequences stored in a 
computer-readable form, as in a computer system and/or as part of a computer program). 
The sequence information of the polynucleotides can be used in a variety of ways, e.g., as a 
resource for gene discovery, as a representation of sequences expressed in a selected cell 
type (e.g., cell type markers), and/or as markers of a given disease or disease state. In 

1 5 general, a disease marker is a representation of a gene product that is present in all affected 
by disease either at an increased or decreased level relative to a normal cell (e.g., a cell of the 
same or similar type that is not substantially affected by disease). For example, a 
polynucleotide sequence in a library can be a polynucleotide that represents an mRNA, 
polypeptide, or other gene product encoded by the polynucleotide, that is either 

20 overexpressed or underexpressed in a breast ductal cell affected by cancer relative to a 
normal (i.e., substantially disease-free) breast cell. 

The nucleotide sequence information of the library can be embodied in any suitable 
form, e.g., electronic or biochemical forms. For example, a library of sequence information 
embodied in electronic form includes an accessible computer data file (or, in biochemical 

25 form, a collection of nucleic acid molecules) that contains the representative nucleotide 

sequences of genes that are differentially expressed (e.g., overexpressed or underexpressed) 
as between, for example, i) a cancerous cell and a normal cell; ii) a cancerous cell and a 
dysplastic cell; iii) a cancerous cell and a cell affected by a disease or condition other than 
cancer; iv) a metastatic cancerous cell and a normal cell and/or non-metastatic cancerous 

30 cell; v) a malignant cancerous cell and a non-malignant cancerous cell (or a normal cell) 

and/or vi) a dysplastic cell relative to a normal cell. Other combinations and comparisons of 
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cells affected by various diseases or stages of disease will be readily apparent to the 

ordinarily skilled artisan. Biochemical embodiments of the library include a collection of 

nucleic acids that have the sequences of the genes in the library, where the nucleic acids can 

correspond to the entire gene in the library or to a fragment thereof, as described in greater 

5 detail below. 

The polynucleotide libraries of the subject invention include sequence information of 
a plurality of polynucleotide sequences, where at least one of the polynucleotides has a 
sequence of any of SEQ ID NOS: 1-844. By plurality is meant at least 2, usually at least 3 
and can include up to all of SEQ ID NOS: 1-844. The length and number of polynucleotides 

10 in the library will vary with the nature of the library, e.g., if the library is an oligonucleotide 
array, a cDNA array, a computer database of the sequence information, etc. 

Where the library is an electronic library, the nucleic acid sequence information can 
be present in a variety of media. "Media" refers to a manufacture, other than an isolated 
nucleic acid molecule, that contains the sequence information of the present invention. Such 

1 5 a manufacture provides the genome sequence or a subset thereof in a form that can be 

examined by means not directly applicable to the sequence as it exists in a nucleic acid. For 
example, the nucleotide sequence of the present invention, e.g. the nucleic acid sequences of 
any of the polynucleotides of SEQ ID NOS: 1-844, can be recorded on computer readable 
media, e.g. any medium that can be read and accessed directly by a computer. Such media 

20 include, but are not limited to: magnetic storage media, such as a floppy disc, a hard disc 
storage medium, and a magnetic tape; optical storage media such as CD-ROM; electrical 
storage media such as RAM and ROM; and hybrids of these categories such as 
magnetic/optical storage media. One of skill in the art can readily appreciate how any of the 
presently known computer readable mediums can be used to create a manufacture 

25 comprising a recording of the present sequence information. "Recorded" refers to a process 
for storing information on computer readable medium, using any such methods as known in 
the art. Any convenient data storage structure can be chosen, based on the means used to 
access the stored information. A variety of data processor programs and formats can be used 
for storage, e.g. word processing text file, database format, etc. In addition to the sequence 

30 information, electronic versions of the libraries of the invention can be provided in 

conjunction or connection with other computer-readable information and/or other types of 
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computer-readable files (e.g., searchable files, executable files, etc, including, but not limited 

to, for example, search program software, etc.). 

By providing the nucleotide sequence in computer readable form, the information can 

be accessed for a variety of purposes. Computer software to access sequence information is 

5 publicly available. For example, the BLAST (Altschul et al. 9 supra.) and BLAZE (Brutlag 

et al Comp. Chem. (1993) 17:203) search algorithms on a Sybase system can be used 

identify open reading frames (ORFs) within the genome that contain homology to ORFs 

from other organisms. 

As used herein, "a computer-based system" refers to the hardware means, software 

10 means, and data storage means used to analyze the nucleotide sequence information of the 
present invention. The minimum hardware of the computer-based systems of the present 
invention comprises a central processing unit (CPU), input means, output means, and data 
storage means. A skilled artisan can readily appreciate that any one of the currently 
available computer-based system are suitable for use in the present invention. The data 

15 storage means can comprise any manufacture comprising a recording of the present sequence 
information as described above, or a memory access means that can access such a 
manufacture. 

"Search means" refers to one or more programs implemented on the computer-based 
system, to compare a target sequence or target structural motif with the stored sequence 

20 information. Search means are used to identify fragments or regions of the genome that 
match a particular target sequence or target motif. A variety of known algorithms are 
publicly known and commercially available, e.g. MacPattern (EMBL), BLASTN and 
BLASTX (NCBI). A "target sequence" can be any DNA or amino acid sequence of six or 
more nucleotides or two or more amino acids, preferably from about 10 to 100 amino acids 

25 or from about 30 to 300 nucleotide residues. 

A "target structural motif," or "target motif," refers to any rationally selected 
sequence or combination of sequences in which the sequence(s) are chosen based on a 
three-dimensional configuration that is formed upon the folding of the target motif, or on 
consensus sequences of regulatory or active sites. There are a variety of target motifs known 

30 in the art. Protein target motifs include, but arc not limited to, enzyme active sites and signal 
sequences. Nucleic acid target motifs include, but are not limited to, hairpin structures, 
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promoter sequences and other expression elements such as binding sites for transcription 

factors. 

A variety of structural formats for the input and output means can be used to input 
and output the information in the computer-based systems of the present invention. One 
5 format for an output means ranks fragments of the genome possessing varying degrees of 
homology to a target sequence or target motif. Such presentation provides a skilled artisan 
with a ranking of sequences and identifies the degree of sequence similarity contained in the 
identified fragment. 

A variety of comparing means can be used to compare a target sequence or target 
1 0 motif with the data storage means to identify sequence fragments of the genome. A skilled 
artisan can readily recognize that any one of the publicly available homology search 
programs can be used as the search means for the computer based systems of the present 
invention. 

As discussed above, the "library" of the invention also encompasses biochemical 

1 5 libraries of the polynucleotides of SEQ ID NOS: 1 -844, e.g., collections of nucleic acids 
representing the provided polynucleotides. The biochemical libraries can take a variety of 
forms, e.g., a solution of cDNAs, a pattern of probe nucleic acids stably associated with a 
surface of a solid support (i.e., an array) and the like. Of particular interest are nucleic acid 
arrays in which one or more of SEQ ID NOS: 1-844 is represented on the array. By array is 

20 meant a an article of manufacture that has at least a substrate with at least two distinct 

nucleic acid targets on one of its surfaces, where the number of distinct nucleic acids can be 
considerably higher, typically being at least 10 nt, usually at least 20 nt and often at least 25 
nt. A variety of different array formats have been developed and are known to those of skill 
in the art, including those described in 5,242,974; 5,384,261; 5,405,783; 5,412,087; 

25 5,424,186; 5,429,807; 5,436,327; 5,445,934; 5,472,672; 5,527,681; 5,529,756; 5,545,531; 
5,554,501; 5,556,752; 5,561,071; 5,599,895; 5,624,711; 5,639,603; 5,658,734; WO 
93/17126; WO 95/1 1995; WO 95/35505; EP 742287; and EP 799897. The arrays of the 
subject invention find use in a variety of applications, including gene expression analysis, 
drug screening, mutation analysis and the like, as disclosed in the above-listed exemplary 

3 0 patent documents. 
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In addition to the above nucleic acid libraries, analogous libraries of polypeptides are 

also provided, where the where the polypeptides of the library will represent at least a 

portion of the polypeptides encoded by SEQ ID NOS: 1-844. 

5 VII. Utilities 

A. Use of Polynucleotide Probes in Mapping, and in Tissue Profiling 
Polynucleotide probes, generally comprising at least 12 contiguous nucleotides of a 
polynucleotide as shown in the Sequence Listing, are used for a variety of purposes, such as 
chromosome mapping of the polynucleotide and detection of transcription levels. Additional 

10 disclosure about preferred regions of the disclosed polynucleotide sequences is found in the 
Examples. A probe that hybridizes specifically to a polynucleotide disclosed herein should 
provide a detection signal at least 5-, 10-, or 20-fold higher than the background 
hybridization provided with other unrelated sequences. 

Probes in Detection of Expression Levels. Nucleotide probes are used to detect 

15 expression of a gene corresponding to the provided polynucleotide. The references describe 
an example of a sandwich nucleotide hybridization assay. For example, in Northern blots, 
mRNA is separated electrophoretically and contacted with a probe. A probe is detected as 
hybridizing to an mRNA species of a particular size. The amount of hybridization is 
quantitated to determine relative amounts of expression, for example under a particular 

20 condition. Probes are also used to detect products of amplification by polymerase chain 
reaction. The products of the reaction are hybridized to the probe and hybrids are detected. 
Probes are used for in situ hybridization to cells to detect expression. Probes can also be 
used in vivo for diagnostic detection of hybridizing sequences. Probes are typically labeled 
with a radioactive isotope. Other types of detectable labels can be used such as 

25 chromophores, fluors, and enzymes. Other examples of nucleotide hybridization assays are 
described in WO92/02526 and U.S. Patent No. 5,124,246. 

Alternatively, the Polymerase Chain Reaction (PGR) is another means for detecting 
small amounts of target nucleic acids (see, e.g., Mullis et a/., Meth. EnzymoL (1987) 
755:335; U.S. Patent No. 4,683,195; and U.S. Patent No. 4,683,202). Two primer 

30 polynucleotides nucleotides hybridize with the target nucleic acids and are used to prime the 
reaction. The primers can be composed of sequence within or 3* and 5' to the 
polynucleotides of the Sequence Listing. Alternatively, if the primers are 3' and 5' to these 
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polynucleotides, they need not hybridize to them or the complements. A thermostable 

polymerase creates copies of target nucleic acids from the primers using the original target 

nucleic acids as a template. After a large amount of target nucleic acids is generated by the 

polymerase, it is detected by methods such as Southern blots. When using the Southern blot 

5 method, the labeled probe will hybridize to a polynucleotide of the Sequence Listing or 

complement. 

Furthermore, mRNA or cDNA can be detected by traditional blotting techniques 
described in Sambrook et ah, "Molecular Cloning: A Laboratory Manual" (New York, Cold 
Spring Harbor Laboratory, 1989). mRNA or cDNA generated from mRNA using a 
10 polymerase enzyme can be purified and separated using gel electrophoresis. The nucleic 
acids on the gel are then blotted onto a solid support, such as nitrocellulose. The solid 
support is exposed to a labeled probe and then washed to remove any unhybridized probe. 
Next, the duplexes containing the labeled probe are detected. Typically, the probe is labeled 
with radioactivity. 

1 5 Mapping. Polynucleotides of the present invention are used to identify a 

chromosome on which the corresponding gene resides. Such mapping can be useful in 
identifying the function of the polynucleotide-related gene by its proximity to other genes 
with known function. Function can also be assigned to the polynucleotide-related gene when 
particular syndromes or diseases map to the same chromosome. For example, use of 

20 polynucleotide probes in identification and quantification of nucleic acid sequence 
aberrations is described in U.S. Patent No. 5,783,387. 

For example, fluorescence in situ hybridization (FISH) on normal metaphase spreads 
facilitates comparative genomic hybridization to allow total genome assessment of changes 
in relative copy number of DNA sequences. See Schwartz and Samad, Curr. Opin. 

25 Biotechnol ( 1 994) 5:70; Kallioniemi et aL, Sem. Cancer Biol (1 993) 4:4 1 ; Valdes et al , 
Methods in Molecular Biology (1997) dS:l, Boultwood, ed., Human Press, Totowa, NJ. 
Preparations of human metaphase chromosomes are prepared using standard cytogenetic 
techniques from human primary tissues or cell lines. Nucleotide probes comprising at least 
12 contiguous nucleotides selected from the nucleotide sequence shown in the Sequence 

30 Listing are used to identify the corresponding chromosome. The nucleotide probes are 

labeled, for example, with a radioactive, fluorescent, biotinylated, or chemiluminescent label, 
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and detected by well known methods appropriate for the particular label selected. Protocols 

for hybridizing nucleotide probes to preparations of metaphase chromosomes are also well 

known in the art. A nucleotide probe will hybridize specifically to nucleotide sequences in 

the chromosome preparations that are complementary to the nucleotide sequence of the 

5 probe. 

Polynucleotides are mapped to particular chromosomes using, for example, radiation 
hybrids or chromosome-specific hybrid panels. See Leach et al, Advances in Genetics, 
(1 995) 55:63-99; Walter et al, Nature Genetics (1994) 7:22; Walter and Goodfellow, Trends 
in Genetics (1992) 9:352. Panels for radiation hybrid mapping are available from Research 

10 Genetics, Inc., Huntsville, Alabama, USA. Databases for markers using various panels are 
available via the world wide web at http:/F/shgc-www.stanford.edu; and http://www- 
genome.wi.mit.edu/cgi-bin/contig/rhmapper.pl . The statistical program RHMAP can be used 
to construct a map based on the data from radiation hybridization with a measure of the 
relative likelihood of one order versus another. RHMAP is available via the world wide web 

1 5 at http://www.sph.umich.edu/group/statgen/software. 

In addition, commercial programs are available for identifying regions of 
chromosomes commonly associated with disease, such as cancer. Polynucleotides based on 
the polynucleotides of the invention can be used to probe these regions. For example, if 
through profile searching a provided polynucleotide is identified as corresponding to a gene 

20 encoding a kinase, its ability to bind to a cancer-related chromosomal region will suggest its 
role as a kinase in one or more stages of tumor cell development/growth. Although some 
experimentation would be required to elucidate the role, the polynucleotide constitutes a new 
material for isolating a specific protein that has potential for developing a cancer diagnostic 
or therapeutic. 

25 Tissue Typing or Profiling. Expression of specific mRNA corresponding to the 

provided polynucleotides can vary in different cell types and can be tissue-specific. This 
variation of mRNA levels in different cell types can be exploited with nucleic acid probe 
assays to determine tissue types. For example, PCR, branched DNA probe assays, or 
blotting techniques utilizing nucleic acid probes substantially identical or complementary to 

30 polynucleotides listed in the Sequence Listing can determine the presence or absence of the 
corresponding cDNA or mRNA. 
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For example, a metastatic lesion is identified by its developmental organ or tissue 

source by identifying the expression of a particular marker of that organ or tissue. If a 

polynucleotide is expressed only in a specific tissue type, and a metastatic lesion is found to 

express that polynucleotide, then the developmental source of the lesion has been identified. 

5 Expression of a particular polynucleotide is assayed by detection of either the corresponding 

mRNA or the protein product. Immunological methods, such as antibody staining, are used 

to detect a particular protein product. Hybridization methods can be used to detect particular 

mRNA species, including but not limited to in situ hybridization and Northern blotting. 

Use of Polymorphisms. A polynucleotide of the invention will be useful in forensics, 

10 genetic analysis, mapping, and diagnostic applications if the corresponding region of a gene 
is polymorphic in the human population. Particular polymorphic forms of the provided 
polynucleotides can be used to either identify a sample as deriving from a suspect or rule out 
the possibility that the sample derives from the suspect. Any means for detecting a 
polymorphism in a gene are used, including but not limited to electrophoresis of protein 

15 polymorphic variants, differential sensitivity to restriction enzyme cleavage, and 
hybridization to allele-specific probes. 
B. Antibody Production 

Expression products of a polynucleotide of the invention, the corresponding mRNA 
or cDNA, or the corresponding complete gene are prepared and used for raising antibodies 

20 for experimental, diagnostic, and therapeutic purposes. For polynucleotides to which a 

corresponding gene has not been assigned, this provides an additional method of identifying 
the corresponding gene. The polynucleotide or related cDNA is expressed as described 
above, and antibodies are prepared. These antibodies are specific to an epitope on the 
polypeptide encoded by the polynucleotide, and can precipitate or bind to the corresponding 

25 native protein in a cell or tissue preparation or in a cell-free extract of an in vitro expression 
system. 

Immunogens for raising antibodies are prepared by mixing the polypeptides encoded 
by the polynucleotides of the present invention with adjuvants. Alternatively, polypeptides 
are made as fusion proteins to larger immunogenic proteins. Polypeptides are also 
30 covalently linked to other larger immunogenic proteins, such as keyhole limpet hemocyanin. 
Immunogens are typically administered intradermally, subcutaneously, or intramuscularly. 
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Immunogens are administered to experimental animals such as rabbits, sheep, and mice, to 

generate antibodies. Optionally, the animal spleen cells are isolated and fused with myeloma 

cells to form hybridomas which secrete monoclonal antibodies. Such methods are well 

known in the art. According to another method known in the art, the selected polynucleotide 

5 is administered directly, such as by intramuscular injection, and expressed in vivo. The 

expressed protein generates a variety of protein-specific immune responses, including 

production of antibodies, comparable to administration of the protein. 

Preparations of polyclonal and monoclonal antibodies specific for polypeptides 

encoded by a selected polynucleotide are made using standard methods known in the art. 

10 The antibodies specifically bind to epitopes present in the polypeptides encoded by 
polynucleotides disclosed in the Sequence Listing. Typically, at least 6, 8, 10, or 12 
contiguous amino acids are required to form an epitope. However, epitopes which involve 
non-contiguous amino acids may require more, for example at least 15, 25, or 50 amino 
acids. A short sequence of a polynucleotide may then be unsuitable for use as an epitope to 

15 raise antibodies for identifying the corresponding novel protein, because of the potential for 
cross-reactivity with a known protein. However, the antibodies can be useful for other 
purposes, particularly if they identify common structural features of a known protein and a 
novel polypeptide encoded by a polynucleotide of the invention. 

Antibodies that specifically bind to human polypeptides encoded by the provided 

20 polypeptides should provide a detection signal at least 5-, 10-, or 20-fold higher than a 
detection signal provided with other proteins when used in Western blots or other 
immunochemical assays. Preferably, antibodies that specifically polypeptides of the 
invention do not bind to other proteins in immunochemical assays at detectable levels and 
can immunoprecipitate the specific polypeptide from solution. 

25 To test for the presence of serum antibodies to the polypeptide of the invention in a 

human population, human antibodies are purified by methods well known in the art. 
Preferably, the antibodies are affinity purified by passing antiserum over a column to which 
the corresponding selected polypeptide or fusion protein is bound. The bound antibodies can 
then be eluted from the column, for example using a buffer with a high salt concentration. 
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In addition to the antibodies discussed above, genetically engineered antibody 

derivatives are made, such as single chain antibodies, according to methods well known in 

the art. 

C. Use of Polynucleotides to Construct Arrays for Diagnostics 

5 Polynucleotide arrays provide a high throughput technique that can assay a large 

number of polynucleotide sequences in a sample. This technology can be used as a 
diagnostic and as a tool to test for differential expression to determine function of an 
encoded protein. Arrays can be created by spotting polynucleotide probes onto a substrate 
(e.g., glass, nitrocelllose, etc.) in a two-dimensional matrix or array having bound probes. 

10 The probes can be bound to the substrate by either covalent bonds or by non-specific 

interactions, such as hydrophobic interactions. Samples of polynucleotides can be detectably 
labeled (e.g., using radioactive or fluorescent labels) and then hybridized to the probes. 
Double stranded polynucleotides, comprising the labeled,sample polynucleotides bound to 
probe polynucleotides, can be detected once the unbound portion of the sample is washed 

1 5 away. Techniques for constructing arrays and methods of using these arrays are described in 
EP No. 0 799 897; PCT No. WO 97/29212; PCT No. WO 97/27317; EP No. 0 785 280; PCT 
No. WO 97/02357; U.S. Pat. No. 5,593,839; U.S. Pat. No. 5,578,832; EP No. 0 728 520; 
U.S. Pat. No. 5,599,695; EP No. 0 721 016; U.S. Pat. No. 5,556,752; PCT No. WO 
95/22058; and U.S. Pat. No. 5,631,734. 

20 As discussed in some detail above, arrays can be used to examine differential 

expression of genes and can be used to determine gene function. For example, arrays of the 
instant polynucleotide sequences can be used to determine if any of the provided 
polynucleotides are differentially expressed between a test cell and control cell (e.g., cancer 
cells and normal cells). For example, high expression of a particular message in a cancer 

25 cell, which is not observed in a corresponding normal cell, can indicate a cancer specific 
protein. Exemplary uses of arrays are further described in, for example, Pappalarado et al y 
Sem. Radiation Oncol (1998) 5:217; and Ramsay Nature Biotechnoi (1998) 16:40. 

D. Differential Expression 

The polynucleotides of the invention can also be used to detect differences in 
30 expression levels between two cells, e.g. , as a method to identify abnormal or diseased tissue 
in a human. For polynucleotides corresponding to profiles of protein families as described 
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above, the choice of tissue can be selected according to the putative biological function. In 

general, the expression of a gene corresponding to a specific polynucleotide is compared 
between a first tissue that is suspected of being diseased and a second, normal tissue of the 
human. The tissue suspected of being abnormal or diseased can be derived from a different 
tissue type of the human, but preferably it is derived from the same tissue type; for example 
an intestinal polyp or other abnormal growth should be compared with normal intestinal 
tissue. The normal tissue can be the same tissue as that of the test sample, or any normal 
tissue of the patient, especially those that express the polynucleotide-related gene of interest 
(e.g., brain, thymus, testis, heart, prostate, placenta, spleen, small intestine, skeletal muscle, 
pancreas, and the mucosal lining of the colon). A difference between the polynucleotide- 
related gene, mRNA, or protein in the two tissues which are compared, for example in 
molecular weight, amino acid or nucleotide sequence, or relative abundance, indicates a 
change in the gene, or a gene which regulates it, in the tissue of the human that was 
suspected of being diseased. Examples of detection of differential expression and its use in 
diagnosis of cancer are described in U.S. Patent Nos. 5,688,641 and 5,677,125. 

The polynucleotide-related genes in the two tissues are compared by any means 
known in the art. For example, the two genes can be sequenced, and the sequence of the 
gene in the tissue suspected of being diseased compared with the gene sequence in the 
normal tissue. The genes corresponding to a provided polynucleotide, or portions thereof, in 
the two tissues are amplified, for example using nucleotide primers based on the nucleotide 
sequence shown in the Sequence Listing, using the polymerase chain reaction. The 
amplified genes or portions of genes are hybridized to detectably labeled nucleotide probes 
selected from a nucleotide sequence shown in the Sequence Listing. A difference in the 
nucleotide sequence of the isolated gene in the tissue suspected of being diseased compared 
with the normal nucleotide sequence suggests a role of the gene product encoded by the 
subject polynucleotide in the disease, and provides guidance for preparing a therapeutic 
agent. 

Alternatively, mRNA corresponding to a provided polynucleotide in the two tissues 
is compared. PolyA + RNA is isolated from the two tissues as is known in the art. For 
example, one of skill in the art can readily determine differences in the size or amount of 
mRNA transcripts between the two tissues using Northern blots and detectably labeled 
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nucleotide probes selected from the nucleotide sequence shown in the Sequence Listing. 

Increased or decreased expression of a given mRNA in a tissue sample suspected of being 

diseased, compared with the expression of the same mRNA in a normal tissue, suggests that 

the expressed protein has a role in the disease, and also provides a lead for preparing a 

5 therapeutic agent. 

The comparison can also be accomplished by analyzing polypeptides between the 

matched samples. The sizes of the proteins in the two tissues are compared, for example, 

using antibodies of the present invention to detect polypeptides in Western blots of protein 

extracts from the two tissues. Other changes, such as expression levels and subcellular 

10 localization, can also be detected immunologically, using antibodies to the corresponding 
protein. A higher or lower level of expression of a given polypeptide in a tissue suspected of 
being diseased, compared with the same protein expression level in a normal tissue, is 
indicative that the expressed protein has a role in the disease, and provides guidance for 
preparing a therapeutic agent. 

1 5 Similarly, comparison of polynucleotide sequences or of gene expression products, 

e.g., mRNA and protein, between a human tissue that is suspected of being diseased and a 
normal tissue of a human, are used to follow disease progression or remission in the human. 
Such comparisons are made as described above. For example, increased or decreased 
expression of a gene corresponding to an inventive polynucleotide in the tissue suspected of 

20 being neoplastic can indicate the presence of neoplastic cells in the tissue. The degree of 
increased expression of a given gene in the neoplastic tissue relative to expression of the 
same gene in normal tissue, or differences in the amount of increased expression of a given 
gene in the neoplastic tissue over time, is used to assess the progression of the neoplasia in 
that tissue or to monitor the response of the neoplastic tissue to a therapeutic protocol over 

25 time. 

The expression pattern of any two cell types can be compared, such as low and high 
metastatic tumor cell lines, malignant or non-malignant cells, or cells from tissue which have 
and have not been exposed to a therapeutic agent. A genetic predisposition to disease in a 
human is detected by comparing expression levels of an mRNA or protein corresponding to 
30 a polynucleotide of the invention in a fetal tissue with levels associated in normal fetal 
tissue. Fetal tissues that are used for this purpose include, but are not limited to, amniotic 
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fluid, chorionic villi, blood, and the blastomere of an in vitro-fertilized embryo. The 

comparable normal polynucleotide-related gene is obtained from any tissue. The mRNA or 

protein is obtained from a normal tissue of a human in which the polynucleotide-related gene 

is expressed. Differences such as alterations in the nucleotide sequence or size of the same 

5 product of the fetal polynucleotide-related gene or mRNA, or alterations in the molecular 

weight, amino acid sequence, or relative abundance of fetal protein, can indicate a germline 

mutation in the polynucleotide-related gene of the fetus, which indicates a genetic 

predisposition to disease. Particular diagnostic and prognostic uses of the disclosed 

polynucleotides are described in more detail below. 

10 E. Diagnostic, Prognostic, and Other Uses Based On Differential Expression 

In general, diagnostic methods of the invention for involve detection of a level or 
amount of a gene product, particularly a differentially expressed gene product, in a test 
sample obtained from a patient suspected of having or being susceptible to a disease (e.g., 
breast cancer, lung cancer, colon cancer and/or metastatic forms thereof), and comparing the 

15 detected levels to those levels found in normal cells (e.g., cells substantially unaffected by 
cancer) and/or other control cells (e.g., to differentiate a cancerous cell from a cell affected 
by dysplasia). Furthermore, the severity of the disease can be assessed by comparing the 
detected levels of a differentially expressed gene product with those levels detected in 
samples representing the levels of differentially gene product associated with varying 

20 degrees of severity of disease. 

The term "differentially expressed gene" is intended to encompass a polynucleotide 
that can, for example, include an open reading frame encoding a gene product (e.g., a 
polypeptide), and/or introns of such genes and adjacent 5' and 3' non-coding nucleotide 
sequences involved in the regulation of expression, up to about 20 kb beyond the coding 

25 region, but possibly further in either direction. The gene can be introduced into an 

appropriate vector for extrachromosomal maintenance or for integration into a host genome. 
In general, a difference in expression level associated with a decrease in expression level of 
at least about 25%, usually at least about 50% to 75%, more usually at least about 90% or 
more is indicative of a differentially expressed gene of interest, i.e., a gene that is 

30 underexpressed or down-regulated in the test sample relative to a control sample. 

Furthermore, a difference in expression level associated with an increase in expression of at 

50 



WO 99/33982 PCT/US98/27610 
least about 25%, usually at least about 50% to 75%, more usually at least about 90% and can 

be at least about 1 !/ 2 -fold, usually at least about 2-fold to about 10-fold, and can be about 

100-fold to about 1,000-fold increase relative to a control sample is indicative of a 

differentially expressed gene of interest, i.e., an overexpressed or up-regulated gene. 

5 "Differentially expressed polynucleotide" as used herein means a nucleic acid 

molecule (RNA or DNA) having a sequence that represents a differentially expressed gene, 

e.g., the differentially expressed polynucleotide comprises a sequence {e.g., an open reading 

frame encoding a gene product) that uniquely identifies a differentially expressed gene so 

that detection of the differentially expressed polynucleotide in a sample is correlated with the 

10 presence of a differentially expressed gene in a sample. "Differentially expressed 

polynucleotides" is also meant to encompass fragments of the disclosed polynucleotides, 
e.g., fragments retaining biological activity, as well as nucleic acids homologous, 
substantially similar, or substantially identical (e.g., having about 90% sequence identity) to 
the disclosed polynucleotides. 

15 Methods of the subject invention useful in diagnosis or prognosis typically involve 

comparison of the abundance of a selected differentially expressed gene product in a sample 
of interest with that of a control to determine any relative differences in the expression of the 
gene product, where the difference can be measured qualitatively and/or quantitatively. 
Quantitation can be accomplished, for example, by comparing the level of expression 

20 product detected in the sample with the amounts of product present in a standard curve. A 
comparison can be made visually; by using a technique such as densitometry, with or 
without computerized assistance; by preparing a representative library of cDNA clones of 
mRNA isolated from a test sample, sequencing the clones in the library to determine that 
number of cDNA clones corresponding to the same gene product, and analyzing the number 

25 of clones corresponding to that same gene product relative to the number of clones of the 
same gene product in a control sample; or by using an array to detect relative levels of 
hybridization to a selected sequence or set of sequences, and comparing the hybridization 
pattern to that of a control. The differences in expression are then correlated with the 
presence or absence of an abnormal expression pattern. A variety of different methods for 

30 determining the nucleic acid abundance in a sample are known to those of skill in the art, 
where particular methods of interest include those described in: Pietu et al. Genome Res. 
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(1996) 6:492; Zhao et al, Gene (1995) 755:207; Soares , Curr. Opin. BiotechnoL (1977) 8: 

542; Raval, J. Pharmacol Toxicol Methods (1994) 52:125; Chalifour et al, Anal Biochem 

(1994) 27(5:299; Stolz et al, Mol BiotechnoL (1996) (5:225; Hong et al, Biosci. Reports 

(1982)2:907; and McGraw, Anal Biochem. (1984)745:298. Also of interest are the 

5 methods disclosed in WO 97/273 1 7, the disclosure of which is herein incorporated by 

reference. 

In general, diagnostic assays of the invention involve detection of a gene product of a 
the polynucleotide sequence (e.g., mRNA or polypeptide) that corresponds to a sequence of 
SEQ ID NOS: 1-844. The patient from whom the sample is obtained can be apparently 

10 healthy, susceptible to disease (e.g., as determined by family history or exposure to certain 
environmental factors), or can already be identified as having a condition in which altered 
expression of a gene product of the invention is implicated. 

In the assays of the invention, the diagnosis can be determined based on detected 
gene product expression levels of a gene product encoded by at least one, preferably at least 

15 two or more, at least 3 or more, or at least 4 or more of the polynucleotides having a 

sequence set forth in SEQ ID NOS: 1-844, and can involve detection of expression of genes 
corresponding to all of SEQ ID NOS: 1-844 and/or additional sequences that can serve as 
additional diagnostic markers and/or reference sequences. Where the diagnostic method is 
designed to detect the presence or susceptibility of a patient to cancer, the assay preferably 

20 involves detection of a gene product encoded by a gene corresponding to a polynucleotide 
that is differentially expressed in cancer. For example, a higher level of expression of a 
polynucleotide corresponding to SEQ ID NO:52 relative to a level associated with a normal 
sample can indicate the presence of cancer in the patient from whom the sample is derived. 
In another example, detection of a lower level of a polynucleotide corresponding to SEQ ID 

25 NO:39 relative to a normal level is indicative of the presence of cancer in the patient. 
Further examples of such differentially expressed polynucleotides are described in the 
Examples below. Given the provided polynucleotides and information regarding their 
relative expression levels provided herein, assays using such polynucleotides and detection 
of their expression levels in diagnosis and prognosis will be readily apparent to the ordinarily 

30 skilled artisan. 
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Any of a variety of detectable labels can be used in connection with the various 

embodiments of the diagnostic methods of the invention. Suitable detectable labels include 

fluorochromes,(e.g. fluorescein isothiocyanate (FITC), rhodamine, Texas Red, 

phycoerythrin, allophycocyanin, 6-carboxyfluorescein (6-FAM), 2\7'-dimethoxy-4\5'- 

5 dichloro-6-carboxyfluorescein (JOE), 6-carboxy-X-rhodamine (ROX), 6-carboxy- 

2\4\7\4,74iexachlorofluorescein (HEX), 5-carboxyfluorescein (5-FAM) or N,N,N',N'- 

tetramethyl-6-carboxyrhodamine (TAMRA)), radioactive labels, (e.g. 32 P, 35 S, 3 H, etc.), and 

the like. The detectable label can involve a two stage systems (e.g., biotin-avidin, hapten- 

anti-hapten antibody, etc.) 

1 0 Reagents specific for the polynucleotides and polypeptides of the invention, such as 

antibodies and nucleotide probes, can be supplied in a kit for detecting the presence of an 
expression product in a biological sample. The kit can also contain buffers or labeling 
components, as well as instructions for using the reagents to detect and quantify expression 
products in the biological sample. Exemplary embodiments of the diagnostic methods of the 

1 5 invention are described below in more detail. 

Polypeptide detection in diagnosis. In one embodiment, the test sample is assayed 
for the level of a differentially expressed polypeptide. Diagnosis can be accomplished using 
any of a number of methods to determine the absence or presence or altered amounts of the 
differentially expressed polypeptide in the test sample. For example, detection can utilize 

20 staining of cells or histological sections with labeled antibodies, performed in accordance 
with conventional methods. Cells can be permeabilized to stain cytoplasmic molecules. In 
general, antibodies that specifically bind a differentially expressed polypeptide of the 
invention are added to a sample, and incubated for a period of time sufficient to allow 
binding to the epitope, usually at least about 10 minutes. The antibody can be detectably 

25 labeled for direct detection (e.g. , using radioisotopes, enzymes, fluoresces, 

chemiluminescers, and the like), or can be used in conjunction with a second stage antibody 
or reagent to detect binding (e.g., biotin with horseradish peroxidase-conjugated avidin, a 
secondary antibody conjugated to a fluorescent compound, e.g. fluorescein, rhodamine, 
Texas red, etc.). The absence or presence of antibody binding can be determined by various 

30 methods, including flow cytometry of dissociated cells, microscopy, radiography, 

scintillation counting, etc. Any suitable alternative methods can of qualitative or quantitative 
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detection of levels or amounts of differentially expressed polypeptide can be used, for 

example ELISA, western blot, immunoprecipitation, radioimmunoassay, etc. 

In general, the detected level of differentially expressed polypeptide in the test 

sample is compared to a level of the differentially expressed gene product in a reference or 

5 control sample, e.g., in a normal cell (negative control) or in a cell having a known disease 

state (positive control). For example, a higher level of expression of a polypeptide encoded 

by SEQ ID NO:52 relative to a level associated with a normal sample can indicate the 

presence of cancer in the patient from whom the sample is derived. In another example, 

detection of a lower level of the polypeptide encoded by SEQ ID NO:39 relative to a normal 

1 0 level is indicative of the presence of cancer in the patient. 

mRNA detection. The diagnostic methods of the invention can also or alternatively 
involve detection of mRNA encoded by a gene corresponding to a differentially expressed 
polynucleotides of the invention. Any suitable qualitative or quantitative methods known in 
the art for detecting specific mRNAs can be used. mRNA can be detected by, for example, 

1 5 in situ hybridization in tissue sections, by reverse transcriptase-PCR, or in Northern blots 
containing poly A+ mRNA. One of skill in the art can readily use these methods to 
determine differences in the size or amount of mRNA transcripts between two samples. For 
example, the level of mRNA of the invention in a tissue sample suspected of being 
cancerous or dysplastic is compared with the expression of the mRNA in a reference sample, 

20 e.g., a positive or negative control sample (e.g., normal tissue, cancerous tissue, etc.). In a 
specific non-limiting example, a higher level of mRNA corresponding to SEQ ID NO:52 
relative to a level associated with a normal sample can indicate the presence of cancer in the 
patient from whom the sample is derived. In another example, detection of a lower level of 
mRNA corresponding to SEQ ID NO:39 relative to a normal level is indicative of the 

25 presence of cancer in the patient. 

Any suitable method for detecting and comparing mRNA expression levels in a 
sample can be used in connection with the diagnostic methods of the invention (see, e.g., 
U.S. 5,804,382). For example, mRNA expression levels in a sample can be determined by 
generation of a library of expressed sequence tags (ESTs) from the sample, where the EST 

30 library is representative of sequences present in the sample (Adams, et aL, (1991) Science 
252:1651). Enumeration of the relative representation of ESTs within the library can be used 
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to approximate the relative representation of the gene transcript within the starting sample. 

The results of EST analysis of a test sample can then be compared to EST analysis of a 

reference sample to determine the relative expression levels of a selected polynucleotide, 

particularly a polynucleotide corresponding to one or more of the differentially expressed 

5 genes described herein. 

Alternatively, gene expression in a test sample can be performed using serial analysis 

of gene expression (SAGE) methodology (Velculescu et al., Science (1995) 270:484). In 

short, SAGE involves the isolation of short unique sequence tags from a specific location 

within each transcript (e.g., a sequence of any one of SEQ ID NOS:l-6). The sequence tags 

1 0 are concatenated, cloned, and sequenced. The frequency of particular transcripts within the 
starting sample is reflected by the number of times the associated sequence tag is 
encountered with the sequence population. 

Gene expression in a test sample can also be analyzed using differential display (DD) 
methodology. In DD, fragments defined by specific sequence delimiters (e.g., restriction 

15 enzyme sites) are used as unique identifiers of genes, coupled with information about 

fragment length or fragment location within the expressed gene. The relative representation 
of an expressed gene with a sample can then be estimated based on the relative 
representation of the fragment associated with that gene within the pool of all possible 
fragments. Methods and compositions for carrying out DD are well known in the art, see, 

20 e.g., U.S. 5,776,683; and U.S. 5,807,680. 

Alternatively, gene expression in a sample using hybridization analysis, which is 
based on the specificity of nucleotide interactions. Oligonucleotides or cDNA can be used to 
selectively identify or capture DNA or RNA of specific sequence composition, and the 
amount of RNA or cDNA hybridized to a known capture sequence determined qualitatively 

25 or quantitatively, to provide information about the relative representation of a particular 
message within the pool of cellular messages in a sample. Hybridization analysis can be 
designed to allow for concurrent screening of the relative expression of hundreds to 
thousands of genes by using, for example, array-based technologies having high density 
formats, including filters, microscope slides, or microchips, or solution-based technologies 

30 that use spectroscopic analysis (e.g., mass spectrometry). One exemplary use of arrays in the 
diagnostic methods of the invention is described below in more detail. 
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Use of a single gene in diagnostic applications. The diagnostic methods of the 

invention can focus on the expression of a single differentially expressed gene. For example, 

the diagnostic method can involve detecting a differentially expressed gene, or a 

polymorphism of such a gene (e.g., a polymorphism in an coding region or control region), 

5 that is associated with disease. Disease-associated polymorphisms can include deletion or 

truncation of the gene, mutations that alter expression level and/or affect activity of the 

encoded protein, etc. 

Changes in the promoter or enhancer sequence that affect expression levels of an 

differentially gene can be compared to expression levels of the normal allele by various 

10 methods known in the art. Methods for determining promoter or enhancer strength include 
quantitation of the expressed natural protein; insertion of the variant control element into a 
vector with a reporter gene such as P-galactosidase, luciferase, chloramphenicol 
acetyltransferase, etc. that provides for convenient quantitation; and the like. 

A number of methods are available for analyzing nucleic acids for the presence of a 

15 specific sequence, e.g. a disease associated polymorphism. Where large amounts of DNA 
are available, genomic DNA is used directly. Alternatively, the region of interest is cloned 
into a suitable vector and grown in sufficient quantity for analysis. Cells that express a 
differentially expressed gene can be used as a source of mRNA, which can be assayed 
directly or reverse transcribed into cDNA for analysis. The nucleic acid can be amplified by 

20 conventional techniques, such as the polymerase chain reaction (PCR), to provide sufficient 
amounts for analysis, and a detectable label can be included in the amplification reaction 
(e.g., using a detectably labeled primer or detectably labeled oligonucleotides) to facilitate 
detection. The use of the polymerase chain reaction is described in Saiki, et aL, Science 
(1985) 259:487, and a review of techniques can be found in Sambrook, et al, Molecular 

25 Cloning: A Laboratory Manual, (1989) pp. 14.2. Alternatively, various methods are known 
in the art that utilize oligonucleotide ligation as a means of detecting polymorphisms, for 
examples see Riley et ai, Nucl. Acids Res. (1990) 75:2887; and Delahunty et al, Am. J. 
Hum. Genet. (1996) 55:1239. 

The sample nucleic acid, e.g. amplified or cloned fragment, is analyzed by one of a 

30 number of methods known in the art. The nucleic acid can be sequenced by dideoxy or other 
methods, and the sequence of bases compared to a selected sequence, e.g., to a wild-type 
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sequence. Hybridization with the polymorphic or variant sequence can also be used to 

determine its presence in a sample (e.g., by Southern blot, dot blot, etc.). The hybridization 

pattern of a polymorphic or variant sequence and a control sequence to an array of 

oligonucleotide probes immobilized on a solid support, as described in US 5,445,934, or in 

5 WO 95/35505, can also be used as a means of identifying polymorphic or variant sequences 

associated with disease. Single strand conformational polymorphism (SSCP) analysis, 

denaturing gradient gel electrophoresis (DGGE), and heteroduplex analysis in gel matrices 

are used to detect conformational changes created by DNA sequence variation as alterations 

in electrophoretic mobility. Alternatively, where a polymorphism creates or destroys a 

1 0 recognition site for a restriction endonuclease, the sample is digested with that endonuclease, 

and the products size fractionated to determine whether the fragment was digested. 

Fractionation is performed by gel or capillary electrophoresis, particularly acrylamide or 

agarose gels. 

Screening for mutations in an differentially expressed gene can be based on the 

15 functional or antigenic characteristics of the protein. Protein truncation assays are useful in 
detecting deletions that can affect the biological activity of the protein. Various 
immunoassays designed to detect polymorphisms in proteins can be used in screening. 
Where many diverse genetic mutations lead to a particular disease phenotype, functional 
protein assays have proven to be effective screening tools. The activity of the encoded 

20 protein can be determined by comparison with the wild-type protein. 

Pattern matching in diagnosis using arrays. In another embodiment, the diagnostic 
and/or prognostic methods of the invention involve detection of expression of a selected set 
of genes in a test sample to produce a test expression pattern (TEP). The TEP is compared to 
a reference expression pattern (REP), which is generated by detection of expression of the 

25 selected set of genes in a reference sample (e.g. , a positive or negative control sample). The 
selected set of genes includes at least one of the genes of the invention, which genes 
correspond to the polynucleotide sequences of SEQ ID NOS: 1 -844. Of particular interest is 
a selected set of genes that includes gene differentially expressed in the disease for which the 
test sample is to be screened. 

30 "Reference sequences" or "reference polynucleotides" as used herein in the context of 

differential gene expression analysis and diagnosis/prognosis refers to a selected set of 
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polynucleotides, which selected set includes at least one or more of the differentially 

expressed polynucleotides described herein. A plurality of reference sequences, preferably 

comprising positive and negative control sequences, can be included as reference sequences. 

Additional suitable reference sequences are found in Genbank, Unigene, and other 

nucleotide sequence databases (including, e.g., expressed sequence tag (EST), partial, and 

full-length sequences). 

"Reference array" means an array having reference sequences for use in hybridization 

with a sample, where the reference sequences include all, at least one of, or any subset of the 

differentially expressed polynucleotides described herein. Usually such an array will include 

at least 3 different reference sequences, and can include any one or all of the provided 

differentially expressed sequences. Arrays of interest can further comprise sequences, 

including polymorphisms, of other genetic sequences, particularly other sequences of interest 

for screening for a disease or disorder {e.g., cancer, dysplasia, or other related or unrelated 

diseases, disorders, or conditions). The oligonucleotide sequence on the array will usually 

be at least about 12 nt in length, and can be of about the length of the provided sequences, or 

can extend into the flanking regions to generate fragments of 100 nt to 200 nt in length or 

more. 

A "reference expression pattern" or "REP" as used herein refers to the relative levels 
of expression of a selected set of genes, particularly of differentially expressed genes, that is 
associated with a selected cell type, e.g., a normal cell, a cancerous cell, a cell exposed to an 
environmental stimulus, and the like. A "test expression pattern" or "TEP" refers to relative 
levels of expression of a selected set of genes, particularly of differentially expressed genes, 
in a test sample (e.g., a cell of unknown or suspected disease state, from which mRNA is 
isolated). 

"Diagnosis" as used herein generally includes determination of a subject's 
susceptibility to a disease or disorder, determination as to whether a subject is presently 
affected by a disease or disorder, as well as to the prognosis of a subject affected by a disease 
or disorder (e.g., identification of pre-metastatic or metastatic cancerous states, stages of 
cancer, or responsiveness of cancer to therapy). The present invention particularly 
encompasses diagnosis of subjects in the context of breast cancer (e.g., carcinoma in situ 
(e.g., ductal carcinoma in situ), estrogen receptor (ER)-positive breast cancer, ER-negative 
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breast cancer, or other forms and/or stages of breast cancer), lung cancer (e.g., small cell 

carcinoma, non-small cell carcinoma, mesothelioma, and other forms and/or stages of lung 
cancer), and colon cancer (e.g., adenomatous polyp, colorectal carcinoma, and other forms ■ 
and/or stages of colon cancer). 
5 "Sample" or "biological sample" as used throughout here are generally meant to refer 

to samples of biological fluids or tissues, particularly samples obtained from tissues, 
especially from cells of the type associated with the disease for which the diagnostic 
application is designed (e.g., ductal adenocarcinoma), and the like. "Samples" is also meant 
to encompass derivatives and fractions of such samples (e.g., cell lysates). Where the sample 

10 is solid tissue, the cells of the tissue can be dissociated or tissue sections can be analyzed. 

REPs can be generated in a variety of ways according to methods well known in the 
art. For example, REPs can be generated by hybridizing a control sample to an array having 
a selected set of polynucleotides (particularly a selected set of differentially expressed 
polynucleotides), acquiring the hybridization data from the array, and storing the data in a 

1 5 format that allows for ready comparison of the REP with a TEP. Alternatively, all expressed 
sequences in a control sample can be isolated and sequenced, e.g., by isolating mRNA from 
a control sample, converting the mRNA into cDNA, and sequencing the cDNA. The 
resulting sequence information roughly or precisely reflects the identity and relative number 
of expressed sequences in the sample. The sequence information can then be stored in a 

20 format (e.g. , a computer-readable format) that allows for ready comparison of the REP with 
a TEP. The REP can be normalized prior to or after data storage, and/or can be processed to 
selectively remove sequences of expressed genes that are of less interest or that might 
complicate analysis (e.g., some or all of the sequences associated with housekeeping genes 
can be eliminated from REP data). 

25 TEPs can be generated in a manner similar to REPs, e.g. , by hybridizing a test sample 

to an array having a selected set of polynucleotides, particularly a selected set of 
differentially expressed polynucleotides, acquiring the hybridization data from the array, and 
storing the data in a format that allows for ready comparison of the TEP with a REP. The 
REP and TEP to be used in a comparison can be generated simultaneously, or the TEP can 

30 be compared to previously generated and stored REPs. 



59 



WO 99/33982 PCT/US98/2761 0 

In one embodiment of the invention, comparison of a TEP with a REP involves 

hybridizing a test sample with a reference array, where the reference array has one or more 

reference sequences for use in hybridization with a sample. The reference sequences include 

all, at least one of, or any subset of the differentially expressed polynucleotides described 

5 herein. Hybridization data for the test sample is acquired, the data normalized, and the 

produced TEP compared with a REP generated using an array having the same or similar 

selected set of differentially expressed polynucleotides. Probes that correspond to sequences 

differentially expressed between the two samples will show decreased or increased 

hybridization efficiency for one of the samples relative to the other. 

10 Reference arrays can be produced according to any suitable methods known in the 

art. For example, methods of producing large arrays of oligonucleotides are described in 
U.S. 5,134,854, and U.S. 5,445,934 using light-directed synthesis techniques. Using a 
computer controlled system, a heterogeneous array of monomers is converted, through 
simultaneous coupling at a number of reaction sites, into a heterogeneous array of polymers. 

1 5 Alternatively, microarrays are generated by deposition of pre-synthesized oligonucleotides 
onto a solid substrate, for example as described in PCT published application no. 
WO 95/35505. 

Methods for collection of data from hybridization of samples with a reference arrays 
are also well known in the art. For example, the polynucleotides of the reference and test 

20 samples can be generated using a detectable fluorescent label, and hybridization of the 

polynucleotides in the samples detected by scanning the microarrays for the presence of the 
detectable label. Methods and devices for detecting fluorescently marked targets on devices 
are known in the art. Generally, such detection devices include a microscope and light 
source for directing light at a substrate. A photon counter detects fluorescence from the 

25 substrate, while an x-y translation stage varies the location of the substrate. A confocal 
detection device that can be used in the subject methods is described in U.S. Patent no. 
5,631,734. A scanning laser microscope is described in Shalon et al., Genome Res, (1996) 
d:639. A scan, using the appropriate excitation line, is performed for each fluorophore used. 
The digital images generated from the scan are then combined for subsequent analysis. For 

30 any particular array element, the ratio of the fluorescent signal from one sample (e.g., a test 
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sample) is compared to the fluorescent signal from another sample (e.g., a reference sample), 

and the relative signal intensity determined. 

Methods for analyzing the data collected from hybridization to arrays are well known 

in the art. For example, where detection of hybridization involves a fluorescent label, data 

5 analysis can include the steps of determining fluorescent intensity as a function of substrate 

position from the data collected, removing outliers, Le. data deviating from a predetermined 

statistical distribution, and calculating the relative binding affinity of the targets from the 

remaining data. The resulting data can be displayed as an image with the intensity in each 

region varying according to the binding affinity between targets and probes. 

1 0 In general, the test sample is classified as having a gene expression profile 

corresponding to that associated with a disease or non-disease state by comparing the TEP 
generated from the test sample to one or more REPs generated from reference samples (e.g., 
from samples associated with cancer or specific stages of cancer, dysplasia, samples affected 
by a disease other than cancer, normal samples, etc.). The criteria for a match or a 

1 5 substantial match between a TEP and a REP include expression of the same or substantially 
the same set of reference genes, as well as expression of these reference genes at 
substantially the same levels (e.g., no significant difference between the samples for a signal 
associated with a selected reference sequence after normalization of the samples, or at least 
no greater than about 25% to about 40% difference in signal strength for a given reference 

20 sequence. In general, a pattern match between a TEP and a REP includes a match in 

expression, preferably a match in qualitative or quantitative expression level, of at least one 
of, all or any subset of the differentially expressed genes of the invention. 

Pattern matching can be performed manually, or can be performed using a computer 
program. Methods for preparation of substrate matrices (e.g., arrays), design of 

25 oligonucleotides for use with such matrices, labeling of probes, hybridization conditions, 
scanning of hybridized matrices, and analysis of patterns generated, including comparison 
analysis, are described in, for example, U.S. 5,800,992. 

F. Use of the Polynucleotides of the Invention in Cancer 
Oncogenesis involves the unbridled growth, dedifferentiation and abnormal 

30 migration of cells. Cancerous cells can have the ability to compress, invade, and destroy 
normal tissue. Cancerous cells may also metastasize to other parts of the body via the 
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bloodstream or the lymph system and colonize in these other areas. Different cancers are 

classified by the cell from which the cancerous cell is derived and from its cellular 

morphology and/or state of differentiation. 

Somatic genetic abnormalities cause cancer initiation and progression. Cancer 

5 generally is clonally formed, i.e. gain of function of oncogenes and loss of function of tumor 

suppressor genes within a single cell transform the cell to be cancerous, and that single cell 

grows and divides to form a cancerous lesion. The genes known to be involved in cancer 

initiation and progression are involved in numerous cellular functions, including 

developmental differentiation, cell cycle regulation, cell signaling, immunological response, 

1 0 DN A replication, and DNA repair. 

The identification and characterization of genetic or biochemical markers in blood or 
tissues that will detect the earliest changes along the carcinogenesis pathway and monitor the 
efficacy of various therapies and preventive interventions is a major goal of cancer research. 
Scientists have identified genetic changes in stool specimens that indicate the stages of colon 

1 5 cancer, and other biomarkers such as gene mutations, hormone receptors, proteins that 
inhibit metastasis, and enzymes that metabolize drugs are all being used to determine the 
severity and predict the course of breast, prostate, lung, and other cancers. 

Recent advances in the pathogenesis of certain cancers has been helpful in 
determining patient treatment. The level of expression of certain polynucleotides can be 

20 indicative of a poorer prognosis, and therefore warrant more aggressive chemo- or radio- 
therapy for a patient. The correlation of novel surrogate tumor specific features with 
response to treatment and outcome in patients has defined certain prognostic indicators that 
allow the design of tailored therapy based on the molecular profile of the tumor. These 
therapies include antibody targeting and gene therapy. Moreover, a promising level of one 

25 or more marker polynucleotides can provide impetus for not aggressively treating a 

particular patient, thus sparing the patient the deleterious side effects of aggressive therapy. 
Determining expression of certain polynucleotides and comparison of a patients profile with 
known expression in normal tissue and variants of the disease allows a determination of the 
best possible treatment for a patient, both in terms of specificity of treatment and in terms of 

30 comfort level of the patient. 



62 



WO 99/33982 PCT/US98/2761 0 

Surrogate tumor markers, such as polynucleotide expression, can also be used to 

better classify, and thus diagnose and treat, different forms and disease states of cancer. 

Two classifications widely used in oncology that can benefit from identification of the 

expression levels of the polynucleotides of the invention are staging of the cancerous 

5 disorder, and grading the nature of the cancerous tissue. 

Staging. Staging is a process used by physicians to describe how advanced the 

cancerous state is in a patient. Staging assists the physician in determining a prognosis, 

planning treatment and evaluating the results of such treatment. Different staging systems 

are used for different types of cancer, but each generally involves the following 

10 determinations: the type of tumor, indicated by T; whether the cancer has metastasized to 
nearby lymph nodes, indicated by N; and whether the cancer has metastasized to more 
distant parts of the body, indicated by M. This system of staging is called the TNM system. 
Generally, if a cancer is only detectable in the area of the primary lesion without having 
spread to any lymph nodes it is called Stage I. If it has spread only to the closest lymph 

15 nodes, it is called Stage II. In Stage III, the cancer has generally spread to the lymph nodes 
in near proximity to the site of the primary lesion. Cancers that have spread to a distant part 
of the body, such as the liver, bone, brain or another site, are called Stage IV, the most 
advanced stage. 

Currently, the determination of staging is done using pathological techniques and is 
20 based more on the presence or absence of malignant tissue rather than the characteristics of 
the tumor type. Presence or absence of malignant tissue is based primarily on the gross 
morphology of the cells in the areas biopsied. The polynucleotides of the invention can 
facilitate fine-tuning of the staging process by identifying markers for the aggresivity of a 
cancer, e.g. the metastatic potential, as well as the presence in different areas of the body. 
25 Thus, a Stage II cancer with a polynucleotide signifying a high metastatic potential cancer 
can be used to change a borderline Stage II tumor to a Stage III tumor, justifying more 
aggressive therapy. Conversely, the presence of a polynucleotide signifying a lower 
metastatic potential allows more conservative staging of a tumor. 

Grading of cancers. Grade is a term used to describe how closely a tumor resembles 
30 normal tissue of its same type. Based on the microscopic appearance of a tumor, 

pathologists will identify the grade of a tumor based on parameters such as cell morphology, 
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cellular organization, and other markers of differentiation. As a general rule, the grade of a 

tumor corresponds to its rate of growth or aggressiveness. » That is, undifferentiated or high- 
grade tumors grow more quickly than well differentiated or low-grade tumors. Information 
about tumor grade is useful in planning treatment and predicting prognosis. 
5 The American Joint Commission on Cancer has recommended the following 

guidelines for grading tumors: 1) GX Grade cannot be assessed; 2) Gl Well differentiated; 
G2 Moderately well differentiated; 3) G3 Poorly differentiated; 4) G4 Undifferentiated. 
Although grading is used by pathologists to describe most cancers, it plays a more important 
role in treatment planning for certain types than for others. An example is the Gleason 
10 system that is specific for prostate cancer, which uses grade numbers to describe the degree 
of differentiation. Lower Gleason scores indicate well-differentiated cells. Intermediate 
scores denote tumors with moderately differentiated cells. Higher scores describe poorly 
differentiated cells. Grade is also important in some types of brain tumors and soft tissue 
sarcomas. 

15 The polynucleotides of the invention can be especially valuable in determining the 

grade of the tumor, as they not only can aid in determining the differentiation status of the 
cells of a tumor, they can also identify factors other than differentiation that are valuable in 
determining the aggressivity of a tumor, such as metastatic potential. 

Familial Cancer Genes. A number of cancer syndromes are linked to Mendelian 

20 inheritance of a predisposition to develop particular cancers. The following table contains a 
list of cancer types that can be inherited, and for which the gene or genes responsible have 
been identified. Most of the cancer types listed can occur as part of several different genetic 
conditions, each caused by alterations in a different gene. 



Cancer Type 


Genetic Condition 


Gene 


Brain 




Li-Fraumeni syndrome 
Neurofibromatosis 1 
Neurofibromatosis 2 
von Hippel-Lindau syndrome 
Tuberous sclerosis 2 


TP53 

NFl 

NF2 

VHL 

TSC2 


Breast 




Hereditary breast/ovarian cancer 1 
Hereditary breast/ovarian cancer 2 
Li-Fraumeni syndrome 
Ataxia telangiectasia 


BRCAl 
BRCA2 
TP53 
ATM 


Colon 


Familial adenomatous polyposis (FAP) 
Hereditary non-polyposis colon cancer (HNPCC) 1 
Hereditary non-polyposis colon cancer (HNPCC) 2 


APC 

HMSH2 

hMLHl 
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Cancer Type 


Genetic Condition 
Hereditary non-poiyposis colon cancer (HNPCC) 3 
Hereditary non-polyposis colon cancer (HNPCC) 4 


Gene 
hPMSl 
hPMS2 


Endocrine 

(parathyroid, pituitary, GI endocrine) 


Multiple endocrine neoplasia 1 (MEN1) 


MEN1 


Endocrine 

(pheochromacytoma, medullary thyroid) 


Multiple endocrine neoplasia 2 (MEN2) 


RET 


Endometrial 


Hereditary non-polyposis colon cancer (HNPCC) 1 
Hereditary non-polyposis colon cancer (HNPCC) 2 
Hereditary non-polyposis colon cancer (HNPCC) 3 
Hereditary non-polyposis colon cancer (HNPCC) 4 


hMSH2 
hMLHl 
hPMSl 
hPMS2 


Eye 


Hereditary retinoblastoma 


RBI 


Hematologic 

f lvmnhonias and leukemia) 


Li-Fraumeni syndrome 
Ataxia telangiectasia 


TP53 
ATM 


Kidney 


Hereditary Wilms' tumor 
von Hippel-Lindau syndrome 
Tuberous sclerosis 2 


WT1 
VHL 
TSC2 


Ovary 


Hereditary breast/ovarian cancer 1 
Hereditary breast/ovarian cancer 2 


BRCA1 
BRCA2 


Sarcoma 


Hereditary retinoblastoma 
Li-Fraumeni syndrome 
Neurofibromatosis 1 


RBI 

TP53 

NF1 


Skin 


Hereditary melanoma 1 
Hereditary melanoma 2 
Basal cell naevus (Gorlin) syndrome 


CDKN2 

CDK4 

PTCH 


Stomach 


Hereditary non-polyposis colon cancer (HNPCC) 1 
Hereditary non-polyposis colon cancer (HNPCC) 2 
Hereditary non-polyposis colon cancer (HNPCC) 3 
Hereditary non-polyposis colon cancer (HNPCC) 4 


hMSH2 
hMLHl 
hPMSl 
hPMS2 



The polynucleotides of the invention can be especially useful to monitor patients having any 
of the above syndromes to detect potentially malignant events at a molecular level before 
they are detectable at a gross morphological level. As can be seen from the table, a number 

5 of genes are involved in multiple forms of cancer. Thus, a polynucleotide of the invention 
identified as important for metastatic colon cancer can also have clinical implications for a 
patient diagnosed with stomach cancer or endometrial cancer. 

Lung Cancer. Lung cancer is one of the most common cancers in the United States, 
accounting for about 15 percent of all cancer cases, or 170,000 new cases each year. At this 

10 time, over half of the lung cancer cases in the United States are in men, but the number found 
in women is increasing and will soon equal that in men. Today more women die of lung 
cancer than of breast cancer. Lung cancer is especially difficult to diagnose and treat 
because of the large size of the lungs, which allows cancer to develop for years undetected. 
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In fact, lung cancer can spread outside the lungs without causing any symptoms. Adding to 

the confusion, the most common symptom of lung cancer, a persistent cough, can often be 

mistaken for a cold or bronchitis. 

Although there are more than a dozen different kinds of lung cancer, the two main 

5 types of lung cancer are small cell and nonsmall cell, which encompass about 90% of all 

lung cancer cases. Small cell carcinoma (also called oat cell carcinoma), which usually starts 

in one of the larger bronchial tubes, grows fairly rapidly, and is likely to be large by the time 

of diagnosis. Nonsmall cell lung cancer (NSCLC) is made up of three general subtypes of 

lung cancer. Epidermoid carcinoma (also called squamous cell carcinoma) usually starts in 

10 one of the larger bronchial tubes and grows relatively slowly. The size of these tumors can 
range from very small to quite large. Adenocarcinoma starts growing near the outside 
surface of the lung and can vary in both size and growth rate. Some slowly growing 
adenocarcinomas are described as alveolar cell cancer. Large cell carcinoma starts near the 
surface of the lung, grows rapidly, and the growth is usually fairly large when diagnosed. 

1 5 Other less common forms of l^ung cancer are carcinoid, cylindroma, mucoepidermoid, and 
malignant mesothelioma. 

Currently, CT scans, MRIs, X-rays, sputum cytology, and biopsies are used to 
diagnose nonsmall cell lung cancer. The form and cellular origin of the lung cancer is 
diagnosed primarily through biopsy from either a surgical biopsy or a needle aspiration of 

20 lung tissue, and usually the biopsy is prompted from an abnormality identified on an X-ray. 
In some cases, sputum cytology can reveal lung cancers in patients with normal X-rays or 
can determine the type of lung cancer, but because it cannot pinpoint the tumor's location, a 
positive sputum cytology test is usually followed by further tests. Since these tests are based 
in large part on gross morphology of the tissue, the diagnosis of a particular kind of tumor is 

25 largely subjective, and the diagnosis can vary significantly between clinicians. 

The polynucleotides of the invention can be used to distinguish types of lung cancer 
as well as identifying traits specific to a certain patient's cancer. For example, if the patient's 
biopsy expresses a polynucleotide that is associated with a low metastatic potential, it may 
justify leaving a larger portion of the patient's lung in surgery to remove the lesion. 

30 Alternatively, a smaller lesion with expression of a polynucleotide that is associated with 
high metastatic potential may justify a more radical removal of lung tissue and/or the 
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surrounding lymph nodes, even if no metastasis can be identified through pathological 

examination. 

Similarly, the expression of polynucleotides of the invention can be used in the 
diagnosis, prognosis and management of colorectal cancer. The differential expression of a 
5 polynucleotide in hyperplasia can be used as a diagnostic marker for metastatic lung cancer. 
The polynucleotides of the invention that would be especially useful for this purpose are 
those that exhibit differential expression between high metastatic versus low metastatic lung 
cancer , i.e. SEQ ID NOS: 9, 34, 42, 62, 74, 106, 119, 135, 154, 160, 260, 308, 323, 349, 
361, 369, 371, 381, 395, and 400. Detection of malignant lung cancer with a higher 
1 0 metastatic potential can be determined using expression levels of any of these sequences 
alone or in combination with the levels of expression of other known genes. 

Breast Cancer. The National Cancer Institute (NCI) estimates that about 1 in 8 
women in the United States will develop breast cancer during her lifetime. Clinical breast 
examination and mammography are recommended as combined modalities for breast cancer 
1 5 screening, and the nature of the cancer will often depend upon the location of the tumor and 
the cell type from which the tumor is derived. The majority of breast cancers are 
adenocarcinomas subtypes, which can be summarized as follows: 

Ductal carcinoma in situ (DCIS): Ductal carcinoma in situ is the most common type 
of noninvasive breast cancer. In DCIS, the malignant cells have not metastasized through 
20 the walls of the ducts into the fatty tissue of the breast. Comedocarcinoma is a type of DCIS 
that is more likely than other types of DCIS to come back in the same area after 
lumpectomy. It is more closely linked to eventual development of invasive ductal carcinoma 
than other forms of DCIS. 

Infiltrating (or invasive) ductal carcinoma (IDC): this type of cancer has metastasized 
25 through the wall of the duct and invaded the fatty tissue of the breast. At this point, it has the 
potential to use the lymphatic system and bloodstream for metastasis to more distant parts of 
the body. Infiltrating ductal carcinoma accounts for about 80% of breast cancers. 

Lobular carcinoma in situ (LCIS): While not a true cancer, LCIS (also called lobular 
neoplasia) is sometimes classified as a type of noninvasive breast cancer. It does not 
30 penetrate through the wall of the lobules. Although it does not itself usually become an 
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invasive cancer, women with this condition have a higher risk of developing an invasive 

breast cancer in the same breast, or in the opposite breast. 

Infiltrating (or invasive) lobular carcinoma (ILC): ILC is similar to IDC, in that it has 
the potential metastasize elsewhere in the body. About 10% to 15% of invasive breast 
5 cancers are invasive lobular carcinomas. ILC can be more difficult to detect by 
mammogram than IDC. 

Inflammatory breast cancer: This rare type of invasive breast cancer accounts for 
about 1 % of all breast cancers and is extremely aggressive. Multiple skin symptoms 
associated with this cancer are caused by cancer cells blocking lymph vessels or channels in 
1 0 the skin over the breast. 

Medullary carcinoma: This special type of infiltrating breast cancer has a relatively 
well defined, distinct boundary between tumor tissue and normal tissue. It accounts for 
about 5% of breast cancers. The prognosis for this kind of breast cancer is better than for 
other types of invasive breast cancer. 
1 5 Mucinous carcinoma: This rare type of invasive breast cancer originates from mucus- 

producing cells. The prognosis for mucinous carcinoma is better than for the more common 
types of invasive breast cancer. 

Paget's disease of the nipple: This type of breast cancer starts in the ducts and spreads 
to the skin of the nipple and the areola. It is a rare type of breast cancer, occurring in only 
20 1% of all cases. Paget's disease can be associated with in situ carcinoma, or with infiltrating 
breast carcinoma. If no lump can be felt in the breast tissue, and the biopsy shows DCIS but 
no invasive cancer, the prognosis is excellent. 

Phyllodes tumor: This very rare type of breast tumor forms from the stroma of the 
breast, in contrast to carcinomas which develop in the ducts or lobules. Phyllodes (also 
25 spelled phylloides) tumors are usually benign, but are malignant on rare occasions. 

Nevertheless, malignant phyllodes tumors are very rare and less than 10 women per year in 
the US die of this disease. Benign phyllodes tumors are successfully treated by removing the 
mass and a narrow margin of normal breast tissue. 

Tubular carcinoma: Accounting for about 2% of all breast cancers, tubular 
30 carcinomas are a special type of infiltrating breast carcinoma. They have a better prognosis 
than usual infiltrating ductal or lobularcarcinomas. 
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High-quality mammography combined with clinical breast exam remains the only 

screening method clearly tied to reduction in breast cancer mortality. Lower dose x-rays, 

digitized computer rather than film images, and the use of computer programs to assist 

diagnosis, are almost ready for widespread dissemination. Other technologies also are being 

5 developed, including magnetic resonance imaging and ultrasound. In addition, a very low 

radiation exposure technique, positron emission tomography has the potential for detecting 

early breast cancer. 

It is also possible to differentiate between non-cancerous breast tissue and malignant 
breast tissue by analyzing differential gene expression between tissues. In addition, there 

10 may be several possible alterations that lead to the various possible types of breast cancer. 
The different types of breast tumors (e.g., invasive vs. non-invasive, ductal vs. axillary 
lymph node) can be differentiable from one another by the identification of the differences in 
genes expressed by different types of breast tumor tissues (Porter- Jordan et aL, Hematol 
Oncol Clin North Am (1994) 5:73). Breast cancer can thus be generally diagnosed by 

1 5 detection of expression of a gene or genes associated with breast tumors. Where enough 
information is available about the differential gene expression between various types of 
breast tumor tissues, the specific type of breast tumor can also be diagnosed. 

For example, increased estrogen receptor (ER) expression in normal breast 
epithileum, while not itself indicative of malignant tissue, is a known risk marker for 

20 development of breast cancer. Khan S A et al , Cancer Res ( 1 994) 54:993 . Malignant breast 
cancer is often divided into two groups, ER-positive and ER-negative, based on the estrogen 
receptor status of the tissue. The ER status represents different survival length and response 
to hormone therapy, and is thought to represent either: 1) an indicator of different stages of 
the disease, or 2) an indicator that allows differentiation between two similar but distinct 

25 diseases. K. Zhu et al, Med. Hypotk (1997) 49:69. A number of other genes are known to 
vary expression between either different stages of cancer or different types of similar breast 
cancer. 

Similarly, the expression of polynucleotides of the invention can be used in the 
diagnosis and management of breast cancer. The differential expression of a polynucleotide 
30 in human breast tumor tissue can be used as a diagnostic marker for human breast cancer. 
The polynucleotides of the invention that would be especially useful for this purpose are 
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those that exhibit differential expression between breast cancer tissue with a high metastatic 

potential and a low metastatic potential, i.e. SEQ ID NOS: 9, 42, 52, 62, 65, 66, 68, 1 14, 

123, 144, 172, 178, 214, 219, 223, 258, 317, and 379. Detection of breast cancer can be 

determined using expression levels of any of these sequences alone or in combination. 

5 Determination of the aggressive nature and/or the metastatic potential of a breast cancer can 

also be determined by comparing levels of one or more polynucleotides of the invention and 

comparing levels of another sequence known to vary in cancerous tissue, e.g. ER expression. 

In addition, development of breast cancer can be detected by examining the ratio of SEQ ID 

NO: to the levels of steroid hormones (e.g., testosterone or estrogen) or to other hormones 

10 (e.g. , growth hormone, insulin). Thus expression of specific marker polynucleotides can 
be used to discriminate between normal and cancerous breast tissue, to discriminate between 
breast cancers with different cells of origin, to discriminate between breast cancers with 
different potential metastatic rates, etc. 

Diagnosis of breast cancer can also involve comparing the expression of a 

15 polynucleotide of the invention with the expression of other sequences in non-malignant 
breast tissue samples in comparison to one or more forms of the diseased tissue. A 
comparison of expression of one or more polynucleotides of the invention between the 
samples provides information on relative levels of these polynucleotides as well as the ratio 
of these polynucleotides to the expression of other sequences in the tissue of interest 

20 compared to normal. 

This risk of breast cancer is elevated significantly by the presence of an inherited risk 
for breast cancer, such as a mutation in BRCA-1 or BRCA-2. New diagnostic tools are 
being developed to address the needs of higher risk patients to complement mammography 
and physical examinations for early detection of breast cancer, particularly among younger 

25 women. The presence of antigen or expression markers in nipple aspirate fluid (NAF) 
samples collected from one or both breasts can be useful for useful for risk assessment or 
early cancer detection. Breast cytology and biomarkers obtained by random fine needle 
aspiration have been used to identify hyperplasia with atypia and overexpression of p53 and 
EGFR. The polynucleotides of the invention can be used in multivariate analysis with 

30 expression studies with genes such as p53 and EGFR as risk predictors and as surrogate 
endpoint biomarkers for breast cancer. 
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As well as being used for diagnosis and risk assessment, the expression of certain 

genes can also correlated to prognosis of a disease state. The expression of particular gene 

have been used as prognostic indicators for breast cancer including increased expression of 

c-erbB-2, pS2, ER, progesterone receptor, epidermal growth factor receptor (EGFR), neu, 

5 myc, bcl-2, int2, cytosolic tyrosine kinase, cyclin E,prad-] 9 hst, uPA, PAI-1, PAI-2, 

cathepsin D, as well as the presence of a number of cancer-specific antigens, e.g. CEA, CA 

M26, CA M29 and CA 15.3. Davis, Br. J. BiomedSci. (1996) 55:157. Poor prognosis has 

also been linked to a decrease in expression of certain genes, such as p5 3, Rb, nm23. The 

expression of the polynucleotides of the invention can be of prognostic value for determining 

1 0 the metastatic potential of a malignant breast cancer, as this molecules are differentially 
expressed between high and low metastatic potential tissues tumors. The levels of these 
polynucleotides in patients with malignant breast cancer can compared to normal tissue, 
malignant tissue with a known high potential metastatic level, and malignant tissue with a 
known lower level of metastatic potential to provide a prognosis for a particular patient. 

1 5 Such a prognosis is predictive of the extent and nature of the cancer. The determined 
prognosis is useful in determining the prognosis of a patient with breast cancer, both for 
initial treatment of the disease and for longer-term monitoring of the same patient. If 
samples are taken from the same individual over a period of time, differences in 
polynucleotide expression that are specific to that patient can be identified and closely 

20 watched. 

Colon Cancer. Colorectal cancer is one of the most common neoplasms in humans 
and perhaps the most frequent form of hereditary neoplasia. Prevention and early detection 
are key factors in controlling and curing colorectal cancer. Indeed, colorectal cancer is the 
second most preventable cancer, after lung cancer. Colorectal cancer begins as polyps, 

25 which are small, benign growths of cells that form on the inner lining of the colon. Over a 
period of several years, some of these polyps accumulate additional mutations and become 
cancerous. About 20 percent of all cases of colon cancer are thought to be related to 
heredity. Currently, multiple familial colorectal cancer disorders have been identified, which 
are summarized as follows: 

30 Familial adenomatous polyposis (FAP): This condition results in a person having 

hundreds or even thousands of polyps in the colon and rectum that usually first appear during 
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the teenage years. Cancer nearly always develops in one or more of these polyps between 

the ages of 30 and 50. 

Gardner's syndrome: Like FAP, Gardner's syndrome results in polyps and colorectal 

cancers that develop at a young age. It can also cause benign tumors of the skin, soft 
5 connective tissue and bones. 

Hereditary nonpolyposis colon cancer (HNPCC): People with this condition tend to 

develop colorectal cancer at a young age, without first having many polyps. HNPCC has an 

autosomal dominant pattern of inheritance with variable but high penetrance estimated to be 

about 90%. HNPCC underlies 0.5%- 10% of all cases of colorectal cancer. An understanding 
1 0 of the mechanisms behind the development of HNPCC is emerging, and genetic 

presymptomatic testing, now being conducted in research settings, soon will be available on 

a widespread basis for individuals identified at risk for this disease. 

Familial colorectal cancer in Ashkenazi Jews: Recent research has found an inherited 

tendency to developing colorectal cancer among some Jews of Eastern European descent. 
1 5 Like people with FAP, Gardner's syndrome, and HNPCC, their increased risk is due to an 

inherited mutation present in about 6% of American Jews. 

Several tests are currently used to screen for colorectal cancer, including digital rectal 

examination, fecal occult blood test, sigmoidoscopy, colonoscopy, virtual colonoscopy and 

MRI. Each of these tests identifies potential colorectal cancer lesions, or a risk of 
20 development of these lesions, at a fairly gross morphological level. 

The sequential alteration of a number of genes is associated with malignant 

adenocarcinoma, including the genes DCC, p53, ras, and FAP. For a review, see e.g. Fearon 

ER, et aU Cell (1990) 61(5):759; Hamilton SR et al. 9 Cancer (1 993) 72:957; Bodmer W, et 

al.NatGenet. (1994) 4(3)2 17; Fearon ER, Ann N Y Acad Sci. (1995) 768:\0\. Molecular 
25 genetic alterations are thus promising as potential diagnostic and prognostic indicators in 

colorectal carcinoma and molecular genetics of colorectal carcinoma since it is possible to 

differentiate between different types of colorectal neoplasias using molecular markers. 

Colorectal cancer can thus be generally diagnosed by detection of expression of a gene or 

genes associated with colorectal tumors. 
30 Similarly, the expression of polynucleotides of the invention can be used in the 

diagnosis, prognosis and management of colorectal cancer. The differential expression of a 
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polynucleotide in hyperplasia can be used as a diagnostic marker for colon cancer. The 

polynucleotides of the invention that would be especially useful for this purpose are those 

that exhibit differential expression between malignant metastatic colon cancer and normal 

patient tissue , i.e. SEQ ID NOS: 52, 119, 172, 288. Detection of malignant colon cancer 

5 can be determined using expression levels of any of these sequences alone or in combination 

with the levels of expression. 

Determination of the aggressive nature and/or the metastatic potential of a colon 

cancer can also be determined by comparing levels of one or more polynucleotides of the 

invention and comparing total levels of another sequence known to vary in cancerous tissue, 

10 e.g. p53 expression. In addition, development of colon cancer can be detected by examining 
the ratio of any of the polynucleotides of the invention to the levels of oncogenes (e.g. ras) 
or tumor suppressor genes (e.g. FAP or p53). Thus expression of specific marker 
polynucleotides can be used to discriminate between normal and cancerous breast tissue, to 
discriminate between breast cancers with different cells of origin, to discriminate between 

1 5 breast cancers with different potential metastatic rates, etc. 

G. Use of Polynucleotides to Screen for Peptide Analogs and Antagonists 
Polypeptides encoded by the instant polynucleotides and corresponding full length 
genes can be used to screen peptide libraries to identify binding partners, such as receptors, 
from among the encoded polypeptides. 

20 A library of peptides can be synthesized following the methods disclosed in U.S. Pat. 

No. 5,010,175 C 175), and in WO 91/17823. As described below in brief, one prepares a 
mixture of peptides, which is then screened to identify the peptides exhibiting the desired 
signal transduction and receptor binding activity. In the '175 method, a suitable peptide 
synthesis support (e.g., a resin) is coupled to a mixture of appropriately protected, activated 

25 amino acids. The concentration of each amino acid in the reaction mixture is balanced or 
adjusted in inverse proportion to its coupling reaction rate so that the product is an equimolar 
mixture of amino acids coupled to the starting resin. The bound amino acids are then 
deprotected, and reacted with another balanced amino acid mixture to form an equimolar 
mixture of all possible dipeptides. This process is repeated until a mixture of peptides of the 

30 desired length (e.g., hexamers) is formed. Note that one need not include all amino acids in 
each step: one can include only one or two amino acids in some steps (e.g., where it is 
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known that a particular amino acid is essential in a given position), thus reducing the 

complexity of the mixture. After the synthesis of the peptide library is completed, the 

mixture of peptides is screened for binding to the selected polypeptide. The peptides are 

then tested for their ability to inhibit or enhance activity. Peptides exhibiting the desired 

5 activity are then isolated and sequenced. 

The method described in WO 91/17823 is similar. However, instead of reacting the 

synthesis resin with a mixture of activated amino acids, the resin is divided into twenty equal 

portions (or into a number of portions corresponding to the number of different amino acids 

to be added in that step), and each amino acid is coupled individually to its portion of resin. 

10 The resin portions are then combined, mixed, and again divided into a number of equal 

portions for reaction with the second amino acid. In this manner, each reaction can be easily 
driven to completion. Additionally, one can maintain separate "subpools" by treating 
portions in parallel, rather than combining all resins at each step. This simplifies the process 
of determining which peptides are responsible for any observed receptor binding or signal 

1 5 transduction activity. 

In such cases, the subpools containing, e.g., 1-2,000 candidates each are exposed to 
one or more polypeptides of the invention. Each subpool that produces a positive result is 
then resynthesized as a group of smaller subpools (sub-subpools) containing, e.g., 20-100 
candidates, and reassayed. Positive sub-subpools can be resynthesized as individual 

20 compounds, and assayed finally to determine the peptides that exhibit a high binding 
constant. These peptides can be tested for their ability to inhibit or enhance the native 
activity. The methods described in WO 91/7823 and U.S. Patent No. 5,194,392 (herein 
incorporated by reference) enable the preparation of such pools and subpools by automated 
techniques in parallel, such that all synthesis and resynthesis can be performed in a matter of 

25 days. 

Peptide agonists or antagonists are screened using any available method, such as 
signal transduction, antibody binding, receptor binding, mitogenic assays, chemotaxis 
assays, etc. The methods described herein are presently preferred. The assay conditions 
ideally should resemble the conditions under which the native activity is exhibited in vivo, 
30 that is, under physiologic pH, temperature, and ionic strength. Suitable agonists or 
antagonists will exhibit strong inhibition or enhancement of the native activity at 
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concentrations that do not cause toxic side effects in the subject. Agonists or antagonists that 

compete for binding to the native polypeptide can require concentrations equal to or greater 

than the native concentration, while inhibitors capable of binding irreversibly to the 

polypeptide can be added in concentrations on the order of the native concentration. 

5 The end results of such screening and experimentation will be at least one novel 

polypeptide binding partner, such as a receptor, encoded by a gene or a cDNA corresponding 

to a polynucleotide of the invention, and at least one peptide agonist or antagonist of the 

novel binding partner. Such agonists and antagonists can be used to modulate, enhance, or 

inhibit receptor function in cells to which the receptor is native, or in cells that possess the 

10 receptor as a result of genetic engineering. Further, if the novel receptor shares biologically 

important characteristics with a known receptor, information about agonist/antagonist 

binding can facilitate development of improved agonists/antagonists of the known receptor. 

H. Pharmaceutical Compositions and Therapeutic Uses 

Pharmaceutical compositions can comprise polypeptides, antibodies, or 

15 polynucleotides of the claimed invention. The pharmaceutical compositions will comprise a 

therapeutically effective amount of either polypeptides, antibodies, or polynucleotides of the 

claimed invention. 

The term "therapeutically effective amount" as used herein refers to an amount of a 
therapeutic agent to treat, ameliorate, or prevent a desired disease or condition, or to exhibit a 

20 detectable therapeutic or preventative effect. The effect can be detected by, for example, 
chemical markers or antigen levels. Therapeutic effects also include reduction in physical 
symptoms, such as decreased body temperature. The precise effective amount for a subject 
will depend upon the subject's size and health, the nature and extent of the condition, and the 
therapeutics or combination of therapeutics selected for administration. Thus, it is not useful 

25 to specify an exact effective amount in advance. However, the effective amount for a given 
situation is determined by routine experimentation and is within the judgment of the 
clinician. For purposes of the present invention, an effective dose will generally be from 
about 0.01 mg/ kg to 50 mg/kg or 0.05 mg/kg to about 10 mg/kg of the DNA constructs in 
the individual to which it is administered. 

30 A pharmaceutical composition can also contain a pharmaceutically acceptable carrier. 

The term "pharmaceutically acceptable carrier" refers to a carrier for administration of a 
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therapeutic agent, such as antibodies or a polypeptide, genes, and other therapeutic agents. 

The term refers to any pharmaceutical carrier that does not itself induce the production of 

antibodies harmful to the individual receiving the composition, and which can be 

administered without undue toxicity. Suitable carriers can be large, slowly metabolized 

5 macromolecules such as proteins, polysaccharides, polylactic acids, polyglycolic acids, 

polymeric amino acids, amino acid copolymers, and inactive virus particles. Such carriers 

are well known to those of ordinary skill in the art. 

Pharmaceutically acceptable salts can be used therein, for example, mineral acid salts 

such as hydrochlorides, hydrobromides, phosphates, sulfates, and the like; and the salts of 

10 organic acids such as acetates, propionates, malonates, benzoates, and the like. A thorough 
discussion of pharmaceutically acceptable excipients is available in Remington's 
Pharmaceutical Sciences (Mack Pub. Co., N.J. 1991). 

Pharmaceutically acceptable carriers in therapeutic compositions can include liquids 
such as water, saline, glycerol and ethanol. Auxiliary substances, such as wetting or 

15 emulsifying agents, pH buffering substances, and the like, can also be present in such 

vehicles. Typically, the therapeutic compositions are prepared as injectables, either as liquid 
solutions or suspensions; solid forms suitable for solution in, or suspension in, liquid 
vehicles prior to injection can also be prepared. Liposomes are included within the 
definition of a pharmaceutically acceptable carrier. 

20 Delivery Methods. Once formulated, the compositions of the invention can be 

(1) administered directly to the subject (e.g., as polynucleotide or polypeptides); (2) 
delivered ex vivo, to cells derived from the subject (e.g., as in ex vivo gene therapy); or (3) 
delivered in vitro for expression of recombinant proteins (e.g. , polynucleotides). Direct 
delivery of the compositions will generally be accomplished by injection, either 

25 subcutaneously, intraperitoneally, intravenously or intramuscularly, or delivered to the 
interstitial space of a tissue. The compositions can also be administered into a tumor or 
lesion. Other modes of administration include oral and pulmonary administration, 
suppositories, and transdermal applications, needles, and gene guns or hyposprays. Dosage 
treatment can be a single dose schedule or a multiple dose schedule. 

30 Methods for the ex vivo delivery and reimplantation of transformed cells into a 

subject are known in the art and described in e.g., International Publication No. WO 
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93/14778. Examples of cells useful in ex vivo applications include, for example, stem cells, 

particularly hematopoetic, lymph cells, macrophages, dendritic cells, or tumor cells. 

Generally, delivery of nucleic acids for both ex vivo and in vitro applications can be 

accomplished by, for example, dextran-mediated transfection, calcium phosphate 

5 precipitation, polybrene mediated transfection, protoplast fusion, electroporation, 

encapsulation of the polynucleotide(s) in liposomes, and direct microinjection of the DNA 

into nuclei, all well known in the art. 

Once a gene corresponding to a polynucleotide of the invention has been found to 

correlate with a proliferative disorder, such as neoplasia, dysplasia, and hyperplasia, the 

1 0 disorder can be amenable to treatment by administration of a therapeutic agent based on the 
provided polynucleotide or corresponding polypeptide. 

Preparation of antisense polynucleotides is discussed above. Neoplasias that are 
treated with the antisense composition include, but are not limited to, cervical cancers, 
melanomas, colorectal adenocarcinomas, Wilms' tumor, retinoblastoma, sarcomas, 

15 myosarcomas, lung carcinomas, leukemias, such as chronic myelogenous leukemia, 

promyelocyte leukemia, monocytic leukemia, and myeloid leukemia, and lymphomas, such 
as histiocytic lymphoma. Proliferative disorders that are treated with the therapeutic 
composition include disorders such as anhydric hereditary ectodermal dysplasia, congenital 
alveolar dysplasia, epithelial dysplasia of the cervix, fibrous dysplasia of bone, and 

20 mammary dysplasia. Hyperplasias, for example, endometrial, adrenal, breast, prostate, or 
thyroid hyperplasias or pseudoepitheliomatous hyperplasia of the skin, are treated with 
antisense therapeutic compositions based upon a polynucleotide of the invention. Even in 
disorders in which mutations in the corresponding gene are not implicated, downregulation 
or inhibition of expression of a gene corresponding to a polynucleotide of the invention can 

25 have therapeutic application. For example, decreasing gene expression can help to suppress 
tumors in which enhanced expression of the gene is implicated. 

Both the dose of the antisense composition and the means of administration are 
determined based on the specific qualities of the therapeutic composition, the condition, age, 
and weight of the patient, the progression of the disease, and other relevant factors. 

30 Administration of the therapeutic antisense agents of the invention includes local or systemic 
administration, including injection, oral administration, particle gun or catheterized 
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administration, and topical administration. Preferably, the therapeutic antisense composition 

contains an expression construct comprising a promoter and a polynucleotide segment of at 

least 12, 22, 25, 30, or 35 contiguous nucleotides of the antisense strand of a polynucleotide 

disclosed herein. Within the expression construct, the polynucleotide segment is located 

5 downstream from the promoter, and transcription of the polynucleotide segment initiates at 

the promoter. 

Various methods are used to administer the therapeutic composition directly to a 
specific site in the body. For example, a small metastatic lesion is located and the 
therapeutic composition injected several times in several different locations within the body 

10 of tumor. Alternatively, arteries which serve a tumor are identified, and the therapeutic 

composition injected into such an artery, in order to deliver the composition directly into the 
tumor. A tumor that has a necrotic center is aspirated and the composition injected directly 
into the now empty center of the tumor. The antisense composition is directly administered 
to the surface of the tumor, for example, by topical application of the composition. X-ray 

1 5 imaging is used to assist in certain of the above delivery methods. 

Receptor-mediated targeted delivery of therapeutic compositions containing an 
antisense polynucleotide, subgenomic polynucleotides, or antibodies to specific tissues is 
also used. Receptor-mediated DNA delivery techniques are described in, for example, 
Findeis et al 9 Trends Biotechnol. (1993) 77:202; Chiou et al, Gene Therapeutics: Methods 

20 And Applications Of Direct Gene Transfer (J.A. Wolff, ed.) (1 994); Wu et ai, J. Biol. Chem. 
(1988) 255:621; Wu et ai, J. Biol Chem. (1994) 269:542; Zenke et ah, Proc. Natl Acad 
Sci. (USA) (1990) 57:3655; Wu et al 9 J. Biol Chem. (1991) 256:338. Preferably, receptor- 
mediated targeted delivery of therapeutic compositions containing antibodies of the 
invention is used to deliver the antibodies to specific tissue. 

25 Therapeutic compositions containing antisense subgenomic polynucleotides are 

administered in a range of about 100 ng to about 200 mg of DNA for local administration in 
a gene therapy protocol. Concentration ranges of about 500 ng to about 50 mg, about 1 [ig to 
about 2 mg, about 5 jag to about 500 ^ig, and about 20 ^g to about 100 jig of DNA can also 
be used during a gene therapy protocol. Factors such as method of action and efficacy of 

30 transformation and expression are considerations which will affect the dosage required for 
ultimate efficacy of the antisense subgenomic polynucleotides. Where greater expression is 
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desired over a larger area of tissue, larger amounts of antisense subgenomic polynucleotides 

or the same amounts readministered in a successive protocol of administrations, or several 

administrations to different adjacent or close tissue portions of, for example, a tumor site, 

may be required to effect a positive therapeutic outcome. In all cases, routine 

5 experimentation in clinical trials will determine specific ranges for optimal therapeutic 

effect. A more complete description of gene therapy vectors, especially retroviral vectors, is 

contained in U.S. Serial No. 08/869,309, which is expressly incorporated herein, and in 

section G below. 

For polynucleotide-related genes encoding polypeptides or proteins with anti- 
10 inflammatory activity, suitable use, doses, and administration are described in U.S. Patent 
No. 5,654,173. Therapeutic agents also include antibodies to proteins and polypeptides 
encoded by the polynucleotides of the invention and related genes, as described in U.S. 
Patent No. 5,654,173. 

I. Gene Therapy 

1 5 The therapeutic polynucleotides and polypeptides of the present invention can be 

utilized in gene delivery vehicles. The gene delivery vehicle can be of viral or non-viral 
origin (see generally, Jolly, Cancer Gene Therapy (1994) 7:51; Kimura, Human Gene 
Therapy (1994) 5:845; Connelly, Human Gene Therapy (1995) 7:185; and Kaplitt, Nature 
Genetics (1994) 5:148). Gene therapy vehicles for delivery of constructs including a coding 

20 sequence of a therapeutic of the invention can be administered either locally or systemically. 
These constructs can utilize viral or non-viral vector approaches. Expression of such coding 
sequences can be induced using endogenous mammalian or heterologous promoters. 
Expression of the coding sequence can be either constitutive or regulated. 

The present invention can employ recombinant retroviruses which are constructed to 

25 carry or express a selected nucleic acid molecule of interest. Retrovirus vectors that can be 
employed include those described in EP 0 415 731; WO 90/07936; WO 94/03622; WO 
93/25698; WO 93/25234; U.S. Patent No. 5, 219,740; WO 93/1 1230; WO 93/10218; Vile 
and Hart, Cancer Res. (1993) 55:3860; Vile et al, Cancer Res. (1993) 53:962; Ram et al., 
Cancer Res. (1993) 55:83; Takamiya et al, J. NeuroscL Res. (1992) 35:493; Baba et al., J. 

30 Neurosurg. (1993) 79:729; U.S. Patent No. 4,777,127; GB Patent No. 2,200,65 1 ; and EP 0 
345 242. Preferred recombinant retroviruses include those described in WO 91/02805. 
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Packaging cell lines suitable for use with the above-described retroviral vector 

constructs can be readily prepared (see, e.g., WO 95/30763 and WO 92/05266), and used to 

create producer cell lines (also termed vector cell lines) for the production of recombinant 

vector particles. Within particularly preferred embodiments of the invention, packaging cell 

5 lines are made from human (such as HT1080 cells) or mink parent cell lines, thereby 

allowing production of recombinant retroviruses that can survive inactivation in human 

serum. 

The present invention also employs alphavirus-based vectors that can function as 
gene delivery vehicles. Such vectors can be constructed from a wide variety of alphaviruses, 

1 0 including, for example, Sindbis virus vectors, Semliki forest virus (ATCC VR-67; ATCC 
VR-1247), Ross River virus (ATCC VR-373; ATCC VR-1246) and Venezuelan equine 
encephalitis virus (ATCC VR-923; ATCC VR-1250; ATCC VR 1249; ATCC VR-532). 
Representative examples of such vector systems include those described in U.S. Patent Nos. 
5,091,309; 5,217,879; and 5,185,440; WO 92/10578; WO 94/21792; WO 95/27069; WO 

15 95/27044; and WO 95/07994. Gene delivery vehicles of the present invention can also 

employ parvovirus such as adeno-associated virus (AAV) vectors. Representative examples 
include the AAV vectors disclosed by Srivastava in WO 93/09239, Samulski et al., J. Virol 
(1989) 65:3822; Mendelson et al., Virol (1988) 166:1 54; and Flotte et al, PNAS (1993) 
P0:1O613. 

20 Representative examples of adenoviral vectors include those described by Berkner, 

Biotechniques (1988) <J:616; Rosenfeld et al, Science (1991) 252:431; WO 93/19191; Kolls 
et al , PNAS (1994) 91:2\ 5; Kass-Eisler et al, PNAS (1 993) 90: 1 1498; Guzman et al, 
Circulation (1993) 55:2838; Guzman et al., Cir. Res. (1993) 75:1202; Zabner et al, Cell 
(1993) 75:207; Li et al, Hum. Gene Ther. (1993) 4:403; Cailaud et al, Eur. J. NeuroscL 

25 (1993) 5:1287; Vincent et al, Nat. Genet. (1993) 5:130; Jaffe et al, Nat. Genet. (1992) 

7:372; and Levrero et al, Gene (1991) 707:195. Exemplary adenoviral gene therapy vectors 
employable in this invention also include those described in WO 94/12649, WO 93/03769; 
WO 93/19191; WO 94/28938; WO 95/1 1984 and WO 95/00655. Administration of DNA 
linked to killed adenovirus as described in Curiel, Hum. Gene Ther. (1992) 5:147 can be 

30 employed. 
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Other gene delivery vehicles and methods can be employed, including polycationic 

condensed DNA linked or unlinked to killed adenovirus alone, for example CurieL Hum. 

Gene Then (1992) 5:147; ligand linked DNA, for example see Wu, J. Biol Chem. (1989) 

264:16985; eukaryotic cell delivery vehicles cells, for example see U.S. Pat. No. 5,814,482; 

5 WO 95/07994; WO 96/17072; WO 95/30763; and WO 97/42338; deposition of 

photopolymerized hydrogel materials; hand-held gene transfer particle gun, as described in 

U.S. Patent No. 5,149,655; ionizing radiation as described in U.S. Patent No. 5,206,152 and 

in W092/1 1033; nucleic charge neutralization or fusion with cell membranes. Additional 

approaches are described in Philip, Mol Cell Biol (1994) 74:241 1, and in Woffendin, Proc, 

1 0 Nail Acad Sci. ( 1 994) 91 : 1 58 1 . 

Naked DNA can also be employed. Exemplary naked DNA introduction methods are 
described in WO 90/1 1092 and U.S. Patent No. 5,580,859. Uptake efficiency can be 
improved using biodegradable latex beads. DNA coated latex beads are efficiently 
transported into cells after endocytosis initiation by the beads. The method can be improved 

15 further by treatment of the beads to increase hydrophobicity and thereby facilitate disruption 
of the endosome and release of the DNA into the cytoplasm. Liposomes that can act as gene 
delivery vehicles are described in U.S. Patent No. 5,422,120; WO 95/13796; WO 94/23697; 
WO 91/14445; and EP 0524968. 

Further non-viral delivery suitable for use includes mechanical delivery systems such 

20 as the approach described in Woffendin et aL, Proc. Natl Acad Scl USA (1994) 

97(24):! 1581. Moreover, the coding sequence and the product of expression of such can be 
delivered through deposition of photopolymerized hydrogel materials. Other conventional 
methods for gene delivery that can be used for delivery of the coding sequence include, for 
example, use of hand-held gene transfer particle gun, as described in U.S. Patent No. 

25 5,149,655; use of ionizing radiation for activating transferred gene, as described in U.S. 
Patent No. 5,206,152 and WO 92/1 1033. 

The present invention will now be illustrated by reference to the following examples 
which set forth particularly advantageous embodiments. However, it should be noted that 
these embodiments are illustrative and are not to be construed as restricting the invention in 

30 any way. 
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EXAMPLES 

The present invention is now illustrated by reference to the following examples 
which set forth particularly advantageous embodiments. However, these embodiments are 
illustrative and are not meant to be construed as restricting the invention in any way. 

5 

Example 1 : Source of Biological Materials and Overview of Novel Polynucleotides 
Expressed by the Biological Materials 
Human colon cancer cell line Kml2L4-A (Morika, W. A. K. et al., Cancer Research 
(1988) 45:6863) was used to construct a cDNA library from mRNA isolated from the cells. 

1 0 As described in the above overview, a total of 4,693 sequences expressed by the Kml2L4-A 
cell line were isolated and analyzed; most sequences were about 275-300 nucleotides in 
length. The KM12L4-A cell line is derived from the KM12C cell line. The KM12C cell 
line, which is poorly metastatic (low metastatic) was established in culture from a Dukes' 
stage B 2 surgical specimen (Morikawa et al Cancer Res. (1988) 45:6863). The KML4-A is 

15 a highly metastatic subline derived from KM12C (Yeatman et al Nucl Acids. Res. (1995) 
25:4007; Bao-Ling et al Proc. Annu. Meet Am. Assoc. Cancer Res. (1995) 27:3269). The 
KM12C and KM12C-derived cell lines {e.g., KM12L4, KM12L4-A, etc.) are well- 
recognized in the art as a model cell line for the study of colon cancer (see, e.g., Moriakawa 
etal, supra; Radinsky et al Clin. Cancer Res. (1995) 7:19; Yeatman et al, (1995) supra; 

20 Yeatman et al Clin. Exp. Metastasis (1996) 74:246). 

The sequences were first masked to eliminate low complexity sequences using the XBLAST 
masking program (Claverie "Effective Large-Scale Sequence Similarity Searches," In: 
Computer Methods for Macromolecular Sequence Analysis . Doolittle, ed., Metk Enzymol 
266:212-227 Academic Press, NY, NY (1996); see particularly Claverie, in "Automated 

25 DNA Sequencing and Analysis Techniques" Adams et al, eds., Chap. 36, p. 267 Academic 
Press, San Diego, 1994 and Claverie etal Comput. Chem. (1993)17:191 ). Generally, 
masking does not influence the final search results, except to eliminate of relative little 
interest due to their lox complexity, and to eliminate multiple "hits" based on similarity to 
repetitive regions common to multiple sequences, e.g., Alu repeats. Masking resulted in the 

30 elimination of 43 sequences. The remaining sequences were then used in a BLASTN vs. 
Genbank search with search parameters of greater than 70% overlap, 99% identity, and a p 
value of less than 1 x 10" 40 , which search resulted in the discarding of 1,432 sequences. 
Sequences from this search also were discarded if the inclusive parameters were met, but the 
sequence was ribosomal or vector-derived. 
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The resulting sequences from the previous search were classified into three groups 

(1,2 and 3 below) and searched in a BLASTX vs. NRP (non-redundant proteins) database 

search: (1) unknown (no hits in the Genbank search), (2) weak similarity (greater than 45% 

identity and p value of less than 1 x 10" 5 ), and (3) high similarity (greater than 60% overlap, 

5 greater than 80% identity, and p value less than 1 x 10" 5 ). This search resulted in discard of 

98 sequences as having greater than 70% overlap, greater than 99% identity, and p value of 

less thanlxlO" 40 . 

The remaining sequences were classified as unknown (no hits), weak similarity, and 
high similarity (parameters as above). Two searches were performed on these sequences. 

10 First, a BLAST vs. EST database search resulted in discard of 1771 sequences (sequences 
with greater than 99% overlap, greater than 99% similarity and a p value of less than 1x10" 
40 ; sequences with a p value of less than 1 x 10' 65 when compared to a database sequence of 
human origin were also excluded). Second, a BLASTN vs. Patent GeneSeq database 
resulted in discard of 15 sequences (greater than 99% identity; p value less than 1 x 10" 40 ; 

1 5 greater than 99% overlap). 

The remaining sequences were subjected to screening using other rules and 
redundancies in the dataset. Sequences with a p value of less than 1 x 10 in relation to a 
database sequence of human origin were specifically excluded. The final result provided the 
404 sequences listed in the accompanying Sequence Listing. The Sequence Listing is 

20 arranged beginning with sequences with no similarity to any sequence in a database 
searched, and ending with sequences with the greatest similarity. Each identified 
polynucleotide represents sequence from at least a partial mRNA transcript. Polynucleotides 
that were determined to be novel were assigned a sequence identification number. 

The novel polynucleotides and were assigned sequence identification numbers SEQ 

25 ID NOS: 1-404. The DNA sequences corresponding to the novel polynucleotides are 
provided in the Sequence Listing. The majority of the sequences are presented in the 
Sequence Listing in the 5' to 3' direction. A small number, 25, are listed in the Sequence 
Listing in the 5' to 3' direction but the sequence as written is actually 3' to 5\ These 
sequences are readily identified with the designation "AR" in the Sequence Name in Table 1 

30 (inserted before the claims). The sequences correctly listed in the 5' to 3' direction in the 
Sequence Listing are designated "AF." The Sequence Listing filed herewith therefore 
contains 25 sequences listed in the reverse order, namely SEQ ID NOS:47, 97, 137, 171, 
173, 179, 182, 194,200, 202,213,227, 258, 264, 275,302,313,324, 329, 330, 331,338, 
358, 379, and 404. 
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Because the provided polynucleotides represent partial mRNA transcripts, two or 

more polynucleotides of the invention may represent different regions of the same mRNA 

transcript and the same gene. Thus, if two or more SEQ ID NOS: are identified as belonging 

to the same clone, then either sequence can be used to obtain the full-length mRNA or gene. 

5 In order to confirm the sequences of SEQ ID NOS : 1 -404, inserts of the clones 

corresponding to these polynucleotides were re-sequenced. These "validation" sequences 

are provided in SEQ ID NOS:405-800. These validation sequences were often longer than 

the original polynucleotide sequences. They validate, and thus often provide additional 

sequence information. Validation sequences can be correlated with the original sequences 

1 0 they validate by identifying those sequences of SEQ ID NOS : 1 -404 and the validation 

sequences of SEQ ID NOS:405-800 that share the same clone name in Table 1 . 

Example 2: Results of Public Database Search to Identify Function of Gene Products 

SEQ ID NOS: 1-404, as well as the validation sequences SEQ ID NOS:405-800, were 

15 translated in all three reading frames to determine the best alignment with the individual 

sequences. These amino acid sequences and nucleotide sequences are referred, generally, as 
query sequences, which are aligned with the individual sequences. Query and individual 
sequences were aligned using the BLAST programs, available over the world wide web at 
http://ww.ncbi.nlm.nih.gov/BLAST/ . Again the sequences were masked to various extents 

20 to prevent searching of repetitive sequences or poly-A sequences, using the XBLAST 
program for masking low complexity as described above in Example 1 . 

Table 2 (inserted before the claims) shows the results of the alignments. Table 2 
refers to each sequence by its SEQ ID NO:, the accession numbers and descriptions of 
nearest neighbors from the Genbank and Non-Redundant Protein searches, and the p values 

25 of the search results. Table 1 identifies each SEQ ID NO: by SEQ name, clone ID, and 

cluster. As discussed above, a single cluster includes polynucleotides representing the same 
gene or gene family, and generally represents sequences encoding the same gene product. 

For each of SEQ ID NOS: 1-800, the best alignment to a protein or DNA sequence is 
included in Table 2. The activity of the polypeptide encoded by SEQ ID NOS: 1-800 is the 

30 same or similar to the nearest neighbor reported in Table 2. The accession number of the 
nearest neighbor is reported, providing a reference to the activities exhibited by the nearest 
neighbor. The search program and database used for the alignment also are indicated as well 
as a calculation of the p value. 
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Full length sequences or fragments of the polynucleotide sequences of the nearest 

neighbors can be used as probes and primers to identify and isolate the full length sequence 

of SEQ ID NOS: 1 -800. The nearest neighbors can indicate a tissue or cell type to be used to 

construct a library for the full-length sequences of SEQ ID NOS: 1-800. 

5 SEQ ID NOS : 1-800 and the translations thereof may be human homologs of known 

genes of other species or novel allelic variants of known human genes. In such cases, these 

new human sequences are suitable as diagnostics or therapeutics. As diagnostics, the human 

sequences SEQ ID NOS: 1-800 exhibit greater specificity in detecting and differentiating 

human cell lines and types than homologs of other species. The human polypeptides 

10 encoded by SEQ ID NOS: 1-800 are likely to be less immunogenic when administered to 

humans than homologs from other species. Further, on administration to humans, the 

polypeptides encoded by SEQ ID NOS: 1-800 can show greater specificity or can be better 

regulated by other human proteins than are homologs from other species. 

15 Example 3: Members of Protein Families 

After conducting a profile search as described in the specification above, several of the 
polynucleotides of the invention were found to encode polypeptides having characteristics of 
a polypeptide belonging to a known protein families (and thus represent new members of 
these protein families) and/or comprising a known functional domain (Table 3). Thus the 

20 invention encompasses fragments, fusions, and variants of such polynucleotides that retain 
biological activity associated with the protein family and/or functional domain identified 
herein. 

Table 3 Polynucleotides encoding gene products of a protein family or having a known 



functional domain(s). 



SEQ ID 

NO: 


Biological Activity (Profile hit) 


Start 


Stop 


Dir 


24 


4 transmembrane segments integral membrane proteins 


1218 


578 


rev 


41 


4 transmembrane segments integral membrane proteins 


1086 


413 


rev 


101 


4 transmembrane segments integral membrane proteins 


1206 


544 


rev 


157 


4 transmembrane segments integral membrane proteins 


721 


33 


rev 


341 


4 transmembrane segments integral membrane proteins 


1253 


613 


rev 


395 


4 transmembrane segments integral membrane proteins 


530 


10 


for 


395 


4 transmembrane segments integral membrane proteins 


696 


17 


for 


395 


4 transmembrane segments integral membrane proteins 


471 


39 


rev 


24 


7 transmembrane receptor (Secretin family) 


1301 


491 


rev 


41 


7 transmembrane receptor (Secretin family) 


1309 


10 


rev 


101 


7 transmembrane receptor (Secretin family) 


1330 


296 


rev 


157 


7 transmembrane receptor (Secretin family) 


1173 


249 


rev 


291 


7 transmembrane receptor (Secretin family) 


1400 


269 


rev 
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Table 3 Polynucleotides encoding gene products of a protein family or having a known 



functional domain(s). 



SEQID 
NO: 


Biological Activity (Profile hit) 


Start 


Stop 


Dir 


291 


7 transmembrane receptor (Secretin family) 


712 


130 


for 


305 


7 transmembrane receptor (Secretin family) 


926 


4 


for 


305 


7 transmembrane receptor (Secretin family) 


753 


55 


rev 


315 


7 transmembrane receptor (Secretin family) 


1058 


270 


rev 


341 


7 transmembrane receptor (Secretin family) 


1265 


534 


rev 


116 


Ank repeat 


141 


218 


for 


251 


Ank repeat 


290 


207 


for 


251 


Ank repeat 


467 


387 


for 


63 


ATPases Associated with Various Cellular Activities 


543 


60 


for 


116 


ATPases Associated with Various Cellular Activities 


802 


313 


for 


134 


ATPases Associated with Various Cellular Activities 


525 


57 


rev 


136 


ATPases Associated with Various Cellular Activities 


712 


163 


for 


151 


ATPases Associated with Various Cellular Activities 


719 


73 


for 


151 


ATPases Associated with Various Cellular Activities 


386 


13 


for 


384 


ATPases Associated with Various Cellular Activities 


664 


140 


for 


404 


ATPases Associated with Various Cellular Activities 


704 


52 


for 


374 |Basic region plus leucine zipper transcription factors 


298 


146 


for 


97 


Bromodomain (conserved sequence found in human, 
Drosophila and yeast proteins.) 


230 


63 


for 


136 


EF-hand 


121 


207 


.tu- 
tor 


242 


EF-hand 


238 


155 


ror 


379 


EF-hand 


212 


126 


fXr 

ror 


308 |Eukaryotic aspartyl proteases 


1300 


461 


rev 


2 1 3 |GATA family of transcription factors 


720 377 


for 


367 


G-protein alpha subunit 


971 


467 


rev 


188 


Phorbol esters/diacylglycerol binding 


91 


177 


for 


251 


Phorbol esters/diacylglycerol binding 


133 


219 


for 


202 


protein kinase 


482 


1 


rev 


202 


protein kinase 


970 


1 


rev 


315 


protein kinase 


739 


158 


for 


315 


protein kinase 


1023 


197 


for 


367 


protein kinase 


1046 


285 


rev 


397 


protein kinase 


511 


6 


i or 


256 


Protein phosphatase 2C 


13 


90 


tu- 
tor 


256 


Protein phosphatase 2C 


163 


86 


ior 


382 


Protein Tyrosine Phosphatase 


261 


2 


tor 


306 


SH3 Domain 


141 


296 


for 


386 


SH3 Domain 


359 


209 


for 


169 


Trypsin 


764 


164 


rev 


188 


WD domain, G-beta repeats 


480 


382 


for 


188 


WD domain, G-beta repeats 


206 


117 


for 


335 


WD domain, G-beta repeats 


3 


92 


for 


23 


wnt family of developmental signaling proteins 


1151 


335 


rev 


291 


wnt family of developmental signaling proteins 


779 


89 


rev 


291 


wnt family of developmental signaling proteins 


1347 


382 


rev 


324 


wnt family of developmental signaling proteins 


1180 


499 


rev 


330 


wnt family of developmental signaling proteins 


1180 


499 


rev 


341 


wnt family of developmental signaling proteins 


1399 


560 


rev 
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Table 3 Polynucleotides encoding gene products of a protein family or having a known 



functional domain(s). 



SEQ ID 
NO: 


Biological Activity (Profile hit) 


Start 


Stop 


Dir 


353 


wnt family of developmental signaling proteins 


880 


49 


rev 


188 


WW/rsp5/WWP domain containing proteins 


431 


354 


for j 


379 


WW/rsp5/WWP domain containing proteins 


12 


89 


for | 


395 


WW/rsp5/WWP domain containing proteins 


153 


76 


fu- 
ror 


395 


WW/rsp5/WWP domain containing proteins 


156 


64 


for j 


61 


Zinc finger, C2H2 type 


254 


192 


for | 


306 


Zinc finger, C2H2 type 


428 


367 


for 1 


386 


Zinc finger, C2H2 type 


191 


253 


for I 


322 |Zinc finger, CCHC class 


553 


503 


for j 


306 


Zinc-binding metalloprotease domain 


101 


60 


rev 


395 


Zinc-binding metalloprotease domain 


28 


69 


rev I 

i 



Start and stop indicate the position within the individual sequenes that align with the 
query sequence having the indicated SEQ ID NO. The direction (Dir) indicates the 
orientation of the query sequence with respect to the individual sequence, where forward 
5 (for) indicates that the alignment is in the same direction (left to right) as the sequence 
provided in the Sequence Listing and reverse (rev) indicates that the alignment is with a 
sequence complementary to the sequence provided in the Sequence Listing. 

Some polynucleotides exhibited multiple profile hits because, for example, the particular 
10 sequence contains overlapping profile regions, and/or the sequence contains two different 
functional domains. These profile hits are described in more detail below. 

a) Four Transmembrane Integral Membrane Proteins. SEQ ID NOS: 24, 41, 101, 
157, 341, and 395 correspond to a sequence encoding a polypeptide that is a member of the 4 
transmembrane segments integral membrane protein family (transmembrane 4 family). The 
1 5 transmembrane 4 family of proteins includes a number of evolutionarily-related eukaryotic 
cell surface antigens (Levy et al y J. Biol Chem., (1991) 265:14597; Tomlinson etai, Eur. J. 
Immunol. (1993) 25:136; Barclay etai The leucocyte antigen factbooks. (1993) Academic 
Press, London/San Diego). The proteins belonging to this family include: 1) Mammalian 
antigen CD9 (MIC3), which is involved in platelet activation and aggregation; 2) 
20 Mammalian leukocyte antigen CD37, expressed on B lymphocytes; 3) Mammalian 

leukocyte antigen CD53 (OX-44), which is implicated in growth regulation in hematopoietic 
cells; 4) Mammalian lysosomal membrane protein CD63 (melanoma-associated antigen 
ME491; antigen AD1); 5) Mammalian antigen CD81 (cell surface protein TAPA-1), which 
is implicated in regulation of lymphoma cell growth; 6) Mammalian antigen CD82 (protein 
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R2; antigen C33; Kangai 1 (KAI1)), which associates with CD4 or CD8 and delivers 

costimulatory signals for the TCR/CD3 pathway; 7) Mammalian antigen CD151 (SFA-1; 

platelet-endothelial tetraspan antigen 3 (PETA-3)); 8) Mammalian cell surface glycoprotein 

A15 (TALL A- 1 ; MXS1); 9) Mammalian novel antigen 2 (NAG-2); 10) Human tumor- 

5 associated antigen CO-029; 1 1) Schistosoma mansoni and japonicum 23 Kd surface antigen 

(SM23 / SJ23). 

The members of the 4 transmembrane family share several characteristics. First, they 
all are apparently type III membrane proteins, which are integral membrane proteins 
containing an N-terminal membrane-anchoring domain which is not cleaved during 

10 biosynthesis and which functions both as a translocation signal and as a membrane anchor. 
The family members also contain three additional transmembrane regions, at least seven 
conserved cysteines residues, and are of approximately the same size (218 to 284 residues). 
These proteins are collectively know as the "transmembrane 4 superfamily" (TM4) because 
they span plasma membrane four times. A schematic diagram of the domain structure of 

1 5 these proteins is as follows: 

+-+ + + — + + + + + — + 

1 1 TMa | Extra | TM2| Cyt | TM3 | Extracellular | TM4 | Cyt| 

+-+ + + — C C + CC— C C— + C — + 

********* 

20 where Cyt is the cytoplasmic domain, TMa is the transmembrane anchor; TM2 to TM4 
represents transmembrane regions 2 to 4, 'C are conserved cysteines, and •* Indicates the 
position of the consensus pattern. The consensus pattern spans a conserved region including 
two cysteines located in a short cytoplasmic loop between two transmembrane domains: 
Consensus pattern: G-x(3)-[LIVMF].x(2)-[GSA]-[LIVMF](2)-G-C-x-[GA]-[STA]- x(2)- 

25 [EG]-x(2)-[CWN]-[LIVM](2). 

b) Seven Transmembrane Integral Membrane Proteins. SEQ IDNOS: 24, 41, 101, 
157, 291, 305, 315, and 341 correspond to a sequence encoding a polypeptide that is a 
member of the seven transmembrane receptor family. G-protein coupled receptors 
(Strosberg, Eur. J. Biochem. (1991) 196:1; Kerlavage, Curr. Opin. Struct. Biol. (1991) 

30 7:394; and Probsts al. 9 DNACell Biol (1992) 77:1; and Savarese etal, Biochem. J. (1992) 

293:1) (also called R7G) are an extensive group of hormones, neurotransmitters, odorants 

and light receptors which transduce extracellular signals by interaction with guanine 

nucleotide-binding (G) proteins. The tertiary structure of these receptors is thought to be 

highly similar. They have seven hydrophobic regions, each of which most probably spans 
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the membrane. The N-terminus is located on the extracellular side of the membrane and is 

often glycosylated, while the C-terminus is cytoplasmic and generally phosphorylated. 

Three extracellular loops alternate with three intracellular loops to link the seven 

transmembrane regions. Most, but not all of these receptors, lack a signal peptide. The most 

5 conserved parts of these proteins are the transmembrane regions and the first two 

cytoplasmic loops. A conserved acidic-Arg-aromatic triplet is present in the N-terminal 

extremity of the second cytoplasmic loop (Attwood et al 9 Gene (1991) P5:153) and could 

be implicated in the interaction with G proteins. 

To detect this widespread family of proteins a pattern is used that contains the 

1 0 conserved triplet and that also spans the major part of the third transmembrane helix. 

Additional information about the seven transmembrane receptor family, and methods for 
their identification and use, is found in U.S. Patent No. 5,759,804. Due in part to their 
expression on the cell surface and other attractive characteristics, seven transmembrane 
protein family members are of particular interest as drug targets, as surface antigen markers, 

1 5 and as drug delivery targets (e.g. , using antibody-drug complexes and/or use of anti-seven 
transmembrane protein antibodies as therapeutics in their own right). 

c) Ank Repeats. SEQ ID NOS: 116 and 251 represent polynucleotides encoding Ank 
repeat-containing proteins. The ankyrin motif is a 33 amino acid sequence named after the 
protein ankyrin which has 24 tandem 33-amino-acid motifs. Ank repeats were originally 

20 identified in the cell-cycle-control protein cdclO (Breeden et al, Nature (1987) 329:651), 

Proteins containing ankyrin repeats include ankyrin, myotropin, I-kappaB proteins, cell cycle 
protein cdclO, the Notch receptor (Matsuno et al, Development (1997) 124(21):4265); G9a 
(or BAT8) of the class III region of the major histocompatibility complex (Biochem J. 
290:81 1-818, 1993), FABP, GABP, 53BP2, Linl2, glp-1, SW14, and SW16. The functions 

25 of the ankyrin repeats are compatible with a role in protein-protein interactions (Bork, 

Proteins (1993) 17(4):363; Lambert and Bennet, Eur. J. Biochem. (1993) 277:1; Kerr et al, 
CurrentOp. Cell Biol (1992) 4:496; Bennet etaUJ.Biol Chem.(m0) 255:6424). 

The 90 kD N-terminal domain of ankyrin contains a series of 24 33-amino-acid ank 
repeats. (Lux et al., Nature (1990) 544:36-42, Lambert et al, PNAS USA (1990) 57:1730.) 

30 The 24 ank repeats form four folded subdomains of 6 repeats each. These four repeat 

subdomains mediate interactions with at least 7 different families of membrane proteins. 

Ankyrin contains two separate binding sites for anion exchanger dimers. One site utilizes 

repeat subdomain two (repeats 7-12) and the other requires both repeat subdomains 3 and 4 

(repeats 13-24). Since the anion exchangers exist in dimers, ankyrin binds 4 anion 
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exchangers at the same time. (Michaely and Bennett, J. Biol Chem. (1995) 270(37) .22050) 

The repeat motifs are involved in ankyrin interaction with tubulin, spectrin, and other 

membrane proteins. (Lux et aL, Nature (1990) 344:36.) 

The Rel/NF-kappaB/Dorsal family of transcription factors have activity that is 
controlled by sequestration in the cytoplasm in association with inhibitory proteins referred 
to as I-kappaB. (Gilmore, Cell (1990) 52:841; Nolan and Baltimore, Curr Opin Genet Dev. 
(1992) 2:21 1; Baeuerle, Biochim Biophys Acta (1991) 1072:63; Schmitz et aL, Trends Cell 
Biol. (1991) 7:130.) I-kappaB proteins contain 5 to 8 copies of 33 amino acid ankyrin 
repeats and certain NF-kappaB/rel proteins are also regulated by cis-acting ankyrin repeat 
containing domains including pl05NF-kappaB which contains a series of ankyrin repeats 
(Diehl and Hannink, 1 Virol (1993) 67(1 2) .7161). The I-kappaBs and Cactus (also 
containing ankyrin repeats) inhibit activators through differential interactions with the Rel- 
homology domain. The gene family includes proto-oncogenes, thus broadly implicating I- 
kappaB in the control of both normal gene expression and the aberrant gene expression that 
makes cells cancerous. (Nolan and Baltimore, Curr Opin Genet Dev. (1992) 2(2) :2\ 1-220). 
In the case of rel/NF-kappaB and pp40/I-kappaBp, both the ankyrin repeats and the carboxy- 
terminal domain are required for inhibiting DNA-binding activity and direct association of 
pp40/I-kappaBp with rel/NF-kappaB protein. The ankyrin repeats and the carboxy-terminal 
of pp40/I-kappaBp ( form a structure that associates with the rel homology domain to inhibit 
DNA binding activity (Inoue et al. 9 PNAS USA (1992) 59:4333). 

The 4 ankyrin repeats in the amino terminus of the transcription factor subunit 
GABPp are required for its interaction with the GABPot subunit to form a functional high 
affinity DNA-binding protein. These repeats can be crosslinked to DNA when GABP is 
bound to its target sequence. (Thompson et al, Science (1991) 255:762; LaMarco et aL, 
Science (1991) 255:789). 

Myotrophin, a 12.5 kDa protein having a key role in the initiation of cardiac 
hypertrophy, comprises ankyrin repeats. The ankyrin repeats are characteristic of a hairpin- 
like protruding tip followed by a helix-turn-helix motif. The V-shaped helix-turn-helix of 
the repeats stack sequentially in bundles and are stabilized by compact hydrophobic cores, 
whereas the protruding tips are less ordered. 

d) ATPases Associated with Various Cellular Activities (AAA). SEQ ID NOS: 63, 
1 1 6, 1 34, 1 36, 1 5 1 , 3 84, and 404 polynucleotides encoding novel members of the "ATPases 
Associated with diverse cellular Activities" (AAA) protein family The AAA protein family 
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is composed of a large number of ATPases that share a conserved region of about 220 amino 

acids that contains an ATP-binding site (Froehlich et aL, J. Cell BioL (1991) 1 14:443; 

Erdmann et aL Cell ( 1 99 1 ) 64:499; Peters et aL, EMBO J. ( 1 990) 9: 1 757; Kunau et aL, 

Biochimie (1993) 75:209-224; Confalonieri etal, BioEssays (1995) 77:639; 

5 http://yeamob.pci.chemie.uni-tuebingen.de/AAA/Description.html). The proteins that 

belong to this family either contain one or two AAA domains. 

Proteins containing two AAA domains include: 1) Mammalian and drosophila NSF 

(N-ethylmaleimide-sensitive fusion protein) and the fungal homolog, SEC 18, which are 

involved in intracellular transport between the endoplasmic reticulum and Golgi, as well as 

10 between different Golgi cisternae; 2) Mammalian transitional endoplasmic reticulum 

ATPase (previously known as p97 or VCP), which is involved in the transfer of membranes 
from the endoplasmic reticulum to the golgi apparatus. This ATPase forms a ring-shaped 
homooligomer composed of six subunits. The yeast homolog, CDC48, plays a role in 
spindle pole proliferation; 3) Yeast protein PAS1 essential for peroxisome assembly and the 

1 5 related protein PAS 1 from Pichia pastoris; 4) Yeast protein AFG2; 5) Sulfolobus 

acidocaldarius protein S AV and Halobacterium salinarium cdcH, which may be part of a 
transduction pathway connecting light to cell division. 

Proteins containing a single AAA domain include: 1) Escherichia coli and other 
bacteria ftsH (or hflB) protein. FtsH is an ATP-dependent zinc metallopeptidase that 

20 degrades the heat-shock sigma-32 factor, and is an integral membrane protein with a large 
cytoplasmic C-terminal domain that contain both the AAA and the protease domains; 2) 
Yeast protein YME1, a protein important for maintaining the integrity of the mitochondrial 
compartment. YME1 is also a zinc-dependent protease; 3) Yeast protein AFG3 (or YTA10). 
This protein also contains an AAA domain followed by a zinc-dependent protease domain; 

25 4) Subunits from regulatory complex of the 26S proteasome (Hilt et al t Trends Biochem. 
ScL (1996) 27 :96), which is involved in the ATP-dependent degradation of ubiquitinated 
proteins, which subunits include: a) Mammalian 4 and homologs in other higher eukaryotes, 
in yeast (gene YTA5) and fission yeast (gene mts2); b) Mammalian 6 (TBP7) and homologs 
in other higher eukaryotes and in yeast (gene YTA2); c) Mammalian subunit 7 (MSS1) and 

30 homologs in other higher eukaryotes and in yeast (gene CIM5 or YTA3); d) Mammalian 

subunit 8 (P45) and homologs in other higher eukaryotes and in yeast (SUG1 or CIM3 or 

TBY1) and fission yeast (gene letl); e) Other probable subunits include human TBP1, which 

influences HIV gene expression by interacting with the virus tat transactivator protein, and 

yeast YTA1 and YTA6; 5) Yeast protein BCS1, a mitochondrial protein essential for the 
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expression of the Rieske iron-sulfur protein; 6) Yeast protein MSP1, a protein involved in 

intramitochondrial sorting of proteins; 7) Yeast protein PAS8, and the corresponding 

proteins PAS5 from Pichia pastoris and PAY4 from Yarrowia lipolytica; 8) Mouse protein 

SKD1 and its fission yeast homolog (SpAC2Gl 1.06); 9) Caenorhabditis elegans meiotic 

5 spindle formation protein mei-1; 10) Yeast protein SAP1 9 1 1) Yeast protein YTA7; and 12) 

Mycobacterium leprae hypothetical protein A2126A. 

In general, the AAA domains in these proteins act as ATP-dependent protein 

clamps(Confalonieri et al (1995) BioEssays 77:639). In addition to the ATP-binding 'A' and 

'B' motifs, which are located in the N-terminal half of this domain, there is a highly 

1 0 conserved region located in the central part of the domain which was used in the 

development of the signature pattern. The consensus pattern is: [LIVMT]-x-[LIVMT]- 

[LIVMF]-x-[GATMC]-[ST]-[NS]-x(4)-[LIVM]- D-x-A-[LIFA]-x-R. 

e) Basic Region Plus Leucine Zipper Transcription Factors. SEQ ID NO:374 
correspond to a polynucleotide encoding a novel member of the family of basic region plus 

15 leucine zipper transcription factors. The bZIP superfamily (Hurst, Protein Prof. (1995) 
2:105; and Ellenberger, Curr. Opin. Struct. Biol (1994) 4:12) of eukaryotic DNA-binding 
transcription factors encompasses proteins that contain a basic region mediating sequence- 
specific DNA-binding followed by a leucine zipper required for dimerization. Members of 
the family include transcription factor AP-1, which binds selectively to enhancer elements in 

20 the cis control regions of S V40 and metallothionein IIA. AP- 1 , also known as c-jun, is the 
cellular homolog of the avian sarcoma virus 17 (AS VI 7) oncogene v-jun. 

Other members of this protein family include jun-B and jun-D, probable 
transcription factors that are highly similar to jun/AP-1; the fos protein, a proto-oncogene 
that forms a non-covalent dimer with c-jun; the fos-related proteins fra-1, and fos B; and 

25 mammalian cAMP response element (CRE) binding proteins CREB, CREM, ATF-1, ATF- 
3, ATF-4, ATF-5, ATF-6 and LRF-1 . The consensus pattern for this protein family is: 
[KR]-x(l,3)-[RKSAQ].N-x(2)-[SAQ](2)-x-[RKTAENQ]-x-R-x-[RK]. 

f) Bromodomain. SEQ ID NO:97 corresponds to a polynucleotide encoding a 
polypeptide having a bromodomain region (Haynes et al, 1992, Nucleic Acids Res. 

30 20:2693-2603, Tamkun et al., 1992, Cell 68:561-572, and Tamkun, 1995, Curr. Opin. Genet. 

Dev. 5:473-477), which is a conserved region of about 70 amino acids found in the 

following proteins: 1) Higher eukaryotes transcription initiation factor TFIID 250 Kd 

subunit (TBP-associated factor p250) (gene CCG1); P250 is associated with the TFIID 

TATA-box binding protein and seems essential for progression of the Gl phase of the cell 
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cycle. 2) Human RING3, a protein of unknown function encoded in the MHC class II locus; 

3) Mammalian CREB-binding protein (CBP) 5 which mediates cAMP-gene regulation by 

binding specifically to phosphorylated CREB protein; 4) Mammalian homologs of brahma, 

including three brahma-like human: SNF2a(hBRM), SNF2b, and BRG1; 5) Human BS69, 

5 a protein that binds to adenovirus El A and inhibits El A transactivation; 6) Human peregrin 

(or Br 140). 

The bromodomain is thought to be involved in protein-protein interactions and may 
be important for the assembly or activity of multicomponent complexes involved in 
transcriptional activation. The consensus pattern, which spans a major part of the 
1 0 bromodomain, is: [STANVF]-x(2)-F-x(4)-[DNS]-x(5,7)-[DENQTF]- Y-[HFY]-x(2)- 

[LIVMFY]-x(3)-[LIVM]-x(4)-[LIVM]-x(6 5 8)-Y-x(12 5 13)-[LIVM]-x(2).N-[SACF]-x(2)- 
[FY]. 

g) EF-Hand. SEQ ID NOS: 1 36, 242, and 379 correspond to polynucleotides 
encoding a novel protein in the family of EF-hand proteins. Many calcium-binding proteins 

1 5 belong to the same evolutionary family and share a type of calcium-binding domain known 
as the EF-hand (Kawasaki et al., Protein. Prof. (1995) 2:305-490). This type of domain 
consists of a twelve residue loop flanked on both sides by a twelve residue alpha-helical 
domain. In an EF-hand loop the calcium ion is coordinated in a pentagonal bipyramidal 
configuration. The six residues involved in the binding are in positions 1, 3, 5, 7, 9 and 12; 

20 these residues are denoted by X, Y, Z, -Y, -X and -Z. The invariant Glu or Asp at position 
12 provides two oxygens for liganding Ca (bidentate ligand). 

Proteins known to contain EF-hand regions include: Calmodulin (Ca=4, except in 
yeast where Ca=3) ("Ca=" indicates approximate number of EF-hand regions); 
diacylglycerol kinase (EC 2.7.1.107) (DGK) (Ca=2); 2) FAD-dependent glycerol-3- 

25 phosphate dehydrogenase (EC 1 .1.99.5) from mammals (Ca=l); guanylate cyclase activating 
protein (GCAP) (Ca=3); MIF related proteins 8 (MRP-8 or CFAG) and 14 (MRP-14) 
(Ca=2); myosin regulatory light chains (Ca=l); oncomodulin (Ca=2); osteonectin (basement 
membrane protein BM-40) (SPARC); and proteins that contain an "osteonectin" domain 
(QR1, matrix glycoprotein SCI). 

30 The consensus pattern includes the complete EF-hand loop as well as the first residue 

which follows the loop and which seem to always be hydrophobic. 

Consensus pattern: D-x-[DNS]-{ILVFYW}-[DENSTG]-[DNQGHRK]-{GP}- 
[LIVMC]-[DENQSTAGC]-x(2)-pE]-[LIVMFYW] 
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h) Eukarvotic Aspartvl Proteases! SEQ ID NO:308 corresponds to a gene encoding a 

novel eukaryotic aspartyl protease. Aspartyl proteases, known as acid proteases, (EC 
3.4.23.-) are a widely distributed family of proteolytic enzymes (Foltmann B., Essays 
Biochem. (1981) 77:52; Davies D.R., Annu. Rev. Biophys. Chem. (1990) 7P:189; Rao 
5 J.K.M., et ai 9 Biochemistry (1991) 50:4663) known to exist in vertebrates, fungi, plants, 
retroviruses and some plant viruses. Aspartate proteases of eukaryotes are monomeric 
enzymes which consist of two domains. Each domain contains an active site centered on a 
catalytic aspartyl residue. The two domains most probably evolved from the duplication of 
an ancestral gene encoding a primordial domain. Currently known eukaryotic aspartyl 

10 proteases include: 1) Vertebrate gastric pepsins A and C (also known as gastricsin); 2) 
Vertebrate chymosin (rennin), involved in digestion and used for making cheese; 3) 
Vertebrate lysosomal cathepsins D (EC 3.4.23.5) and E (EC 3.4.23.34); 4) Mammalian renin 
(EC 3.4.23.15) whose function is to generate angiotensin I from angiotensinogen in the 
plasma; 5) Fungal proteases such as aspergillopepsin A (EC 3.4.23.18), candidapepsin (EC 

15 3.4.23.24), mucoropepsin (EC 3.4.23.23) (mucor rennin), endothiapepsin (EC 3.4.23.22), 
polyporopepsin (EC 3.4.23.29), and rhizopuspepsin (EC 3.4.23.21); and 6) Yeast 
saccharopepsin (EC 3.4.23.25) (proteinase A) (gene PEP4). PEP4 is implicated in 
posttranslational regulation of vacuolar hydrolases; 7) Yeast barrierpepsin (EC 3.4.23.35) 
(gene BAR1); a protease that cleaves alpha-factor and thus acts as an antagonist of the 

20 mating pheromone; and 8) Fission yeast sxal which is involved in degrading or processing 
the mating pheromones. 

Most retroviruses and some plant viruses, such as badnaviruses, encode for an 
aspartyl protease which is an homodimer of a chain of about 95 to 125 amino acids. In most 
retroviruses, the protease is encoded as a segment of a polyprotein which is cleaved during 

25 the maturation process of the virus. It is generally part of the pol polyprotein and, more 

rarely, of the gag polyprotein. Because the sequence around the two aspartates of eukaryotic 
aspartyl proteases and around the single active site of the viral proteases is conserved, a 
single signature pattern can be used to identify members of both groups of proteases. The 
consensus pattern is: [LIVMFGAC]-[LIVMTADN]-[LIVFSA]-D-[ST]-G-[STAV]- 

30 [STAPDENQ]- x-[LIVMFSTNC]-x-[LIVMFGTA], where D is the active site residue. 

i) GATA Family of Transcription Factors. SEQ ID NO:2 1 3 corresponds to a novel 

member of the GATA family of transcription factors. The GATA family of transcription 

factors are proteins that bind to DNA sites with the consensus sequence (A/T)GATA(A/G), 

found within the regulatory region of a number of genes. Proteins currently known to belong 
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to this family are: 1) GATA-1 (Trainor, CD., et al, Nature (1990) 343:92) (also known as 

Eryfl, GF-1 or NF-E1), which binds to the GATA region of globin genes and other genes 

expressed in erythroid cells. It is a transcriptional activator which probably serves as a 

general 'switch' factor for erythroid development; 2) GATA-2 (Lee, M.E., et ah, J. Biol. 

5 Chem. ( 1 99 1 ) 266: 1 6 1 88), a transcriptional activator which regulates endothelin- 1 gene 

expression in endothelial cells; 3) GATA-3 (Ho, I.-C, et al, EMBO J. (1991) 70:1 187), a 

transcriptional activator which binds to the enhancer of the T-cell receptor alpha and delta 

genes; 4) GATA-4 (Spieth, J., etal, Mol Cell Biol (1991) 77:4651), a transcriptional 

activator expressed in endodermally derived tissues and heart; 5) Drosophila protein pannier 

10 (or DGATAa) (gene pnr) which acts as a repressor of the achaete-scute complex (as-c); 6) 
Bombyx mori BCFI (Drevet, J.R., et al, J. Biol Chem. (1994) 269:10660), which regulates 
the expression of chorion genes; 7) Caenorhabditis elegans elt-1 and elt-2, transcriptional 
activators of genes containing the GATA region, including vitellogenin genes (Hawkins, 
M.G., et al, J. Biol Chem. (1995) 270:14666); 8) Ustilago maydis urbsl (Voisard, C.P.O., 

1 5 et al, Mol Cell Biol (1993) 7J:7091), a protein involved in the repression of the 
biosynthesis of siderophores; 9) Fission yeast protein GAF2. 

All these transcription factors contain a pair of highly similar 'zinc finger' type 
domains with the consensus sequence C-x2-C-xl7-C-x2-C. Some other proteins contain a 
single zinc finger motif highly related to those of the GATA transcription factors. These 

20 proteins are: 1) Drosophila box A-binding factor (ABF) (also known as protein serpent 

(gene srp)) which may function as a transcriptional activator protein and may play a key role 
in the organogenesis of the fat body; 2) Emericella nidulans are (Arst, H.N., Jr., et al, 
Trends Genet. (1989) 5:291) a transcriptional activator which mediates nitrogen metabolite 
repression; 3) Neurospora crassa nit-2 (Fu, Y.-H., etal, Mol Cell Biol (1990) 70:1056), a 

25 transcriptional activator which turns on the expression of genes coding for enzymes required 
for the use of a variety of secondary nitrogen sources, during conditions of nitrogen 
limitation; 4) Neurospora crassa white collar proteins 1 and 2 (WC-1 and WC-2), which 
control expression of light-regulated genes; 5) Saccharomyces cerevisiae DAL81 (or 
UGA43), a negative nitrogen regulatory protein; 6) Saccharomyces cerevisiae GLN3, a 

30 positive nitrogen regulatory protein; 7) Saccharomyces cerevisiae GAT1; 8) Saccharomyces 
cerevisiae GZF3. 

The consensus pattern for the GATA family is: C-x-[DN]-C-x(4,5)-[ST]-x(2)-W- 
[HR]-[RK]-x(3)-[GN]-x(3,4)-C-N-[AS]-C, where the four C's are zinc ligands. 
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j) G-Protein Alpha Subunit. SEQ ID NO:367 corresponds to a gene encoding a novel 

polypeptide of the G-protein alpha subunit family. Guanine nucleotide binding proteins (G- 

proteins) are a family of membrane-associated proteins that couple extracellularly-activated 

integral-membrane receptors to intracellular effectors, such as ion channels and enzymes that 

5 vary the concentration of second messenger molecules. G-proteins are composed of 3 

subunits (alpha, beta and gamma) which, in the resting state, associate as a trimer at the inner 

face of the plasma membrane. The alpha subunit has a molecule of guanosine diphosphate 

(GDP) bound to it. Stimulation of the G-protein by an activated receptor leads to its 

exchange for GTP (guanosine triphosphate). This results in the separation of the alpha from 

1 0 the beta and gamma subunits, which always remain tightly associated as a dimer. Both the 
alpha and beta-gamma subunits are then able to interact with effectors, either individually or 
in a cooperative manner. The intrinsic GTPase activity of the alpha subunit hydrolyses the 
bound GTP to GDP. This returns the alpha subunit to its inactive conformation and allows it 
to reassociate with the beta-gamma subunit, thus restoring the system to its resting state. 

15 G-protein alpha subunits are 350-400 amino acids in length and have molecular 

weights in the range 40-45 kDa. Seventeen distinct types of alpha subunit have been 
identified in mammals. These fall into 4 main groups on the basis of both sequence 
similarity and function: alpha-s, alpha-q, alpha-i and alpha- 12 (Simon etal, Science (1993) 
252:802). Many alpha subunits are substrates for ADP-ribosylation by cholera or pertussis 

20 toxins. They are often N-terminally acylated, usually with myristate and/or palmitoylate, 
and these fatty acid modifications are probably important for membrane association and 
high- affinity interactions with other proteins. The atomic structure of the alpha subunit of 
the G-protein involved in mammalian vision, transducin, has been elucidated in both GTP- 
and GDB-bound forms, and shows considerable similarity in both primary and tertiary 

25 structure in the nucleotide-binding regions to other guanine nucleotide binding proteins, such 
as p21-ras and EF-Tu. 

k) Phorbol Esters/Diacvlglvcerol Binding. SEQ ID NO: 1 88 and 25 1 represent 
polynucleotides encoding a protein belonging to the family including phorbol 
esters/diacylglycerol binding proteins. Diacylglycerol (DAG) is an important second 

30 messenger. Phorbol esters (PE) are analogues of DAG and potent tumor promoters that 

cause a variety of physiological changes when administered to both cells and tissues. DAG 

activates a family of serine/threonine protein kinases, collectively known as protein kinase C 

(PKC) (Azzi et ai, Eur. 1 Biochem. (1 992) 205:547). Phorbol esters can directly stimulate 

PKC. The N-terminal region of PKC, known as CI, has been shown (Ono et al. 9 Proc. Natl 
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Acad Sci. USA (1989) 55:4868) to bind PE and DAG in a phospholipid and zinc-dependent 

fashion. The CI region contains one or two copies (depending on the isozyme of PKC) of a 

cysteine-rich domain about 50 amino-acid residues long and essential for DAG/PE-binding. 

Such a domain has also been found in, for example, the following proteins. 

5 ( 1 ) Diacylglycerol kinase (EC 2.7.1.1 07) (DGK) (Sakane et al , Nature ( 1 990) 

344:345), the enzyme that converts DAG into phosphatidate. It contains two copies of the 

DAG/PE-binding domain in its N-terminal section. At least five different forms of DGK are 

known in mammals; and 

(2) N-chimaerin, a brain specific protein which shows sequence similarities with the 

10 BCR protein at its C-terminal part and contains a single copy of the DAG/PE-binding 

domain at its N-terminal part. It has been shown (Ahmed et al, Biochem. J. (1990) 272:767, 
and Ahmed et al, Biochem. J. (1991) 280:233) to be able to bind phorbol esters. 

The DAG/PE-binding domain binds two zinc ions; the ligands of these metal ions are 
probably the six cysteines and two histidines that are conserved in this domain. The 

1 5 signature pattern completely spans the DAG/PE domain. The consensus pattern is: H-x- 
[LIVMFYW]-x(8,l l)-C-x(2)^ 
C. All the C and H are probably involved in binding zinc. 

1) Protein Kinase. SEQ ID NOS:202, 315, 367, and 397 represent polynucleotides 
encoding protein kinases. Protein kinases catalyze phosphorylation of proteins in a variety of 

20 pathways, and are implicated in cancer. Eukaryotic protein kinases (Hanks S.K., et al, 
FASEBJ. (1995) 9:576; Hunter T., Metk Enzymol (1991) 200:3; Hanks S.K., etal,Meth. 
Enzymol (1991) 200:38; Hanks S.K., Curr. Opin. Struct. Biol (1991) 7:369; Hanks S.K., et 
al, Science (1988) 247:42) are enzymes that belong to a very extensive family of proteins 
which share a conserved catalytic core common to both serine/threonine and tyrosine protein 

25 kinases. There are a number of conserved regions in the catalytic domain of protein kinases. 
Two of the conserved regions are the basis for the signature pattern in the protein kinase 
profile. The first region, which is located in the N-terminal extremity of the catalytic 
domain, is a glycine-rich stretch of residues in the vicinity of a lysine residue, which has 
been shown to be involved in ATP binding. The second region, which is located in the 

30 central part of the catalytic domain, contains a conserved aspartic acid residue which is 
important for the catalytic activity of the enzyme (Knighton D.R., et al, Science (1991) 
253:407). The protein kinase profile includes two signature patterns for this second region: 
one specific for serine/threonine kinases and the other for tyrosine kinases. A third profile is 
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based on the alignment in (Hanks S.K., et al, FASEB J. (1995) 9:576) and covers the entire 

catalytic domain. The consensus patterns are as follows: 

1) Consensus pattern: [LIV]-G-{P}-G-{P}-[FYWMGSTNH]-[SGA]-{PW}- 
[LIVCAT]-{PD}-x-[GSTACLIVMFY]-x(5,18)-[LIVMFYWCSTAR]-[AIVP]- 

5 [LIVMFAGCKR]-K, where K binds ATP. The majority of known protein kinases are 
detected by this pattern. Proteins kinases that are not detected by this consensus include 
viral kinases, which are quite divergent in this region and are completely missed by this 
pattern. 

2) Consensus pattern: [LIVMFYC]-x-[HY]-x-D-[LIVMFY]-K-x(2)-N- 

1 0 [LIVMFYCT](3), where D is an active site residue. This consensus sequence identifies most 
serine/threonine-specific protein kinases with only 10 exceptions. Half of the exceptions are 
viral kinases, while the other exceptions include Epstein-Barr virus BGLF4 and Drosophila 
ninaC, which have Ser and Arg, respectively, instead of the conserved Lys. These latter two 
protein kinases are detected by the tyrosine kinase specific pattern described below. 

1 5 3) Consensus pattern: [LIVMFYC]-x-[HY]-x-D-[LI VMFY]-[RSTAC]-x(2)-N- 

[LIVMFYC], where D is an active site residue. All tyrosine-specific protein kinases are 
detected by this consensus pattern, with the exception of human ERBB3 and mouse blk. 
This pattern also detects most bacterial aminoglycoside phosphotransferases (Benner S., 
Nature (1987) 529:21 ; Kirby R., 1 Mol Evol (1992) 50:489) and herpesviruses ganciclovir 

20 kinases (Littler E., et al , Nature (1 992) 358: 1 60), which are structurally and evolutionary 
related to protein kinases. 

The protein kinase profile also detects receptor guanylate cyclases and 2-5A- 
dependent ribonucleases. Sequence similarities between these two families and the 
eukaryotic protein kinase family have been noticed previously. The profile also detects 

25 Arabidopsis thaliana kinase-like protein TMKL1 which seems to have lost its catalytic 
activity. 

If a protein analyzed includes the two of the above protein kinase signatures, the 
probability of it being a protein kinase is close to 100%. Eukaryotic-type protein kinases 
have also been found in prokaryotes such as Myxococcus xanthus (Munoz-Dorado J., et al, 
30 Cell (1991) 67:995) and Yersinia pseudotuberculosis. The patterns shown above has been 
updated since their publication in (Bairoch A., et al, Nature (1988) 331:22). 

m) Protein Phosphatase 2C. SEQ ID NO:256 corresponds to a polynucleotide 

encoding a novel protein phosphatase 2C (PP2C), which is one of the four major classes of 

mammalian serine/threonine specific protein phosphatases. PP2C (Wenk et al, FEBS Lett. 

98 



WO 99/33982 PCT/US98/2761 0 

(1992) 297:135) is a monomeric enzyme of about 42 Kd which shows broad substrate 

specificity and is dependent on divalent cations (mainly manganese and magnesium) for its 

activity. Three isozymes are currently known in mammals: PP2C-alpha, -beta and -gamma. 

n) Protein Tyrosine Phosphatase. SEQ ID NO:382 represents a polynucleotide 

5 encoding a protein tyrosine kinase. Tyrosine specific protein phosphatases (EC 3.1 .3.48) 

(PTPase) (Fischer et al, Science (1991) 253:401 ; Charbonneau et al. 9 Annu. Rev. Cell Biol 

(1992) 5:463; Trowbridge, 1 Biol Chem. (1991) 2(5^:23517; Tonks et al, Trends Biochem. 

Sci. (1989) 74:497; and Hunter, Cell (1989) 55:1013) catalyze the removal of a phosphate 

group attached to a tyrosine residue. These enzymes are very important in the control of cell 

10 growth, proliferation, differentiation and transformation. Multiple forms of PTPase have 
been characterized and can be classified into two categories: soluble PTPases and 
transmembrane receptor proteins that contain PTPase domain(s). 

Soluble PTPases include PTPN3 (HI) and PTPN4 (MEG), enzymes that contain an 
N-terminal band 4.1 -like domain and could act at junctions between the membrane and 

1 5 cytoskeleton; PTPN6 (PTP-1 C; HCP; SHP) and PTPN1 1 (PTP-2C; SH-PTP3; Syp), 
enzymes that contain two copies of the SH2 domain at its N-terminal extremity. 

Dual specificity PTPases include DUSP1 (PTPN10; MAP kinase phosphatase- 1; 
MKP-1) which dephosphorylates MAP kinase on both Thr-183 and Tyr-185; and DUSP2 
(PAC-1), a nuclear enzyme that dephosphorylates MAP kinases ERK1 and ERK2 on both 

20 Thr and Tyr residues. 

Structurally, all known receptor PTPases are made up of a variable length 
extracellular domain, followed by a transmembrane region and a C-terminal catalytic 
cytoplasmic domain. Some of the receptor PTPases contain fibronectin type III (FN-IH) 
repeats, immunoglobulin-like domains, MAM domains or carbonic anhydrase-like domains 

25 in their extracellular region. The cytoplasmic region generally contains two copies of the 
PTPAse domain. The first seems to have enzymatic activity, while the second is inactive but 
seems to affect substrate specificity of the first. In these domains, the catalytic cysteine is 
generally conserved but some other, presumably important, residues are not. 

PTPase domains consist of about 300 amino acids. There are two conserved 

30 cysteines and the second one has been shown to be absolutely required for activity. 

Furthermore, a number of conserved residues in its immediate vicinity have also been shown 
to be important. The consensus pattern for PTPases is: [LIVMF]-H-C-x(2)-G-x(3)-[STC]- 
[STAGP]-x-[LIVMFY]; C is the active site residue. 
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o) SH3 Domain. SEQ ID NO:306 and 386 represent polynucleotides encoding SH3 

domain proteins. The Src homology 3 (SH3) domain is a small protein domain of about 60 
amino acid residues first identified as a conserved sequence in the non-catalytic part of 
several cytoplasmic protein tyrosine kinases (e.g. Src, Abl, Lck) (Mayer et aL, Nature (1988) 
5 332:272). The domain has also been found in a variety of intracellular or membrane- 
associated proteins (Musacchio et aL, FEBSLett. (1992) 507:55; Pawson et aL, Curr. Biol. 
(1993) 5:434; Mayer et aL, Trends Cell Biol. (1993) 5:8; and Pawson et aL, Nature (1995) 
575:573). 

The SH3 domain has a characteristic fold that consists of five or six beta-strands 

10 arranged as two tightly packed anti-parallel beta sheets. The linker regions may contain 

short helices (Kuriyan et aL, Curr. Opin. Struct. Biol. (1993) 5:828). It is believed that SH3 
domain-containing proteins mediate assembly of specific protein complexes via binding to 
proline-rich peptides (Morton et aL, Curr. Biol. (1994) 4:615). In general, SH3 domains are 
found as single copies in a given protein, but there is a significant number of proteins with 

1 5 two SH3 domains and a few with 3 or 4 copies. 

SH3 domains have been identified in, for example, protein tyrosine kinases, such as 
the Src, Abl, Bkt, Csk and ZAP70 families of kinases; mammalian phosphatidylinositol- 
specific phospholipase C-gamma-1 and -2; mammalian phosphatidyl inositol 3-kinase 
regulatory p85 subunit; mammalian Ras GTPase-activating protein (GAP); mammalian Vav 

20 oncoprotein, a guanine nucleotide exchange factor of the CDC24 family; Drosophila 
lethal(l)discs large-1 tumor suppressor protein (gene Dlgl); mammalian tight junction 
protein ZO-1; vertebrate erythrocyte membrane protein p55; Caenorhabditis elegans protein 
lin-2;rat protein CASK; and mammalian synaptic proteins SAP90/PSD-95, CHAPS YN- 
1 10/PSD-93, SAP97/DLG1 and SAP 102. Novel SH3-domain containing polypeptides will 

25 facilitate elucidation of the role of such proteins in important biological pathways, such as 
ras activation. 

p) Trypsin. SEQ ID NO: 169 corresponds to a novel serine protease of the trypsin 
family. The catalytic activity of the serine proteases from the trypsin family is provided by a 
charge relay system involving an aspartic acid residue hydrogen-bonded to a histidine, which 
30 itself is hydrogen-bonded to a serine. The sequences in the vicinity of the active site serine 
and histidine residues are well conserved in this family of proteases (Brenner S., Nature 
(1988) 554:528). Proteases known to belong to the trypsin family include: 1) Acrosin; 2) 
Blood coagulation factors VII, IX, X, XI and XII, thrombin, plasminogen, and protein C; 3) 
Cathepsin G; 4) Chymotrypsins; 5) Complement components Clr, Cls, C2, and complement 
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factors B, D and I; 6) Complement-activating component of RA-reactive factor; 7) Cytotoxic 

cell proteases (granzymes A to H); 8) Duodenase I; 9) Elastases 1, 2, 3 A, 3B (protease E), 

leukocyte (medullasin).; 10) Enterokinase (EC 3.4.21.9) (enteropeptidase); 11) Hepatocyte 

growth factor activator; 12) Hepsin; 13) Glandular (tissue) kallikreins (including EGF- 

5 binding protein types A, B, and C, NGF-gamma chain, gamma-renin, prostate specific 

antigen (PSA) and tonin); 14) Plasma kallikrein; 15) Mast cell proteases (MCP) 1 (chymase) 

to 8; 16) Myeloblasts (proteinase 3) (Wegener's autoantigen); 17) Plasminogen activators 

(urokinase-type, and tissue-type); 18) Trypsins I, II, III, and IV; 19) Tryptases; 20) Snake 

venom proteases such as ancrod, batroxobin, cerastobin, flavoxobin, and protein C activator; 

10 21) Collagenase from common cattle grub and collagenolytic protease from Atlantic sand 
fiddler crab; 22) Apolipoprotein(a); 23) Blood fluke cercarial protease; 24) Drosophila 
trypsin like proteases: alpha, easter, snake-locus; 25) Drosophila protease stubble (gene sb); 
and 26) Major mite fecal allergen Der p III. All the above proteins belong to family SI in 
the classification of peptidases (Rawlings N.D., et ai, Meth. Enzymol (1994) 244: 19; 

15 http://www.expasv.ch/cgi-bin/lists7peptidas.txt) and originate from eukaryotic species. It 
should be noted that bacterial proteases that belong to family S2A are similar enough in the 
regions of the active site residues that they can be picked up by the same patterns. 

The consensus patterns for this trypsin protein family are: 1) [LIVM]-[ST]-A- 
[STAGJ-H-C, where H is the active site residue. All sequences known to belong to this class 

20 detected by the pattern, except for complement components C lr and C 1 s, pig plasminogen, 
bovine protein C, rodent urokinase, ancrod, gyroxin and two insect trypsins; 2) 
[DNSTAGC]-[GSTAPIMVQH]-x(2)-G-[DE]-S-G-[GS]-[SAPHV]- [LIVMFYWH]- 
[LIVMFYSTANQH], where S is the active site residue. All sequences known to belong to 
this family are detected by the above consensus sequences, except for 18 different proteases 

25 which have lost the first conserved glycine. If a protein includes both the serine and the 
histidine active site signatures, the probability of it being a trypsin family serine protease is 
100%. 

q) WD Domain, G-Beta Repeats. SEQ ID NOS:188 and 335 represent novel 
members of the WD domain/G-beta repeat family. Beta-transducin (G-beta) is one of the 
30 three subunits (alpha, beta, and gamma) of the guanine nucleotide-binding proteins (G 
proteins) which act as intermediaries in the transduction of signals generated by 
transmembrane receptors (Gilman, Annu. Rev. Biochem. (1987) 56:615), The alpha subunit 
binds to and hydrolyzes GTP; the functions of the beta and gamma subunits are less clear but 
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they seem to be required for the replacement of GDP by GTP as well as for membrane 

anchoring and receptor recognition. 

In higher eukaryotes, G-beta exists as a small multigene family of highly conserved 

proteins of about 340 amino acid residues. Structurally, G-beta consists of eight tandem 

5 repeats of about 40 residues, each containing a central Trp-Asp motif (this type of repeat is 

sometimes called a WD-40 repeat). Such a repetitive segment has been shown to exist in a 

number of other proteins including: human LIS1, a neuronal protein involved in type-1 

Iissencephaly; and mammalian coatomer beta' subunit (beta'-COP), a component of a 

cytosolic protein complex that reversibly associates with Golgi membranes to form vesicles 

1 0 that mediate biosynthetic protein transport. 

The consensus pattern for the WD domain/G-Beta repeat family is: [LIVMSTAC]- 

[LIVMFYWSTAGC]4LIMSTAG]-[LIVMSTAGC]-x(2)-pN]-x(2>[LIVMWSTAC]-x- 

[LIVMFSTAG]-W-[DEN]-[LIVMFSTAGCN]. 

r) wnt Family of Developmental Signaling Proteins. SEQ ID NO: 23, 291, 324, 330, 

15 341, and 353 correspond to novel members of the wnt family of developmental signaling 

proteins. Wnt-1 (previously known as int-1), the seminal member of this family, (Nusse R., 

Trends Genet (1988) 4:291) is a proto-oncogene induced by the integration of the mouse 

mammary tumor virus. It is thought to play a role in intercellular communication and seems 

to be a signalling molecule important in the development of the central nervous system 

20 (CNS). The sequence of wnt-1 is highly conserved in mammals, fish, and amphibians. Wnt- 

1 was found to be a member of a large family of related proteins (Nusse R., et aL, Cell 
(1992) 69:1073; McMahon A.P., Trends Genet. (1992) 5:1; Moon R.T., BioEssays (1993) 
75:91) that are all thought to be developmental regulators. These proteins are known as wnt- 

2 (also known as irp), wnt-3, -3 A, -4, -5A, -5B, -6, -7A, -7B, -8, -8B, -9 and -10. At least 
25 four members of this family are present in Drosophila; one of them, wingless (wg), is 

implicated in segmentation polarity. All these proteins share the following features 

characteristics of secretory proteins: a signal peptide, several potential N-glycosylation sites 

and 22 conserved cysteines that are probably involved in disulfide bonds. The Wnt proteins 

seem to adhere to the plasma membrane of the secreting cells and are therefore likely to 

30 signal over only few cell diameters. The consensus pattern, which is based upon a highly 

conserved region including three cysteines, is as follows: C-K-C-H-G-[LIVMT]-S-G-x-C. 

All sequences known to belong to this family are detected by the provided consensus pattern. 

s) Ww/rso5/WWP Domain-Containing Proteins. SEQ ID NOS: 188, 379 , and 395 

represent polynucleotides encoding a polypeptide in the family of WW/rsp5/WWP domain- 
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containing proteins. The WW domain (Bork et aL, Trends Biochem. Sci. (1994) 7P:531; 

Andre et aL, Biochem. Biophys. Res. Commun. (1994) 205:1201; Hofmann et aL, FEBS Lett. 

(1995) 555:153; and Sudol et aL, FEBS Lett. (1995) 369:67), also known as rsp5 or WWP), 

was originally discovered as a short conserved region in a number of unrelated proteins, 

5 among them dystrophin, the gene responsible for Duchenne muscular dystrophy. The 

domain, which spans about 35 residues, is repeated up to 4 times in some proteins. It has 

been shown (Chen et aL, Proc. Natl Acad. Sci. USA (1995) P2:7819) to bind proteins with 

particular proline-motifs, [AP]-P-P-[AP]-Y, and thus resembles somewhat SH3 domains. It 

appears to contain beta-strands grouped around four conserved aromatic positions, generally 

1 0 Trp. The name WW or WWP derives from the presence of these Trp as well as that of a 

conserved Pro. It is frequently associated with other domains typical for proteins in signal 

transduction processes. 

Proteins containing the WW domain include: 

1. Dystrophin, a multidomain cytoskeletal protein. Its longest alternatively 

1 5 spliced form consists of an N-terminal actin-binding domain, followed by 24 spectrin-like 
repeats, a cysteine-rich calcium-binding domain and a C-terminal globular domain. 
Dystrophins form tetramers and is thought to have multiple functions including involvement 
in membrane stability, transduction of contractile forces to the extracellular environment and 
organization of membrane specialization. Mutations in the dystrophin gene lead to muscular 

20 dystrophy of Duchenne or Becker type. Dystrophin contains one WW domain C-terminal of 
the spectrin-repeats. 

2. Vertebrate YAP protein, which is a substrate of an unknown serine kinase. It 
binds to the SH3 domain of the Yes oncoprotein via a proline-rich region. This protein 
appears in alternatively spliced isoforms, containing either one or two WW domains. 

25 3. IQGAP, which is a human GTPase activating protein acting on ras. It 

contains an N-terminal domain similar to fly muscle mp20 protein and a C-terminal ras 
GTPase activator domain. 

For the sensitive detection of WW domains, the profile spans the whole homology 
region as well as a pattern. The consensus for this family is: W-x(9,l 1)-[VFY]-[FYW]- 

30 x(6,7)-[GSTNE]-[GSTQCR]-[FYW]-x(2)-P. 

t) Zinc Finger, C2H2 Type. SEQ ID NO:61, 306, and 386 correspond to 
polynucleotides encoding novel members of the of the C2H2 type zinc finger protein family. 
Zinc finger domains (Klug et aL, Trends Biochem. Sci. (1987) 72:464; Evans et aL, Cell 
(1988) 52:1; Payre et aL, FEBS Lett. (1988) 234:245; Miller et aL, EMBOJ. (1985) 4:1609; 
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and Berg, Proc. Natl. Acad ScL USA (1988) 85:99) are nucleic acid-binding protein 

structures first identified in the Xenopus transcription factor TFIIIA. These domains have 

since been found in numerous nucleic acid-binding proteins. A zinc finger domain is 

composed of 25 to 30 amino acid residues. Two cysteine or histidine residues are positioned 

5 at both extremities of the domain, which are involved in the tetrahedral coordination of a 

zinc atom. It has been proposed that such a domain interacts with about five nucleotides. 

Many classes of zinc fingers are characterized according to the number and positions 

of the histidine and cysteine residues involved in the zinc atom coordination. In the first 

class to be characterized, called C2H2, the first pair of zinc coordinating residues are 

10 cysteines, while the second pair are histidines. A number of experimental reports have 

demonstrated the zinc-dependent DNA or RNA binding property of some members of this 

class. 

Mammalian proteins having a C2H2 zipper include (number in parenthesis indicates 
number of zinc finger regions in the protein): basonuclin (6), BCL-6/LAZ-3 (6), erythroid 

15 krueppel-like transcription factor (3), transcription factors Spl (3), Sp2 (3), Sp3 (3) and 
Sp(4) 3, transcriptional repressor YY1 (4), Wilms' tumor protein (4), EGRl/Krox24 (3), 
EGR2/Krox20 (3), EGR3/Pilot (3), EGR4/AT133 (4), Evi-1 (10), GLI1 (5), GLI2 (4+), 
GLI3 (3+), HIV-EP1/ZNF40 (4), HIV-EP2 (2), KR1 (9+), KR2 (9), KR3 (15+), KR4 (14+), 
KR5 (1 1+), HF.12 (6+), REX-1 (4), ZfX (13), ZfY (13), Zfp-35 (18), ZNF7 (15), ZNF8 (7), 

20 ZNF35 (10), ZNF42/MZF-1 (13), ZNF43 (22), ZNF46/Kup (2), ZNF76 (7), ZNF91 (36), 
ZNF133(3). 

In addition to the conserved zinc ligand residues, it has been shown that a number of 
other positions are also important for the structural integrity of the C2H2 zinc fingers. 
(Rosenfeld et aL 9 J. Biomol Struct Dyn. (1993) 77:557) The best conserved position is 
25 found four residues after the second cysteine; it is generally an aromatic or aliphatic residue. 

The consensus pattern for C2H2 zinc fingers is: C-x(2,4)-C-x(3)-[LIVMFYWC]- 
x(8)-H-x(3,5)-H. The two C's and two H's are zinc ligands. 

u) Zinc Finger, CCHC Class. SEQ ID NO:322 corresponds to a polynucleotide 
encoding a novel member of the zinc finger CCHC family. The CCHC zinc finger protein 
30 family to date has been mostly composed of retroviral gag proteins (nucleocapsid). The 
prototype structure of this family is from HIV. The family also contains members involved 
in eukaryotic gene regulation, such as C. elegans GLH-1. The consensus sequence of this 
family is based upon the common structure of an 18-residue zinc finger. 
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v) Zinc-Binding Metalloprotease Domain. SEQ ID NO:306 and 395 represent 

polynucleotides encoding novel members of the zinc-binding metalloprotease domain 

protein family. The majority of zinc-dependent metallopeptidases (with the notable 

exception of the carboxypeptidases) share a common pattern of primary structure (Jongeneel 

5 eiai,FEBSLett. (1989)242:211; Murphy et al, FEBSLetL (1991) 259:4; and Bode et al, 

Zoology (1996) 99:237) in the part of their sequence involved in the binding of zinc, and can 

be grouped together as a superfamily, known as the metzincins, on the basis of this sequence 

similarity. Examples of these proteins include: 1) Angiotensin-converting enzyme (EC 

3.4.15.1) (dipeptidyl carboxypeptidase I) (ACE), the enzyme responsible for hydrolyzing 

10 angiotensin I to angiotensin II. 2) Mammalian extracellular matrix metalloproteinases 
(known as matrixins) (Woessner, FASEBJ. (1991) 5:2145): MMP-1 (EC 3.4.24.7) 
(interstitial collagenase), MMP-2 (EC 3.4.24.24) (72 Kd gelatinase), MMP-9 (EC 3.4.24.35) 
(92 Kd gelatinase), MMP-7 (EC 3.4.24.23) (matrylisin), MMP-8 (EC 3.4.24.34) (neutrophil 
collagenase), MMP-3 (EC 3.4.24.17) (stromelysin-1), MMP-10 (EC 3.4.24.22) 

15 (stromelysin-2), and MMP-1 1 (stromelysin-3), MMP-1 2 (EC 3.4.24.65) (macrophage 
metalloelastase). 3) Endothelin-converting enzyme 1 (EC 3.4.24.71) (ECE-1), which 
processes the precursor of endothelin to release the active peptide. 

A signature pattern which includes the two histidine and the glutamic acid residues is 
sufficient to detect this superfamily of proteins, having the consensus pattern: [GSTALIVN]- 

20 x(2)-H-E-[LIVMFYW]-{DEHRKP}-H-x-[LIVMFYWGSPQ]. The two H's are zinc 
ligands, and E is the active site residue. 

Example 4: Differential Expression of Polynucleotides of the Invention : Description of 
Libraries and Detection of Differential Expression 

25 The relative expression levels of the polynucleotides of the invention was assessed in 

several libraries prepared from various sources, including cell lines and patient tissue 
samples. Table 4 provides a summary of these libraries, including the shortened library 
name (used hereafter), the mRNA source used to prepared the cDNA library, the "nickname" 
of the library that is used in the tables below (in quotes), and the approximate number of 

30 clones in the library. 
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Table 4 Description of cDNA Libraries 



Library 
(lib#) 


Description 


Number of 
Clones in this 
Clustering 


1 


Kml2L4 

Human Colon Cell Line, High Metastatic Potential (derived 

fromKml2C) 

"High Colon" 


307133 


2 


Kml2C 

Human Colon Cell Line, Low Metastatic Potential 
"Low Colon" 


284755 


3 


MDA-MB-231 

Human Breast Cancer Cell Line, High Metastatic Potential; 
micro- metastases in lung 
"High Breast" 


326937 


4 


MCF7 

Human Breast Cancer Cell, Non Metastatic 
"Low Breast" 


318979 


8 


MV-522 

Human Lung Cancer Cell Line, High Metastatic Potential 
"High Lung" 


223620 


9 


UCP-3 

Human Lung Cancer Cell Line, Low Metastatic Potential 
"Low Lung" 


312503 


12 


Human microvascular endothelial cells (HMEC) - Untreated 
PCR (OligodT) cDNA library 


41938 


13 


Human microvascular endothelial cells (HMEC) - bFGF 
treated 

PCR (OligodT) cDNA library 


42100 


14 


Human microvascular endothelial cells (HMEC) - VEGF 
treated 

PCR (OligodT) cDNA library 


42825 


15 


Normal Colon - UC#2 Patient 
PCR (OligodT) cDNA library 
"Normal Colon Tumor Tissue" 


34285 


16 | 


Colon Tumor - UC#2 Patient 
PCR (OligodT) cDNA library 
"Normal Colon Tumor Tissue" 


35625 


17 


Liver Metastasis from Colon Tumor of UC#2 Patient 
PCR (OligodT) cDNA library 
"High Colon Metastasis Tissue" 


36984 


18 


Normal Colon - UC#3 Patient 
PCR (OligodT) cDNA library 
"Normal Colon Tumor Tissue" 


36216 


19 


Colon Tumor - UC#3 Patient 
PCR (OligodT) cDNA library 
"High Colon Tumor Tissue" 


41388 


20 


Liver Metastasis from Colon Tumor of UC#3 Patient 
PCR (OligodT) cDNA library 
"High Colon Metastasis Tissue" 


30956 



106 



WO 99/33982 PCT/US98/27610 
The KM12L4 and KM12C cell lines are described in Example 1 above. The MDA- 

MB-231 cell line was originally isolated from pleural effusions (Cailleau, J. Natl Cancer. 
Inst. (1974) 53:661), is of high metastatic potential, and forms poorly differentiated 
adenocarcinoma grade II in nude mice consistent with breast carcinoma. The MCF7 cell line 
was derived from a pleural effusion of a breast adenocarcinoma and is non-metastatic. The 
MV-522 cell line is derived from a human lung carcinoma and is of high metastatic 
potential. The UCP-3 cell line is a low metastatic human lung carcinoma cell line; the MV- 
522 is a high metastatic variant of UCP-3. These cell lines are well-recognized in the art as 
models for the study of human breast and lung cancer (see, e.g., Chandrasekaran et al, 
Cancer Res. (1979) 3P:870 (MDA-MB-231 and MCF-7 ); Gastpar et al, J Med Chem (1998) 
47:4965 (MDA-MB-23 1 and MCF-7); Ranson et al, Br J Cancer (1998) 77:1586 (MDA- 
MB-231 and MCF-7); Kuang et al, Nucleic Acids Res (1998) 25:1 1 16 (MDA-MB-231 and 
MCF-7); Varki et al , Int J Cancer ( 1 987) 40:46 (UCP-3); Varki etal, Tumour Biol (1990) 
77:327; (MV-522 and UCP-3); Varki et al, Anticancer Res. (1990) 70:637; (MV-522); 
Kelner et al, Anticancer Res (1 995) 75:867 (MV-522); and Zhang et al, Anticancer Drugs 
(1997) 5:696 (MV522)). The samples of libraries 1 5-20 are derived from two different 
patients (UC#2, and UC#3). 

Each of the libraries is composed of a collection of cDNA clones that in turn are 
representative of the mRNAs expressed in the indicated mRNA source. In order to facilitate 
the analysis of the millions of sequences in each library, the sequences were assigned to 
clusters. The concept of "cluster of clones" is derived from a sorting/grouping of cDNA 
clones based on their hybridization pattern to a panel of roughly 300 7bp oligonucleotide 
probes (see Drmanac et al, Genomics (1996) J7(l):29). Random cDNA clones from a 
tissue library are hybridized at moderate stringency to 300 7bp oligonucleotides. Each 
oligonucleotide has some measure of specific hybridization to that specific clone. The 
combination of 300 of these measures of hybridization for 300 probes equals the 
"hybridization signature" for a specific clone. Clones with similar sequence will have 
similar hybridization signatures. By developing a sorting/grouping algorithm to analyze 
these signatures, groups of clones in a library can be identified and brought together 
computationally. These groups of clones are termed "clusters". Depending on the 
stringency of the selection in the algorithm (similar to the stringency of hybridization in a 
classic library cDNA screening protocol), the "purity" of each cluster can be controlled. For 
example, artifacts of clustering may occur in computational clustering just as artifacts can 

occur in 6t wet-lab" screening of a cDNA library with 400 bp cDNA fragments, at even the 
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highest stringency. The stringency used in the implementation of cluster herein provides 

groups of clones that are in general from the same cDNA or closely related cDNAs. Closely 

related clones can be a result of different length clones of the same cDNA, closely related 

clones from highly related gene families, or splice variants of the same cDNA. 

Differential expression for a selected cluster was assessed by first determining the 

number of cDNA clones corresponding to the selected cluster in the first library (Clones in 

1 st ), and the determining the number of cDNA clones corresponding to the selected cluster in 

the second library (Clones in 2 nd ). Differential expression of the selected cluster in the first 

library relative to the second library is expressed as a "ratio" of percent expression between 

the two libraries. In general, the "ratio" is calculated by: 1) calculating the percent 

expression of the selected cluster in the first library by dividing the number of clones 

corresponding to a selected cluster in the first library by the total number of clones analyzed 

from the first library; 2) calculating the percent expression of the selected cluster in the 

second library by dividing the number of clones corresponding to a selected cluster in a 

second library by the total number of clones analyzed from the second library; 3) dividing 

the calculated percent expression from the first library by the calculated percent expression 

from the second library. If the "number of clones" corresponding to a selected cluster in a 

library is zero, the value is set at 1 to aid in calculation. The formula used in calculating the 

ratio takes into account the "depth" of each of the libraries being compared, i.e., the total 

number of clones analyzed in each library. 

In general, a polynucleotide is said to be significantly differentially expressed 
between two samples when the ratio value is greater than at least about 2, preferably greater 
than at least about 3, more preferably greater than at least about 5 , where the ratio value is 
calculated using the method described above. The significance of differential expression is 
determined using a z score test (Zar, Biostatistical Analysis. Prentice Hall, Inc., USA, 
"Differences between Proportions," pp 296-298 (1974). 

Tables 5 to 7 (inserted before the claims) show the number of clones in each of the 
above libraries that were analyzed for differential expression. Examples of differentially 
expressed polynucleotides of particular interest are described in more detail below. 

Example 5: Polynucleotides Differentially Expressed in High Metastatic Potential Breast 

Cancer Cells Versus Low Metastatic Breast Cancer Cells 

A number of polynucleotide sequences have been identified that are differentially 

expressed between cells derived from high metastatic potential breast cancer tissue and low 
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metastatic breast cancer cells. Expression of these sequences in breast cancer can be 

valuable in determining diagnostic, prognostic and/or treatment information. For example, 

sequences that are highly expressed in the high metastatic potential cells can be indicative of 

increased expression of genes or regulatory sequences involved in the metastatic process. A 

patient sample displaying an increased level of one or more of these polynucleotides may 

thus warrant more aggressive treatment. In another example, sequences that display higher 

expression in the low metastatic potential cells can be associated with genes or regulatory 

sequences that inhibit metastasis, and thus the expression of these polynucleotides in a 

sample may warrant a more positive prognosis than the gross pathology would suggest. 

The differential expression of these polynucleotides can be used as a diagnostic 
marker, a prognostic marker, for risk assessment, patient treatment and the like. These 
polynucleotide sequences can also be used in combination with other known molecular 
and/or biochemical markers. 

The following table summarizes identified polynucleotides with differential 
expression between high metastatic potential breast cancer cells and low metastatic potential 
breast cancer cells. 
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Table 8. Differentially expressed polynucleotides: High metastatic potential breast 

cancer vs. low metastatic breast cancer cells 





1 Differential Expression 


Cluster 


Clones in 


Clones in 


Katio 


IN LI. 


ID 


1 5t 

1 

Library 


<%nd 
2 

Library 




Q 

y 


High breast > Low breast (Lib3 > Lib4) 


2623 


1 1 
31 


4 


7.561356 


42 


High breast > Low breast (Lib3 > Lib4) 


OAT 

307 


196 


75 


2.549721 


JZ 


High breast > Low breast (Lib3 > Lib4) 


19 


1 1 H A 

1364 


525 


2.534854 


oz 


High breast > Low breast (Lib3 > Lib4) 


2623 


1 1 
3 1 


4 


7.561356 


OD 


High breast > Low breast (Lib3 > Lib4) 


ACS 

5749 


9 


0 


8.780930 


66 


High breast > Low breast (Lib3 > Lib4) 


6455 


6 


0 


5.853953 


68 


High breast > Low Breast (Lib3 > Lib4) 


6455 


6 


0 


5.853953 


1 14 


High Breast > Low Breast (Lib3 > Lib4) 


2030 


32 


4 


7.805271 


123 


High breast > Low breast (Lib3 > Lib4) 


3389 


1 1 
13 


2 


6.341782 


1 A A 

144 


High Breast > Low Breast (Lib3 > Lib4) 


4623 


12 


2 


^ or Trtn 

5.853953 


172 


High Breast > Low Breast (Lib3 > Lib4) 


102 


278 


1 16 


2.33 82 17 


1 TO 

178 


High Breast > Low Breast (Lib3 > Lib4) 


3681 


10 


1 


9.756589 


214 


High Breast > Low Breast (Lib3 > Lib4) 


3900 


8 


1 


7.805271 


O 1 ft 

21V 


High Breast > Low Breast (Lib3 > Lib4) 


3389 


13 


2 


6.341782 


223 


High Breast > Low Breast (Lib3 > Lib4) 


1399 


19 


7 


2.648217 


258 


High breast > Low breast (Lib3 > Lib4) 


4837 


10 


0 


9.756589 


317 


High breast > Low breast (Lib3 > Lib4) 


1577 


25 


3 


8. 130490 


3 ly 


High breast > Low breast (Lib3 > Lib4) 


260 


27 


2 


13.17139 


4 


Low Breast > High Breast (Lib4 > Lib3) 


3706 


22 


4 


5.637215 


39 


Low Breast > High Breast (Lib4 > Lib3) 


4016 


6 


0 


6.149690 


74 


Low Breast > High Breast (Lib4 > Lib3) 


6268 


18 


3 


6.149690 


O 1 

81 


¥ t «^ ¥ T " 1_ ¥*> a. ft ' i A ^ T • 1 O \ 

Low Breast > High Breast (Lib4 > Lib3) 


40392 


8 


1 


8.199586 


i^n 
i j\j 


low Joreasi riign oreast (lid4 > lid3 ) 


1 1 1 Q1 


1 


A 

V 


/.I /403o 


157 


Low Breast > High Breast (Lib4 > Lib3) 


5417 


9 


0 


9.224535 


162 


Low Breast > High Breast (Lib4 > Lib3) 


9685 


7 


0 


7.174638 


183 


Low Breast > High Breast (Lib4 > Lib3) 


7337 


16 


3 


5.466391 


202 


Low Breast > High Breast (Lib4 > Lib3) 


6124 


9 


1 


9.224535 


298 


Low Breast > High Breast (Lib4 > Lib3) 


1037 


22 


4 


5.637215 


338 


Low Breast > High Breast (Lib4 > Lib3) 


689 


36 


17 


2.170478 


384 


Low Breast > High Breast (Lib4 > Lib3) 


697 


72 


30 


2.459876 


386 


Low Breast > High Breast (Lib4 > Lib3) 


4568 


9 


0 


9.224535 


388 


Low Breast > High Breast (Lib4 > Lib3) 


5622 


13 


2 


6.662164 



5 Example 6: Polynucleotides Differentially Expressed in High Metastatic Potential Lung 
Cancer Cells Versus Low Metastatic Lung Cancer Cells 
A number of polynucleotide sequences have been identified that are differentially 
expressed between cells derived from high metastatic potential lung cancer tissue and low 
metastatic lung cancer cells. Expression of these sequences in lung cancer tissue can be 
10 valuable in determining diagnostic, prognostic and/or treatment information. For example, 
sequences that are highly expressed in the high metastatic potential cells are associated can 
be indicative of increased expression of genes or regulatory sequences involved in the 
metastatic process. A patient sample displaying an increased level of one or more of these 
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polynucleotides may thus warrant more aggressive treatment. In another example, 

sequences that display higher expression in the low metastatic potential cells can be 

associated with genes or regulatory sequences that inhibit metastasis, and thus the expression 

of these polynucleotides in a sample may warrant a more positive prognosis than the gross 

pathology would suggest. 

The differential expression of these polynucleotides can be used as a diagnostic 
marker, a prognostic marker, for risk assessment, patient treatment and the like. These 
polynucleotide sequences can also be used in combination with other known molecular 
and/or biochemical markers. 

The following table summarizes identified polynucleotides with differential 
expression between high metastatic potential lung cancer cells and low metastatic potential 
lung cancer cells: 



Table 9 Differentially expressed polynucleotides: High metastatic potential lung 
cancer vs. low metastatic lung cancer cells 



SEQID 


Differential Expression 


Cluster 


Clones in 


Clones in 


Ratio 


NO. 




ID 


1" 
Library 


2 nd 
Library 




400 


High Lung > Low Lung (Lib8 > Lib 9) 


14929 


23 


16 


2.008868 


9 


High Lung > Low Lung (Lib8 > Lib9) 


2623 


6 


1 


8.384840 


34 


High Lung > Low Lung (Lib8 > Lib9) 


5832 


5 


0 


6.987366 


42 


High Lung > Low Lung (Lib8 > Lib9) 


307 


79 


27 


4.088903 


62 


High Lung > Low Lung (Lib8 > Lib9) 


2623 


6 


1 


8.384840 


74 


High Lung > Low Lung (Lib8 > Lib9) 


6268 


5 


0 


6.987366 


106 


High Lung > Low Lung (Lib8 > Lib9) 


10717 


8 


0 


11.17978 


119 


High Lung > Low Lung (Lib8 > Lib9) 


8 


1355 


122 


15.52111 


361 


High Lung > Low Lung (Lib8 > Lib9) 


1120 


5 


0 


6.987366 


369 


High Lung > Low Lung (Lib8 > Lib9) 


2790 


6 


0 


8.384840 


371 


High Lung > Low Lung (Lib8 > Lib9) 


8847 


6 


1 


8.384840 


379 


High Lung > Low Lung (Lib8 > Lib9) 


260 


15 


0 


20.96210 


395 


High Lung > Low Lung (Lib8 > Lib9) 


13538 


9 


1 


12.57726 


135 


Low Lung > High Lung (Lib9 > Lib8) 


36313 


30 


1 


21.46731 


154 


Low Lung > High Lung (Lib9 > Lib8) 


5345 


27 


6 


3.220097 


160 


Low Lung > High Lung (Lib9 > Lib8) 


4386 


21 


3 


5.009039 


260 


Low Lung > High Lung (Lib9 > Lib8) 


4141 


27 


4 


4.830145 


308 


Low Lung > High Lung (Lib9 > Lib8) 


15855 


213 


12 


12.70149 


323 


Low Lung > High Lung (Lib9 > Lib8) 


5257 


25 


5 


3.577885 


349 


Low Lung > High Lung (Lib9 > Lib8) 


2797 


14 


1 


10.01807 


381 


Low Lung > High Lung (Lib9 > Lib8) 


2428 


19 


2 


6.797982 



Example 7: Polynucleotides Differentially Expressed in High Metastatic Potential Colon 
Cancer Cells Versus Low Metastatic Colon Cancer Cells 
A number of polynucleotide sequences have been identified that are differentially 
expressed between cells derived from high metastatic potential colon cancer tissue and low 
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metastatic colon cancer cells. Expression of these sequences in colon cancer tissue can be 

valuable in determining diagnostic, prognostic and/or treatment information. For example, 

sequences that are highly expressed in the high metastatic potential cells can be indicative of 

increased expression of genes or regulatory sequences involved in the metastatic process. A 

patient sample displaying an increased level of one or more of these polynucleotides may 

thus warrant more aggressive treatment. In another example, sequences that display higher 

expression in the low metastatic potential cells can be associated with genes or regulatory 

sequences that inhibit metastasis, and thus the expression of these polynucleotides in a 

sample may warrant a more positive prognosis than the gross pathology would suggest. 

The differential expression of these polynucleotides can be used as a diagnostic 
marker, a prognostic marker, for risk assessment, patient treatment and the like. These 
polynucleotide sequences can also be used in combination with other known molecular 
and/or biochemical markers. 

The following table summarizes identified polynucleotides with differential 
expression between high metastatic potential colon cancer cells and low metastatic potential 
colon cancer cells: 

Table 11: Differentially expressed polynucleotides: High metastatic potential colon 
cancer vs. low metastatic colon cancer cells 



SEQID 


Differential Expression 


Cluster 


Clones in 


Clones in 


Ratio 


NO. 




ID 


Library 


Library 




1 


High Colon > Low Colon (Libl > Lib2) 


6660 


7 


0 


6.489973 


176 


High Colon > Low Colon (Libl > Lib2) 


3765 


19 


6 


2.935940 


241 


High Colon > Low Colon (Libl > Lib2) 


4275 


11 


2 


5.099264 


362 


High Colon > Low Colon (Libl > Lib2) 


6420 


8 


0 


7.417112 


374 


High Colon > Low Colon (Libl > Lib2) 


6420 


8 


0 


7.417112 


39 


Low Colon > High Colon (Lib2 > Libl) 


4016 


14 


5 


3.020043 


97 


Low Colon > High Colon (Lib2 > Libl) 


945 


21 


9 


2.516702 


134 


Low Colon > High Colon (Lib2 > Libl) 


2464 


19 


5 


4.098630 


317 


Low Colon > High Colon (Lib2 > Libl) 


1577 


40 


12 


3.595289 


357 


Low Colon > High Colon (Lib2 > Libl) 


4309 


13 


4 


3.505407 



Example 8: Polynucleotides Differentially Expressed at Higher Levels in High Metastatic 

Potential Colon Cancer Patient Tissue Versus Normal Patient Tissue 

A number of polynucleotide sequences have been identified that are differentially 

expressed between cells derived from high metastatic potential colon cancer tissue and 

normal tissue. Expression of these sequences in colon cancer tissue can be valuable in 

determining diagnostic, prognostic and/or treatment information. For example, sequences 

that are highly expressed in the high metastatic potential cells are associated can be 
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indicative of increased expression of genes or regulatory sequences involved in the advanced 

disease state which involves processes such as angiogenesis, dedifferentiation, cell 

replication, and metastasis. A patient sample displaying an increased level of one or more of 

these polynucleotides may thus warrant more aggressive treatment. 

5 The differential expression of these polynucleotides can be used as a diagnostic 

marker, a prognostic marker, for risk assessment, patient treatment and the like. These 

polynucleotide sequences can also be used in combination with other known molecular 

and/or biochemical markers. 

The following table summarizes identified polynucleotides with differential 

1 0 expression between high metastatic potential colon cancer cells and normal colon cells: 



Table 11: Differentially expressed polynucleotides: High metastatic potential colon 
tissue vs. normal colon tissue 



SEQID 


Differential Expression 


Cluster 


Clones in 


Clones in 


Ratio 


NO. 




ID 


1" 
Library 


2 nd 
Library 




52 


High Colon Metastasis Tissue > Normal 
Colon Tissue of UC#3 (Lib20 > Lib 18) 


19 


10 


0 


11.69918 


52 


High Colon Metastasis Tissue > Normal 
Tissue in UC#2 (Libl7 > Libl5) 


19 


13 


2 


6.025646 


172 


High Colon Metastasis Tissue > Normal 
Tissue in UC#2 (Libl7 > Libl5) 


102 


65 


22 


2.738930 



15 Example 9: Polynucleotides Differentially Expressed at Higher Levels in High Colon 
Tumor Potential Patient Tissue Versus Metastasized Colon Cancer Patient 
Tissue 

A number of polynucleotide sequences have been identified that are differentially 
expressed between cells derived from high tumor potential colon cancer tissue and cells 

20 derived from high metastatic potential colon cancer cells. Expression of these sequences in 
colon cancer tissue can be valuable in determining diagnostic, prognostic and/or treatment 
information associated with the transformation of precancerous tissue to malignant tissue. 
This information can be useful in the prevention of achieving the advanced malignant state 
in these tissues, and can be important in risk assessment for a patient. 

25 The following table summarizes identified polynucleotides with differential 

expression between high tumor potential colon cancer tissue and cells derived from high 
metastatic potential colon cancer cells: 
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Table 12: Differentially expressed polynucleotides: High tumor potential colon tissue 



vs. metastatic colon tissue 

SEQID Differential Expression Cluster Clones in 1 st Clones in 2 nd Ratio 

NO. ID Library Library 

52 High Colon Tumor Tissue > Metastasis 19 69 10 5.160829 

Tissue of UC#3 (Lib 1 9 > Lib20) 
119 High Colon Tumor Tissue > Metastasis 8 14 1 10.47124 

Tissue of UC#3 (Libl9 > Lib20) 
172 High Colon Tumor Tissue > Metastasis 102 43 10 3.216168 

Tissue of UC#3 (Libl9 > Lib20) 



Example 10: Polynucleotides Differentially Expressed at Higher Levels in High Tumor 
Potential Colon Cancer Patient Tissue Versus Normal Patient Tissue 

A number of polynucleotide sequences have been identified that are differentially 
expressed between cells derived from high tumor potential colon cancer tissue and normal 
tissue. Expression of these sequences in colon cancer tissue can be valuable in determining 
diagnostic, prognostic and/or treatment information associated with the prevention of 
achieving the malignant state in these tissues, and can be important in risk assessment for a 
patient. For example, sequences that are highly expressed in the potential colon cancer cells 
are associated with or can be indicative of increased expression of genes or regulatory 
sequences involved in early tumor progression. A patient sample displaying an increased 
level of one or more of these polynucleotides may thus warrant closer attention or more 
frequent screening procedures to catch the malignant state as early as possible. 

The following table summarizes identified polynucleotides with differential 
expression between high metastatic potential colon cancer cells and normal colon cells: 



Table 13: Differentially expressed polynucleotides: High tumor potential colon tissue 
vs. normal colon tissue 



SEQID 


Differential Expression 


Cluster 


Clones in 


Clones in 


Ratio 


NO. 




ID 


1" 


2 nd 




52 






Library 


Library 




High Colon Tumor Tissue > Normal Tissue 
ofUC#2 (Libl6>Libl5) 


19 


13 


2 


6.255508 


288 


High Colon Tumor Tissue > Normal Tissue 
ofUC#2(Libl6>Lib!5) 


1267 


7 


0 


6.125253 


52 


High Colon Tumor Tissue > Normal Tissue 
ofUC#3 (Libl9>Libl8) 


19 


69 


0 


60.37750 


119 


High Colon Tumor Tissue > Normal Tissue 
ofUC#3(Libl9>Libl8) 


8 


14 


1 


12.25050 


172 


High Colon Tumor Tissue > Normal Tissue 
ofUC#3 (Libl9>Libl8) 


102 


43 


7 


5.375222 
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Example 1 1 : Polynucleotides Differentially Expressed Across Multiple Libraries 

A number of polynucleotide sequences have been identified that are differentially 
expressed between cancerous cells and normal cells across all three tissue types tested (i.e., 
5 breast, colon, and lung). Expression of these sequences in a tissue or any origin can be 
valuable in determining diagnostic, prognostic and/or treatment information associated with 
the prevention of achieving the malignant state in these tissues, and can be important in risk 
assessment for a patient. These polynucleotides can also serve as non-tissue specific 
markers of, for example, risk of metastasis of a tumor. The following table summarizes 
10 identified polynucleotides that were differentially expressed but without tissue type- 
specificity in the breast, colon, and lung libraries tested. 



Table 14: Polynucleotides Differentially Expressed Across Multiple Library Comparisons 



SEQID 


Differential Expression 


Cluster 


Clones in 


Clones in 


Ratio 


NO. 




ID 


1 st 


2 nd 










Library 


Library 




9 


High Breast > Low Breast (Lib3 > Lib4) 


2623 


31 


4 


7 561156 




High Lung > Low Lung (Lib8 > Lib9) 


2623 


6 


1 


8 384840 


39 


Low Breast > High Breast (Lib4 > Lib3) 


4016 


6 


0 


6.149690 




Low Colon > High Colon (Lib2 > Libl) 


4016 


14 


5 


3.020043 


42 


High Breast > Low Breast (Lib3 > Lib4) 


307 


196 


75 


2.549721 




High Lung > Low Lung (Lib8 > Lib9) 


307 


79 


27 


4.088903 


52 


High Breast > Low Breast (Lib3 > Lib4) 


19 


1364 


525 


2.534854 




High Colon Metastasis Tissue > Normal 


19 


10 


0 


11.69918 




Colon Tissue of UC#3 (Lib20 > Lib 18) 












High Colon Metastasis Tissue > Normal 


19 


13 


2 


6.025646 




Tissue in UC#2 (Libl 7 > Libl 5) 












High Colon Tumor Tissue > Metastasis 


19 


69 


10 


5.160829 




Tissue of UC#3 (Libl9 > Lib20) 












High Colon Tumor Tissue > Normal Tissue 


19 


13 


2 


6.255508 




ofUC#2 (Libl6>Libl5) 












High Colon Tumor Tissue > Normal Tissue 


19 


69 


0 


60.37750 




ofUC#3 (Libl9>Libl8) 










62 


High Breast > Low Breast (Lib3 > Lib4) 


2623 


31 


4 


7.561356 




High Lung > Low Lung (Lib8 > Lib9) 


2623 


6 


1 


8.384840 


74 


High Lung > Low Lung (Lib8 > Lib9) 


6268 


5 


0 


6.987366 




Low Breast > High Breast (Lib4 > Lib3) 


6268 


18 


3 


6.149690 


119 


High Colon Tumor Tissue > Metastasis 


8 


14 


1 


10.47124 




Tissue of UC#3 (Libl9 > Lib20) 












High Colon Tumor Tissue > Normal Tissue 


8 


14 


1 


12.25050 




ofUC#3 (Libl9>Libl8) 












High Lung > Low Lung (Lib8 > Lib9) 


8 


1355 


122 


15.52111 


172 


High Breast > Low Breast (Lib3 > Lib4) 


102 


278 


116 


2.338217 




High Colon Metastasis Tissue > Normal 


102 


65 


22 


2.738930 




Tissue in UC#2 (Libl 7 > LiblS) 












High Colon Tumor Tissue > Metastasis 


102 


43 


10 


3.216168 
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SEQ ID Differential Expression 


Cluster 


Clones in 


Clones in 


Ratio 


NO. 


ID 




2 n< * 








Library 


Library 




Tissue of UC#3 (Libl9 > Lib20) 










High Colon Tumor Tissue > Normal Tissue 


102 


43 


7 


5.375222 


ofUC#3 (Libl9>Libl8) 










3 1 7 High Breast > Low Breast (Lib3 > Lib4) 


1577 


25 


3 


8.130490 


Low Colon > High Colon (Lib2 > Libl) 


1577 


40 


12 


3.595289 


379 High Breast > Low Breast (Lib3 > Lib4) 


260 


27 


2 


13.17139 


High Lung > Low Lung (Lib8 > Lib9) 


260 


15 


0 


20.96210 



Example 12: Polynucleotides Exhibiting Colon-Specific Expression 

The cDNA libraries described herein were also analyzed to identify those 
polynucleotides that were specifically expressed in colon cells or tissue, Le. 9 the 
polynucleotides were identified in libraries prepared from colon cell lines or tissue, but not 
in libraries of breast or lung origin. The polynucleotides that were expressed in a colon cell 
line and/or in colon tissue, but were present in the breast or lung cDNA libraries described 
herein, are shown in Table 15, 



Table 15 Polynucleotides specifically expressed in colon cells. 

SEQ ID Cluster Clones in Clones in SEQ ID Cluster Clones in Clones in 
NO. 1 st Library 2 nd Library NO. 1 st Library 2 nd Library 



5 


36535 


2 


0 


229 


39648 


2 


0 


13 


27250 


2 


0 


231 


85064 


1 


0 


19 


16283 


3 


0 


234 


39391 


2 


0 


24 


16918 


4 


0 


236 


39498 


2 


0 


26 


40108 


2 


0 


242 


22113 


3 


0 


32 


32663 


1 


1 


247 


19255 


2 


0 


43 


39833 


2 


0 


252 


22814 


3 


0 


47 


18957 


3 


0 


253 


39563 


2 


0 


48 


39508 


2 


0 


254 


39420 


2 


0 


56 


7005 


8 


2 


257 


39412 


2 


0 


58 


18957 


3 


0 


261 


38085 


2 


0 


59 


18957 


3 


0 


265 


40054 


1 


0 


60 


16283 


3 


0 


266 


39423 


2 


0 


64 


13238 


4 


1 


267 


39453 


2 


0 


70 


39442 


2 


0 


270 


78091 


1 


0 


71 


17036 


4 


0 


276 


39168 


2 


0 


73 


7005 


8 


2 


277 


39458 


2 


0 


83 


11476 


6 


0 


278 


14391 


3 


1 


86 


39425 


2 


0 


279 


39195 


2 


0 


94 


21847 


2 


1 


282 


12977 


5 


0 


100 


16731 


3 


1 


284 


14391 


3 


1 


101 


12439 


4 


0 


290 


16347 


4 


0 


113 


17055 


4 


0 


293 


39478 


2 


0 


120 


67907 


1 


0 


294 


39392 


2 


0 


121 


12081 


4 


0 


297 


39180 


2 


0 


124 


39174 


2 


0 


299 


6867 


7 


3 
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SEQ ID Cluster 


Clones in Clones in SEQ ID 


Cluster 


Clones in 


Clones in 


NO. 




1 st 


Library 2 nd 


Library NO. 




1" Library 2 nd Library 


126 


8210 


2 


6 


301 


41633 


1 


1 


128 


40455 


2 


0 


302 


23218 


3 


0 


139 


22195 


3 


0 


303 


39380 


2 


0 


143 


86859 


1 


0 


309 


84328 


1 


0 


150 


8672 


4 


4 


314 


14367 


3 


0 


153 


16977 


4 


0 


320 


39886 


2 


o 


156 


17036 


4 


0 


324 


9061 


5 


2 


159 


40044 


2 


0 


327 


16653 


3 


1 


161 


40044 


2 


0 


328 


16985 


4 


o 


163 


22155 


3 


0 


329 


12977 


5 


o 


166 


15066 


4 


0 


330 


9061 


5 


2 


170 


11465 


5 


0 


333 


16392 


3 


o 


176 


3765 


19 


6 


342 


39486 


2 


o 


181 


86110 


1 


0 


344 


6874 


6 


3 


182 


39648 


2 


0 


345 


6874 


6 


3 


185 


17076 


4 


0 


353 


11494 


4 


o 


186 


22794 


2 


0 


354 


17062 


3 


0 


187 


39171 


2 


0 


355 


16245 


4 


o 


194 


40455 


2 


0 


356 


83103 


1 


0 


199 


16317 


3 


0 


358 


13072 


4 


1 


210 


39186 


2 


0 


366 


14364 


1 


0 


211 


40122 


2 


0 


368 


84182 


1 


0 


218 


26295 


2 


0 


372 


56020 


1 


0 


222 


4665 


5 


9 


389 


7514 


5 


3 


226 


82498 


1 


0 


391 


7570 


5 


3 


227 


35702 


2 


0 


393 


23210 • 


3 


0 



In addition to the above, SEQ ID NOS:159 and 161 were each present in one clone in 
each of Lib 16 (Normal Colon Tumor Tissue), and SEQ ID NOS:344 and 345 were each 
present in one clone in Libl 7 (High Colon Metastasis Tissue). No clones corresponding to 
5 the colon-specific polynucleotides in the table above were present in any of Libraries 3, 4, 8, 
or 9. The polynucleotide provided above can be used as markers of cells of colon origin, and 
find particular use in reference arrays, as described above. 

Example 13: Identification of Contiguous Sequences Having a Polynucleotide of the 

10 Invention 

The novel polynucleotides were used to screen publicly available and proprietary 
databases to determine if any of the polynucleotides of SEQ ID NOS: 1-404 would facilitate 
identification of a contiguous sequence, e.g., the polynucleotides would provide sequence 
that would result in 5' extension of another DNA sequence, resulting in production of a 

1 5 longer contiguous sequence composed of the provided polynucleotide and the other DNA 
sequence(s). Contiging was performed using the AssemblyLign program with the following 
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parameters: 1) Overlap: Minimum Overlap Length: 30; % Stringency: 50; Minimum 

Repeat Length: 30; Alignment: gap creation penalty: 1 .00, gap extension penalty: 1.00; 2) 

Consensus: % Base designation threshold: 80. 

Using these parameters, 44 polynucleotides provided contiged sequences. These 

contiged sequences are provided as SEQ ID NOS:801-844. The contiged sequences can be 

correlated with the sequences of SEQ ID NOS: 1-404 upon which the contiged sequences are 

based by identifying those sequences of SEQ ID NOS: 1-404 and the contiged sequences of 

SEQ ID NOS:801-844 that share the same clone name in Table 1. It should be noted that of 

these 44 sequences that provided a contiged sequence, the following members of that group 

of 44 did not contig using the overlap settings indicated in parentheses (Stringency/Overlap): 

SEQ ID NO:804 (30%/10); SEQ ID NO:810 (20%/20); SEQ ID NO:812 (30%/10); SEQ ID 

NO:814 (40%/20); SEQ ID NO:816 (30%/10); SEQ ID NO:832 (30%/10); SEQ ID NO:840 

(20%/20); SEQ ID NO:841 (40%/20). To generalize, the indicated polynucleotides did not 

contig using a minimum 20% stringency, 10 overlap. There was a corresponding increase in 

the number of degenerate codons in these sequences. 

The contiged sequences (SEQ ID NO:801-844) thus represent longer sequences that 

encompass a polynucleotide sequence of the invention. The contiged sequences were then 

translated in all three reading frames to determine the best alignment with individual 

sequences using the BLAST programs as described above for SEQ ID NOS: 1-404 and the 

validation sequences SEQ ID NOS:405-800. Again the sequences were masked using the 

XBLAST profram for masking low complexity as described above in Example 1 (Table 2). 

Several of the contiged sequences were found to encode polypeptides having characteristics 

of a polypeptide belonging to a known protein families (and thus represent new members of 

these protein families) and/or comprising a known functional domain (Table 16). Thus the 

invention encompasses fragments, fusions, and variants of such polynucleotides that retain 

biological activity associated with the protein family and/or functional domain identified 

herein. 
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Table 16. Profile hits using contiged sequences 



SEQ 

ID 

NO. 


Sequence Name 


Profile 


Start 
(Stop) 


Score 


809 


Contig_RTA00000 1 77 AF.n. 1 8.3 . 
aeq_ J HCI i JUj 1 


ATPases 


778 
(1612) 


6040 


824 


Contig RTA00000 1 87AF.g.24. 1 . 
Seq_THCl 68636 


homeobox 


531 
(707) 


12080 


824 


Contig RTA00000187AF.g.24.1. 
Seq_THC 168636 


MAP kinase kinase 


769 
(1494) 


5784 


833 


Contig RTAO0000 190AFJ .4. 1 . 
Seq_THC228776 


protein kinase 


170 
(1010) 


5027 


833 


Contig RTA00000190AF.j.4.1. 
Seq_THC228776 


protein kinase 


170 
(1010) 


5027 



All stop/start sequences are provided in the forward direction. 



5 The profiles for the ATPases (AAA) and protein kinase families are described above 

in Example 2. The homeobox and MAP kinase kinase protein families are described further 
below. 

Homeobox domain. The 'homeobox' is a protein domain of 60 amino acids (Gehring 
In: Guidebook to the Homeobox Genes . Duboule D., Ed., ppl-10, Oxford University Press, 

1 0 Oxford, (1 994); Buerglin In: Guidebook to the Homeobox Genes, pp25-72, Oxford 

University Press, Oxford, (1994); Gehring Trends Biochem. ScL (1992) 77:277-280; Gehring 
etalAnnu. Rev. Genet (1986)20:147-173; Schofield Trends NeuroscL (1987) 70:3-6; 
http://copan.bioz.unibas.ch/ homeo.html) first identified in number of Drosophila homeotic 
and segmentation proteins. It is extremely well conserved in many other animals, including 

1 5 vertebrates. This domain binds DNA through a helix-turn-helix type of structure. Several 
proteins that contain a homeobox domain play an important role in development. Most of 
these proteins are sequence-specific DNA-binding transcription factors. The homeobox 
domain is also very similar to a region of the yeast mating type proteins. These are 
sequence-specific DNA-binding proteins that act as master switches in yeast differentiation 

20 by controlling gene expression in a cell type-specific fashion. 

A schematic representation of the homeobox domain is shown below. The helix- 
turn-helix region is shown by the symbols ! H* (for helix), and f t ! (for turn). 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxHHHHHHHHtttHHHHHHHHHxxxxxxxxxx 
25 1 60 
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The pattern detects homeobox sequences 24 residues long and spans positions 34 to 57 of the 

homeobox domain. The consensus pattern is as follows: [LIVMFYG]-[ASLVR]-x(2)- 

[LIVMSTACN]-x-[LIVM]-x(4)-^ 

[NDQTAH]-x(5)-[RKNAIMW] . 

MAP kinase kinase (MAPKK V MAP kinases (MAPK) are involved in signal 

transduction, and are important in cell cycle and cell growth controls. The MAP kinase 

kinases (MAPKK) are dual-specificity protein kinases which phosphorylate and activate 

MAP kinases. MAPKK homologues have been found in yeast, invertebrates, amphibians, 

and mammals. Moreover, the MAPKK/MAPK phosphorylation switch constitutes a basic 

module activated in distinct pathways in yeast and in vertebrates. MAPKK regulation 

studies have led to the discovery of at least four MAPKK convergent pathways in higher 

organisms. One of these is similar to the yeast pheromone response pathway which includes 

the stel 1 protein kinase. Two other pathways require the activation of either one or both of 

the serine/threonine kinase-encoded oncogenes c-Raf-1 and c-Mos. Additionally, several 

studies suggest a possible effect of the cell cycle control regulator cyclin-dependent kinase 1 

(cdc2) on MAPKK activity. Finally, MAPKKs are apparently essential transducers through 

which signals must pass before reaching the nucleus. For review, see, e.g., Biologique Biol 

Ce//(1993) 79:1 93-207; Nishidae/ al., Trends Biochem Sci (1993) 75:128-31; Ruderman 

Curr Opin Cell Biol (1993) 5:207-13; Dhanasekaran et ai 9 Oncogene (1998) 77:1447-55; 

Kiefer et aL, Biochem Soc Trans (1997) 25:491-8; and Hill, Cell Signal (1996) 5:533-44. 

Those skilled in the art will recognize, or be able to ascertain, using not more than 
routine experimentation, many equivalents to the specific embodiments of the invention 
described herein. Such specific embodiments and equivalents are intended to be 
encompassed by the following claims. 

All publications and patent applications cited in this specification are herein 
incorporated by reference as if each individual publication or patent application were 
specifically and individually indicated to be incorporated by reference. The citation of any 
publication is for its disclosure prior to the filing date and should not be construed as an 
admission that the present invention is not entitled to antedate such publication by virtue of 
prior invention. 
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Although the foregoing invention has been described in some detail by way of 

illustration and example for purposes of clarity of understanding, it is readily apparent to 

those of ordinary skill in the art in light of the teachings of this invention that certain changes 

and modifications may be made thereto without departing from the spirit or scope of the 

appended claims. 



Deposit Information : 

The following materials were deposited with the American Type Culture Collection: 
CMCC = (Chiron Master Culture Collection) 



Cell Lines Deposited with ATCC 



Cell Line 


Deposit Date 


ATCC Accession No. 


CMCC Accession No. 


KM12L4-A 


March 19, 1998 


CRL-12496 


11606 


Kml2C 


May 15, 1998 


CRL-12533 


11611 


MDA-MB-231 


May 15, 1998 


CRL-12532 


10583 


MCF-7 


October 9, 1998 


CRL-12584 


10377 
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cDNA Library ESI - ATCC# 
Deposit Date - December 22, 1998 



Clone Name 


Cluster ID 


Sequence Name 


M00001395A:C03 


4016 


79.Al.sp6:130016.Seq 


M00001395A:C03 


4016 


RTA00000118A.C.4.1 


M00001449A:D12 


3681 


RTA00000131A.g.l5.2 


M00001449A:D12 


3681 


79.El.sp6:130064.Seq 


M00001452A:D08 


1120 


79.C2.sp6:130041.Seq 


M00001452A:D08 


1120 


RTAO0000118A.p.l5.3 


M00001513A:B06 


4568 


79.D4.sp6:130055.Seq 


M00001513A:B06 


4568 


RTA00000122A.d.l5.3 


M00001517A:B07 


4313 


79.F4.sp6:130079.Seq 


M00001517A:B07 


4313 


RTA00000122A.n.3.1 


M00001533A:C11 


2428 


RTA00000123A.1.21.1 


M00001533A:C11 


2428 


79.A5.sp6:130020.Seq 


M00001533A:C11 


2428 


RTA00000123 A.1.21 . 1 .Seq_THC205063 


M00001542A:A09 


22113 


79.F5.sp6:130080.Seq 


M00001542A:A09 


22113 


RTA00000125A.C.7.1 


cDNA Library ES2 - 


ATCC# 




Deposit Date - December 22, 1998 




Clone Name 


Cluster ID 


Sequence Name 


M00001343C.F10 


2790 


80.El.sp6:130256.Seq 


M00001343C:F10 


2790 


RTAOO0OO 1 77AF.e.2. 1 .Seq_THC22946 1 


M00001343C:F10 


2790 


RTAOO000 1 77AF.e.2. 1 


M00001343D:H07 


23255 


100.Cl.sp6:131446.Seq 


M00001343D:H07 


23255 


RTA0O000 1 77AF.e. 1 4.3 .Seq_THC228776 


M00001343D:H07 


23255 


80.Fl.sp6:130268.Seq 


M00001343D:H07 


23255 


RTA0O0O0 1 77AF.e. 1 4.3 


M00001345A:E01 


6420 


172.El.sp6:133925.Seq 


M00001345A:E01 


6420 


RTA00000177AF.f.l0.3 


M00001345A:E01 


6420 


RTA000001 77AF.f. 1 0.3 .Seq_THC226443 


M00001345A:E01 


6420 


80.Gl.sp6:130280.Seq 


M00001347A:B10 


13576 


80.D2.sp6:130245.Seq 


M00001347A:B10 


13576 


100.El.sp6:131470.Seq 


M00001347A:B10 


13576 


RTA00000177AF.g.l6.1 


M00001353A:G12 


8078 


80.E3.sp6:130258.Seq 


M00001353A:G12 


8078 


RTA00000177AR.1.13.1 


M00001353A:G12 


8078 


172.C3.sp6:133903.Seq 


M00001353D:D10 


14929 


RTA00000177AF.m.l.2 


M00001353D:D10 


14929 


80.F3.sp6:130270.Seq 


M00001353D:D10 


14929 


172.D3.sp6:133915.Seq 


M00001361A:A05 


4141 


80.B4.sp6:130223.Seq 


M00001361A:A05 


4141 


RTA00000177AF.p.20.3 


M00001362B:D10 


5622 


80.D4.sp6:130247.Seq 


M00001362B:D10 


5622 


RTA00000178AF.a.ll.l 
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cDNA Library ES3 - ATCC# 
Deposit Date - December 22, 1998 
Clone Name 
M00001362C:H11 
M00001362C:H11 
M00001362C:H11 
M00001362C:H11 
M00001376B:G06 
M00001376B:G06 
M0O0O1387A:CO5 
M0OOO1387A:CO5 
M00001412B:B10 
M00001412B:B10 
M00001415A:H06 
M00001415A:H06 
M00001416B:H11 
M00001416B:H11 
M00001429D:D07 
M00001429D:D07 
M00001448D:H01 
M00001448D:H01 
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Cluster ID 


Sennence Name 


945 


RTA00000178AR a 20 1 


945 


1 00.E4.sp6: 1 3 1 473 .Seq 


945 


80 E4 sd6* 130259 Sea 


945 


1 80 C2 sd6- 1 35940 Sen 


17732 


RTA00000178AR i 2 2 

A A \.\J\J\J\J\J A f UiilViIiLiL 


17732 


80 B5 sn6- 130224 Sen 


2464 


80 D6 sd6* 130249 Sea 


2464 


RTA00000178AF n 18 1 

* i»wvw * / uni .11. I U . 1 


8551 


RTA00000 1 79 AF d 2 1 1 


8551 


80 G7 <;n<vn02Xfi Sen 


13538 


80.B8.sp6:130227.Seq 


13538 


RTA00000180AF.a.24.1 


8847 


80.C8.sp6:130239.Seq 


8847 


RTA00000180AF.b.l6.1 


40392 


RTA00000180AF.j.8.1 


40392 


80.H9.sp6:130300.Seq 


36313 


80.All.sp6:130218.Seq 


36313 


RTA00000181AF.e.23.1 



cDNA Library ES4 - ATCC# 
Deposit Date - December 22, 1998 



Clone Name 


Cluster ID 


M00001463C:B11 


19 


M00001463C:B11 


19 


M00001470A:B10 


1037 


M00001470A:B10 


1037 


M00001497A:G02 


2623 


M00001497A:G02 


2623 


M00001500A:E11 


2623 


M00001500A:Ell 


2623 


M00001501D:C02 


9685 


M00001501D:C02 


9685 


M00001501D:C02 


9685 


M00001504C:H06 


6974 


M00001504C:H06 


6974 


M00001504C:H06 


6974 


M00001504D:G06 


6420 


M00001504D:G06 


6420 


M00001504D:G06 


6420 


M00001504D:G06 


6420 


M00001528A:C04 


35555 


M00001528A.C04 


7337 


M00001528A:C04 


35555 



Sequence Name 

RTA00000182AF.b.7.1 

89.Dl.sp6:130703.Seq 

89.F2.sp6:130728.Seq 

RTA00000121A.f.8.1 

89.F3.sp6:130729.Seq 

RTA00000183AF.a.6.1 

RTA00000183AF.b.l4.1 

89.A4.sp6:130670.Seq 

RTAO0000 1 83 AF.c. 11.1 .Seq_THC 1 09544 

RTA00000183AF.C.I1.1 

89.C4.sp6:130694.Seq 

89.F4.sp6:130730.Seq 

RTA00000183AF.d.9.1 

RTAOOO00 1 83AF.d.9. 1 .Seq_THC223 1 29 

173.F5.SP6:134133.Seq 

89.G4.sp6:130742.Seq 

RTA00000 1 83 AF.d. 1 1 . 1 .Seq_THC226443 

RTA00000183AF.d.ll.l 

89.B6.sp6:130684.Seq 

RTA00000123A.b.l7.1 

184.A5.sp6:135530.Seq 



123 



WO 99/33982 PCT/US98/27610 
cDNA Library ESS - ATCC# 



Deposit Date - December 22, 1998 



Clone Name 


Cluster ID 


Sequence Name 


M00001537B:G07 


3389 


RTA00000183AF.m.l9.1 


M00001537B:G07 


3389 


89A8.sp6:130674.Seq 


M00001541A:D02 


3765 


89.C8.sp6:130698.Seq 


M00001541A:D02 


3765 


RTA00000135A.d.l.l 


M00001544B:B07 


6974 


89.A9.sp6:130675.Seq 


M00001544B:B07 


6974 


RTA00000184AF.a.l5.1 


M00001546A:G11 


1267 


89.D9.sp6:130711.Seq 


M00001546A:G1 1 


1267 


RTA00000125A.O.5.1 


M00001549B:F06 


4193 


89.G9.sp6:130747.Seq 


M00001549B:F06 


4193 


RTA00000184AF.e.l3.1 


M00001556A:F11 


1577 


173.C9.SP6:134101.Seq 


M00001556A:F11 


1577 


89.Fll.sp6:130737.Seq 


M00001556A:F11 


1577 


RTA00000184AF.i.23.1 


M00001556B:C08 


4386 


RTA00000184AFJ.4.1 


M00001556B:C08 


4386 


89.Hll.sp6:130761.Seq 


cDNA Library ES6 - 


ATCC# 




Deposit Date - December 22, 1998 




Clone Name 


Cluster ID 


Sequence Name 


M00001563B:F06 


102 


RTA00000184AF.O.5.1 


M00001563B:F06 


102 


90.Bl.sp6: 130871. Seq 


M00001571C:H06 


5749 


90.El.sp6:130907.Seq 


M00001571C.H06 


5749 


RTA00000185AF.a.l9.1 


M00001594B:H04 


260 


90.D2.sp6:130896.Seq 


M00001594B:H04 


260 


RTA00000185AR.i.l2.2 


M00001597C:H02 


4837 


90.E2.sp6:130908.Seq 


M00001597C:H02 


4837 


RTA00000185AR.k.3.2 


M00001624C:F01 


4309 


90.C4.sp6:130886.Seq 


M00001624C:F01 


4309 


RTA000001 86AF.e.22. 1 


M00001679A:A06 


6660 


90.F6.sp6:130924.Seq 


M00001679A:A06 


6660 


122.B5.sp6:132089.Seq 


M00001679A:A06 


6660 


RTA00000187AF.h.l5.1 


M00003759B:B09 


697 


90.G8.sp6:130938.Seq 


M00003759B:B09 


697 


RTA00000188AF.d.6.1 


M00003759B:B09 


697 


RTA0O000 1 88AF.d.6. 1 .Seq_THC 1 78884 


M00003844C:B11 


6539 


176.D9.sp6:134556.Seq 


M00003844C:B11 


6539 


RTA00000189AF.d.22.1 


M00003844C:B11 


6539 


90.B10.sp6:130880.Seq 


M00003857A:G10 


3389 


90.All.sp6:130869.Seq 


M00003857A:G10 


3389 


RTA00000189AF.g.3.1 
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WO 99/33982 
cDNA Library ES7 - ATCC# 
Deposit Date - December 22, 1998 



PCI7US98/27610 



Clone Name 

M00003914C:F05 

M00003914C:F05 

M00003922A:E06 

M00003922A:E06 

M00003922A:E06 

M00003983A:A05 

M00003983A:A05 

M00004028D:A06 

M00004028D:A06 

M00004031A:A12 

M00004031A:A12 

M00004087D:A01 

M00004087D:A01 

M00004108A:E06 

M00004108A:E06 

M00004H4C:F11 

M00004114C:F11 

M00004114C:F11 



cDNA Library ES8 - ATCC# 
Deposit Date - December 22, 1998 



Cluster ID 


Seauence Name 


3900 


99.El.sp6:131278.Seq 


3900 


RTA00000190AF e 13 1 


23255 


RTA00000190AF i 4 1 


23255 


99 Fl sd6'131290 Sea 


23255 


RTA00000190AF.j.4.1.Seq_THC228776 


9105 


99.C3.sp6.i31256.Seq 


9105 


RTA00000191AFa21 2 


6124 


RTA00000191AR.e.2.3 


6324 


99 D3 sd6* 1 3 1 268 Sen 


9061 


RTA00000191AR.e.ll.2 


9061 


RTA00000191AR.e.ll.3 


6880 


RTA00000191AF.m.20.1 


6880 


99.A5.sp6:131234.Seq 


4937 


99.E5.sp6:131282.Seq 


4937 


RTA00000191AF.p.21.1 


13183 


123.D5.sp6:132305.Seq 


13183 


RTA00000192AF.a.24.1 


13183 


99.G5.sp6:131306.Seq 



Clone Name 


Cluster ID 


Sequence Name 


M00004146C:C11 


5257 


99.B6.sp6:131247.Seq 


M00004146C:C11 


5257 


177.F5.sp6:134768.Seq 


M00004146C:C11 


5257 


RTA00000192AF.f.3.1 


M00004146C:C11 


5257 


RTAO0000 1 92 AF.f.3 . 1 .Seq_THC2 1 3 833 


M00004157C:A09 


6455 


RTA00000192AF.g.23.1 


M00004157C:A09 


6455 


99.D6.sp6:131271.Seq 


M00004157C:A09 


6455 


123.E7.sp6:132319.Seq 


M00004172C:D08 


11494 


RTA00000192AF.j.6.1 


M00004172C:D08 


11494 


99.G6.sp6:131307.Seq 


M00004172C:D08 


11494 


177.E6.sp6:134757.Seq 


M00004229B:F08 


6455 


RTA00000193AF.b.9.1 


M00004229B:F08 


6455 


99.C8.sp6: 13 1261. Seq 


cDNA Library ES9 


- ATCC# 




Deposit Date - December 22, 1998 





M00001466A:E07 
M00001531A:H11 
M00001531A:H11 
MO0001551A:B10 
M00001551A:B10 
M00001551A:B10 
M00001552A:B12 
M00001552A:B12 
M00001556A:H01 
M00001586C:C05 
M00001604A:B10 
M00001604A:B10 
M00003879B:C11 
M00003879B:C11 



4275 



6268 

6268 

6268 

307 

307 

15855 

4623 

1399 

1399 

5345 

5345 



RTA00000120A.j.l4.1 

89. F6.sp6:130732.Seq 
RTA00000123A.g.l9.1 
79.G9.sp6:130096.Seq 
184.C12.sp6: 135561. Seq 
RTA00000126A.O.23.1 
RTA00000136A.O.4.2 
79.C7.sp6:130046.Seq 
RTA00000184AF.j.l.l 
RTA00000185AF.f.4.1 
79.G8.sp6:130095.Seq 
RTA00000129A.O.10.1 
RTA0O000189AF.1.19.1 

90. B12.sp6:130882.Seq 
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WO 99/33982 PCT/US98/27610 
cDNA Library ES10 - ATCC# 



Deposit Date - December 22, 1998 



Clone Name 


Cluster ID 


Sequence Name 


M00001358C:C06 




RTA00000177AF.O.4.3 


M00001388D:G05 


5832 


80.F6.sp6:130273.Seq 


M00001388D:G05 


5832 


RTA00000178AF.O.23.1 


M00001394A:F01 


6583 


RTA00000179AF.d.l3.1 


M00001394A:F01 


6583 


172.B8.sp6:133896.Seq 


M00001394A:F01 


6583 


80.H6.sp6:130297.Seq 


M00001429A:H04 


2797 


RTA00000180AF.i.l9.1 


M00001447A:G03 


10717 


RTA000OO181AF.d.lO.l 


M00001448D:C09 


8 


80.H10.sp6:130301.Seq 


M00001448D:C09 


8 


RTA00000181AF.e.l7.1 


M00001448D:C09 


8 


100.Bll.sp6:131444.Seq 


M00001454D:G03 


689 


RTA00000181AR.I.22.1 



cDNA Library ESI 1 - ATCC# 
Deposit Date - December 22, 1998 



Clone Name 

M00003975A:G11 

M00003978B:G05 

M00003978B:G05 

M00004059A:D06 

M00004068B:A01 

M00004068B:A01 

M00004205D:F06 

M00004205D:F06 

M00004205D:F06 

M00004212B:C07 

M00004223A:G10 



Cluster ID 

12439 

5693 

5693 

5417 

3706 

3706 



2379 
16918 



Sequence Name 

RTA00000190AF.O.24.1 

RTA0O000 1 90AF.p. 1 7.2.Seq_THC 1 733 1 8 

RTA00000190AF.p.l7.2 

RTA00000191AF.h.l9.1 

99.C4.sp6:131257.Seq 

RTA00000191AF.I.17.2 

99.E7.sp6:131284.Seq 

177.G7.sp6:134782.Seq 

RTA00000192AF.O.11.1 

RTA00000192AF.p.8.1 

RTA00000193AF.a.l6.1 



cDNA Library ESI 2 - ATCC# 
Deposit Date - December 22, 1998 



Clone Name 

M00004223B:D09 

M00004249D:G12 

M00004251C:G07 

M00004372A:A03 



Cluster ID 
7899 



2030 



Sequence Name 

RTA00000193AF.a.l7.1 

RTA00000193AF.C.22.1 

RTA00000193AF.d.2.1 

RTA00000193AF.m.20.1 



126 



WO 99/33982 
cDNA Library ESI 3 - ATCC# 
Deposit Date - December 22, 1998 



PCT/US98/27610 



Clone Name 

M00001340B:A06 

M00001340B:A06 

M00001340D:F10 

M00001340D:F10 

M00001341A:E12 

M00001341A:E12 

M00001342B:E06 

M00001342B:E06 

M00001346A:F09 

M00001346A:F09 

M00001346D:G06 

M00001346D.G06 

M00001348B:B04 

M00001348B:B04 

M00001348B:G06 

M00001348B:G06 

M00001349B:B08 

M00001349B:B08 

MO00O1350A:HOl 

M000O1350A:HOl 

M00001350A:H01 

M00001352A:E02 

M00001352A:E02 

M00001352A:E02 

M00001355B:G10 

M00001355B:G10 

M00001355B:G10 

M00001355B:G10 

M000O1361D:FO8 

MO00O1361D:FO8 

M00001365C:C10 

M00001365C:CIO 

M00001368D:E03 

M00001368D:E03 

M00001370A:C09 

M00001370A:C09 

M00001371C:E09 

M00001371C:E09 

M00001371C:E09 

M00001378B:B02 

M00001378B:B02 

M0O0O1379A:AO5 

M00001379A:A05 

MO00O1380D:BO9 

M00001380D:B09 

M00001381D:E06 

M00001381D:E06 

M00001382C:A02 

M00001382C:A02 

M00001384B:A11 

M00001384B:A11 

M00001386C:B12 



Cluster ID 

17062 

17062 

11589 

11589 

4443 

4443 

39805 

39805 

5007 

5007 

5779 

5779 

16927 

16927 

16985 

16985 

3584 

3584 

7187 

7187 

7187 

16245 

16245 

16245 

14391 

14391 

14391 

14391 

2379 

2379 

40132 

40132 



6867 

6867 

7172 

7172 

7172 

39833 

39833 

1334 

1334 

39886 

39886 



22979 
22979 



5178 



Sequence Name 

80.Al.sp6:130208.Seq 

RTA00000177AF.b.8.4 

80.Bl.sp6:130220.Seq 

RTA00000177AF.b.l7.4 

80.Cl.sp6:130232.Seq 

RTA00000177AF.b.20.4 

80.Dl.sp6:130244.Seq 

RTA00000177AF.C.21.3 

RTA00000177AF.g.2.1 

80.Hl.sp6:130292.Seq 

RTA00000177AF.g.l4.3 

RTA00000177AF.g.l4.1 

80.E2.sp6:130257.Seq 

RTA00000177AF.h.9.3 

RTAOO000177AF.h.lO.l 

80.F2.sp6:130269.Seq 

RTA00000177AF.h.20.1 

80.G2.sp6:130281.Seq 

100.C2.sp6:131447.Seq 

80.A3.sp6:130210.Seq 

RTA00000177AF.i.8.2 

RTA00000177AF.k.9.3 

172.D2.sp6:133914.Seq 

80.D3.sp6:130246.Seq 

RTA00000177AF.m.l7.3 

80.G3.sp6:130282.Seq 

172.H3.sp6:133963.Seq 

100.E3.sp6:131472.Seq 

80.C4.sp6:130235.Seq 

RTA00000178AF.a.6.1 

RTA00000178AF.C.7.1 

80.F4.sp6:130271.Seq 

80.G4.sp6:130283.Seq 

RTA00000178AF.d.20.1 

80.H4.sp6:130295.Seq 

RTA00000178AF.e.l2.1 

100.A5.sp6:131426.Seq 

RTA00000178AF.f.9.1 

80.A5.sp6:130212.Seq 

8O.C5.sp6:130236.Seq 

RTA00000178AF.i.23.1 

80.D5.sp6:130248.Seq 

RTA00000178AF.j.7.1 

RTA00000178AFJ.24.1 

80.E5.sp6:130260.Seq 

80.F5.sp6:130272.Seq 

RTA00000178AF.k.l6.1 

80.G5.sp6:130284.Seq 

RTA00000178AF.k.22.1 

80.B6.sp6:130225.Seq 

RTA00000178AF.m.l3.1 

80.C6.sp6:130237.Seq 
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WO 99/33982 
cDNA Library ESI 3 - ATCC# 
Deposit Date - December 22, 1998 



PCT/US98/27610 



Clone Name 

M00001386C:B]2 

MOO001387B:G03 

M00001387B:GO3 

M00001389A:C08 

M00001389A:C08 

M00001396A:C03 

M00001396A:C03 

M00001396A:C03 

M00001400B.-H06 

M00001400B:H06 

M00001400B:H06 

M00001400B:H06 

M00001402A:E08 

M00001402A:E08 

M00001407B.D11 

M0000-1407B:D11 

M00001410A:D07 

M00001410A:D07 

M00001410A:D07 

M00001414A:B01 

M00001414A:B01 

M00001414C:A07 

M00001414C:A07 

M00001416A:H01 

M00001416A:H01 

M00001417A:E02 

M00001417A:E02 

M00001423B:E07 

M00001423B:E07 

M00001424B:G09 

M00001424B:G09 

M00001425B:H08 

M00001425B:H08 

M00001426B:D12 

M00001426B:D12 

M00001426D:C08 

M00001426D:C08 

M00001428A:H10 

M00001428A:H10 

M00001428A:H10 

M00001449A:A12 

M00001449A:A12 

M00001449A:B12 

M00001449A:B12 

M00001449A:G10 

M00001449A:G10 

M00001449A:G10 

M00001449C:D06 

M00001449C:D06 

M00001450A:A02 

M00001450A:A02 

M00001450A:A02 



Cluster ID 

5178 

7587 

7587 

16269 

16269 

4009 

4009 

4009 



39563 

39563 

5556 

5556 

7005 

7005 

7005 



7674 

7674 

36393 

36393 

15066 

15066 

10470 

10470 

22195 

22195 



4261 

4261 

84182 

84182 

84182 

5857 

5857 

41633 

41633 

36535 

36535 

36535 

86110 

86110 

39304 

39304 

39304 



Sequence Name 

RTA00000178AF.n.l0.1 

80.E6.sp6:l 30261. Seq 

RTA00000178AF.n.24.1 

RTA00000178AF.p.l.l 

80.G6.sp6:130285.Seq 

I72.D8.sp6:133920.Seq 

80.A7.sp6:130214.Seq 

RTA00000179AF.e.20.1 

172.B9.sp6:133897.Seq 

80.B7.sp6:130226.Seq 

RTA00000179AF.j.l3.1 

RTA00000 1 79AFJ.1 3 . 1 .Seq_THC 1 05720 

80.C7.sp6:130238.Seq 

RTA00000179AF.k.20.1 

RTA00000179AF.n.l0.1 

80.D7.sp6:130250.Seq 

180.H5.sp6:136003.Seq 

RTA00000179AF.O.22.1 

80.F7.sp6:130274.Seq 

RTA00000180AF.a.9.1 

80.H7.sp6:130298.Seq 

80.A8.sp6:130215.Seq 

RTA00000180AF.a.ll.l 

79. Cl.sp6:130040.Seq 
RTA00000118A.g.9.1 
RTA00000180AF.C.2.1 

80. D8.sp6: 130251. Seq 
RTA00000180AF.e.24.1 
80.H8.sp6:130299.Seq 
80.A9.sp6:130216.Seq 
RTA00000180AF.f.l8.1 
RTA000OO18OAF.g.7.1 
80.B9.sp6:130228.Seq 
RTA00000180AF.g.22.1 
80.C9.sp6:130240.Seq 
80.D9.sp6:130252.Seq 
RTA00000180AF.h.5.1 
100.G9.sp6:131502.Seq 
RTA00000180AF.h.l9.1 
80.E9.sp6:130264.Seq 
80.Bll.sp6:130230.Seq 
RTA00000118A.g.l4.1 
80.Cll.sp6:130242.Seq 
RTA00000118A.g.l6.1 
RTA00000181AF.f.5.1 
80.Dll.sp6:130254.Seq 
100.Dll.sp6:131468.Seq 
RTA00000181AF.f.l2.1 
80.Ell.sp6:130266.Seq 

RTA00000 1 1 8A.j .21.1 .Seq_THC 1 5 1 859 

RTA00000118A.j.21.1 

79.Fl.sp6:130076.Seq 
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WO 99/33982 PCT/US98/27610 
cDNA Library ESI 3 - ATCC# 



Deposit Date - December 22, 1998 



Clone Name 


Cluster ID 


Sequence Name 


M00001450A:A02 


39304 


180.G9.sp6:135995.Seq 


M00001450A:A11 


32663 


80.Fll.sp6:I30278.Seq 


M00001450A:A11 


32663 


RTAOOOOO 118A.1.8.1 


M00001450A:B12 


82498 


100.Fll.sp6:131492.Seq 


M00001450A:B12 


82498 


RTAOOOOO 118A.m. 10.1 


M00001450A:B12 


82498 


79.Gl.sp6:130088.Seq 


M00001450A:D08 


27250 


80.Gll.sp6:130290.Seq 


M00001450A:D08 


27250 


180.B10.sp6:135936.Seq 


M00001450A:D08 


27250 


RTA00000181AF.g.l0.1 


M00001452AB04 

XT A\ WW * » * * • V ■ 


84328 


RTAOOOOO 118A.p. 10.1 


M00001452A:B04 


84328 


79.A2.sp6:130017.Seq 


M00001452A:B12 


86859 


RTAOOOOO 118A.p.8.1 


M00001452A-B12 


86859 


79 B2.sd6: 130029 Sea 


M00001452AF05 

XTXWW X 


85064 


RTA00000 1 3 1 A.m .23 . 1 


M00001452AF05 


85064 


79 D2 sd6 1 30053 Sea 


M00001452C-B06 


16970 


80 Hll sd6* 130302 Sea 

VJ Willi X • l_J V/ ■ X mJ \J ~J \M A\+ t»JVVJ 


M00001452C-B06 


16970 


100 C12 sd6-131457 Sea 


M00001452CB06 


16970 


RTA00000181AR i 18 2 

XV A I X \/ W W * w XX 1 ■ X \M mAW 


M00001453A-E11 


16130 


80 A12 sd6*130219 Sea 


M00001453A-E11 


16130 


100 D12 sd6'131469 Sea 


M00001453A-E11 

i ~ x v \/ w i r +s +j a x m xj x x 


16130 


RTA00000119A c 13 l 

XV X I x w w v/ x X ^ A X • W • X » X 


M00001453C:F06 


16653 


80.B12.sp6:130231.Seq 


M00001453C-F06 

x»x w W X »^*/^ W • A w 


16653 


RTA000001 8 1 AF.k.5.3 


M00001454A:A09 


83103 


RTAOOOOO 1 19A.e.24.2 


M00001454AA09 

X»A W W X • «/ Ti V * i XV/-*' 


83103 


79.G2.sp6:130089.Seq 


M00001454B-C12 

X v A w W X T T U m V— > X A* 


7005 


121 D1.sd6:131917 Sea 


M00001454B:C12 


7005 


RTAOOOOO 1 8 1 AF.k.24. 1 


M00001454BC12 


7005 


80.C 1 2.sp6: 1 30243 .Seq 


M00001455B:E12 


13072 


80.F12.sp6:130279.Seq 


M00001455BE12 


13072 


RTA00000181 AR m 5 2 

A\ X a X W V W X V X A ItVf III • 


M00001460AF06 

A ▼ X- V/ W v X F WX X» ■ X V w 


2448 


89.Al.sp6:130667.Seq 


M00001460A:F06 


2448 


RTA00000119A.j.21.1 


M00001461A:D06 


1531 


89.Cl.sp6:130691.Seq 


M00001461A:D06 


1531 


RTAOOOOO 119A.0.3.1 


M00001465A:B11 


10145 


79.F3.sp6:130078.Seq 


M00001465A:B11 


10145 


RTAOOOOO 120A.g. 12.1 


M00001467A:B07 


38759 


89.Fl.sp6:130727.Seq 


M00001467A:B07 


38759 


RTAOOOOO 120 A.m. 12.3 


M00001467A:D04 


39508 


RTAOOOOO 120A.O.2.1 


M00001467A:D04 


39508 


89.Gl.sp6:130739.Seq 


M00001467A:E10 


39442 


89.A2.sp6:130668.Seq 


M00001467A:E10 


39442 


RTAOOOOO 120A.O.2 1.1 


M00001468A:F05 


7589 


RTAOOOOO 1 20 A.p.23.1 


M00001468A:F05 


7589 


89.B2.sp6:130680.Seq 


M00001469A:A01 




RTA00000121A.C.10.1 


M00001469A:A01 




89.C2.sp6:130692.Seq 


M00001469A:C10 


12081 


89.D2.sp6:130704.Seq 


M00001469A:C10 


12081 


RTA00000133A.d.l4.2 


M00001469A:H12 


19105 


89.E2.sp6:130716.Seq 


M00001469A:H12 


19105 


RTA00000133A.e.l5.1 


M00001470A:C04 


39425 


89.G2.sp6:130740.Seq 


M00001470A:C04 


39425 


RTA00000133A.f.l.l 
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Clone Name 

M00001471A:B01 

M00001471A:B01 

M00001487B:H06 

M00001487B:H06 

M00001488B:F12 

M00001488B:F12 

M00001494D:F06 

M00001494D:F06 

M00001499B:A11 

M00001499B:A11 

M00001499B:A11 

MOO0015OOA:CO5 

MO00O15OOA:CO5 

M00001504A:E01 

M00001504A:E01 

M00001504A:E01 

M00001504C:A07 

M00001504C:A07 

M00001505C:C05 

M00001505C:C05 

M00001506D:A09 

M00001506D:A09 

M00001506D:A09 

M00001507A:H05 

M00001507A:H05 

M00001535A:F10 

M00001535A:F10 

M00001541A:H03 

M00001541A:H03 

M00001544A:G02 

M00001544A:G02 

M00001545A:D08 

M00001545A:D08 

M00001551A:F05 

M00001551A:F05 

M00001552A:D11 

M00001552A:D11 

M00001557A:F03 



Cluster ID 

39478 

39478 



7206 

7206 

10539 

10539 

10539 

5336 

5336 



10185 
10185 



39168 
39168 
39423 
39423 
39174 
39174 
19829 
19829 
13864 
13864 
39180 
39180 
39458 
39458 
39490 



Sequence Name 

89.H2.sp6:130752.Seq 

RTA00000133A.1.5.1 

RTA00000182AF.1.15.1 

89.B3.sp6:130681.Seq 

RTA00000182AF.1.20.1 

89.C3.sp6:l 30693. Seq 

RTA00000182AF.O.15.1 

89.E3.sp6:130717.Seq 

RTA00000183AF.a.24.1 

89.G3.sp6:130741.Seq 

173.B5.SP6:134085.Seq 

RTA00000183AF.b.l3.1 

89.H3.sp6:130753.Seq 

RTA00000183AF.C.24.1 

89.D4.sp6:130706.Seq 

RTA00000183AF.c.24.1.Seq_THC125912 

RTA00000183AF.d.5.1 

89.E4.sp6:130718.Seq 

89.H4.sp6:130754.Seq 

RTA00000183AF.e.l.l 

89.A5.sp6: 130671. Seq 

RTA00000183AF.e.23.1 

121.G6.sp6:131958.Seq 

RTA0OO00121A.1.1O.1 

89.B5.sp6:130683.Seq 

79.C5.sp6:130044.Seq 

RTA00000I34A.k.22.1 

79.E5.sp6:130068.Seq 

RTA00000124A.n.l3.1 

79.H5.sp6:130104.Seq 

RTA00000125A.h.24.4 

RTA00000125A.m.9.1 

79.B6.sp6:130033.Seq 

RTA00000126A.n.8.2 

79.A7.sp6:130022.Seq 

RTA00000126A.p.l5.2 

79.D7.sp6:130058.Seq 

RTA00000128A.b.4.1 
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Clone Name 

M00001511A:H06 

M00001511A:H06 

M00001512A:A09 

M00001512A:A09 

M00001512D:G09 

M00001512D:G09 

M00001512D:G09 

MOOO01513B:GO3 

M00001513B:G03 

M00001513B:G03 

M00001513C:E08 

M0OO01513C:E08 

M00001514C:D11 

M00001514C:D11 

M00001514C:D11 

M00001518C:B11 

M00001518C:B11 

M00001528B:H04 

M00001528B:H04 

M00001531A:D01 

M00001531A:D01 

M00001534A:C04 

M00001534A:C04 

M00001534A:D09 

M00001534A:D09 

M00001534C:A01 

M00001534C:A01 

M00001535A:C06 

M00001535A:C06 

M00001535A:C06 

M00001536A:B07 

M00001536A:B07 

M00001537A:F12 

M00001537A:F12 

M00001540A:D06 

M00001540A:D06 

M00001542A:E06 

M00001542A:E06 

M00001544A:E06 

M00001544A:E06 

M00001544A:E06 

M00001545A:B02 

M00001545A:B02 

M00001548A:E10 

M00001548A:E10 

M00001548A:E10 

M00001549C:E06 

M00001549C:E06 

M00OO155OA:A03 

M00001550A:A03 

M00001550A:G01 

M00001550A:G01 



Cluster ID 

39412 

39412 

39186 

39186 

3956 

3956 

3956 



14364 

14364 

40044 

40044 

40044 

8952 

8952 

8358 

8358 

38085 

38085 

16921 

16921 

5097 

5097 

4119 

4119 

20212 

20212 

20212 

2696 

2696 

39420 

39420 

8286 

8286 

39453 

39453 



5892 

5892 

5892 

16347 

16347 

7239 

7239 

5175 

5175 



Sequence Name 

RTA00000133A.k.l7.1 

89.C5.sp6:130695.Seq 

89.D5.sp6:130707.Seq 

RTA00000121A.p.l5.1 

89.E5.sp6:130719.Seq 

173.H5.SP6:134157.Seq 

RTA00000183AF.g.3.1 

RTA00000183AF.g.9.1 

89.F5.sp6: 13073 l.Seq 

RTA000001 83 AF.g.9.1.Seq_THC 198280 

RTA00000183AF.g.l2.l 

89.G5.sp6:130743.Seq 

RTA00000183AF.g.22.1 

RTA00000183AF.g.22.1.Seq_THC232899 

89.H5.sp6:130755.Seq 

89.A6.sp6:130672.Seq 

RTA00000183AF.h.l5.1 

89.D6.sp6:130708.Seq 

RTA00000183AF.i.5.1 

RTA00000123A.e.l5.1 

89.E6.sp6:130720.Seq 

RTA00000183AF.k.6.1 

89.H6.sp6:130756.Seq 

RTA00000134A.k.l.l 

RTAOOOOO 134A.k. 1 . 1 .Seq_THC2 1 5869 

RTA00000183AF.k.l6.1 

89.C7.sp6:130697.Seq 

89.E7.sp6:130721.Seq 

RTA00000134A.1.22.1 .Seq_THC 128232 

RTA00000134A.1.22.1 

RTA00000134A.m.l3.1 

89.F7.sp6:130733.Seq 

89.H7.sp6:130757.Seq 

RTA00000134A.O.23.1 

89.B8.sp6:130686.Seq 

RTAOOOOO 1 83 AF.o. 1.1 

89.E8.sp6:130722.Seq 

RTA00000135A.g.ll.l 

RTAOOOOO 184AF.a.8.1 

173.G7.SP6:134147.Seq 

89.H8.sp6:130758.Seq 

89.B9.sp6:130687.Seq 

RTA00000135A.1.2.2 

89.E9.sp6:130723.Seq 

RTAOOOOO 184AF.d. 11.1 

RTAOOOOO 1 84AF.d. 11.1 .Seq_THC 1 6 1 896 

89.H9.sp6:130759.Seq 

RTA00000184AF.e.l5.1 

89.A10.sp6:130676.Seq 

RTAOOOOO 126A.m.4.2 

RTA00000184AF.f.3.1 

89.B10.sp6:130688.Seq 
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Clone Name 

M00001551A:G06 

M00001551A:G06 

M00001551C:G09 

M00001551C:G09 

M00001553A:H06 

M00001553A:H06 

M00001553B:F12 

M00001553B:F12 

M00001555A:B02 

M00001555A:B02 

M000O1555A:COl 

M00001555A:C01 

M00001555D:G10 

M00001555D:G10 

M00001556A:C09 

M00001556A-.C09 

M00001556B:G02 

M00001556B:G02 

M000O1557B:H10 

M000O1557B:H10 

M00001557B:H10 

M00001557D:D09 

M0O0O1557D:D09 

M00001558B-.H11 

M00001558B:H11 

M00001559B:F01 

M000O1559B:F01 

MG0001560D:F10 

M00001560D:F10 

M00001566B:D11 

M00001566B:D11 

M00001583D:A10 

M00001583D:A10 

M00001590B:F03 

M00001590B:F03 

M00001597D:C05 

M00001597D:C05 

M00001598A:G03 

M00001598A:G03 

M00001601A:D08 

M00001601A:D08 

M00001607A:E11 

M00001607A:E11 

M00001608A:B03 

M00001608A:B03 

M00001608B:E03 

M00001608B:E03 

M00001608D:A11 

M00001608D:A11 

M00001614C:F10 

M00001614C:F10 

M00001617C:E02 



Cluster ID 

22390 

22390 

3266 

3266 

8298 

8298 

4573 

4573 

39539 

39539 

39195 

39195 

4561 

4561 

9244 

9244 

11294 

11294 

5192 

5192 

5192 

8761 

8761 

7514 

7514 



6558 
6558 



6293 
6293 



10470 

10470 

16999 

16999 

22794 

22794 

11465 

11465 

7802 

7802 

22155 

22155 



13157 
13157 
17004 



Sequence Name 

RTA00000136AJ.13.1 

89.C10.sp6:130700.Seq 

RTA00000184AR.g.l.l 

89.D10.sp6:130712.Seq 

RTA00000127A.d.l9.1 

89.G10.sp6:130748.Seq 

89.H10.sp6:130760.Seq 

RTA00000184AF.h.9.1 

RTA00000127A.i.21.1 

89.Bll.sp6:130689.Seq 

89.Cll.sp6:130701.Seq 

RTA00000137A.C.16.1 

RTA00000184AF.i.21.1 

89.Dll.sp6:130713.Seq 

89.Ell.sp6:130725.Seq 

RTA00000127A.1.3.1 

RTA00000184AFJ.6.1 

89.A12.sp6:130678.Seq 

173.E9.SP6:134125.Seq 

RTA00000184AF.k.2.1 

89.D12.sp6:130714.Seq 

RTA00000184AF.k.l2.1 

89.E12.sp6:130726.Seq 

RTA00000184AF.k.21.1 

89.G12.sp6:130750.Seq 

89. H12.sp6:130762.Seq 
RTA00000184AF.1.11.1 

90. Al.sp6:130859.Seq 
RTA00000184AF.m.21.1 
RTA00000184AF.p.3.1 
90.Dl.sp6:130895.Seq 
RTA00000185AF.e.ll.l 
90.A2.sp6:130860.Seq 
RTA00000185AF.g.ll.l 
90.C2.sp6:130884.Seq 
RTA00000185AF.k.6.1 
90.F2.sp6:130920.Seq 
90.G2.sp6.130932.Seq 
RTA00000185AF.k.9.1 
RTA00000138A.b.5.1 
90.H2.sp6:130944.Seq 
RTA00000185AF.m.l9.1 
90.A3.sp6:130861.Seq 
RTA00000185AF.n.5.1 
90.B3.sp6:130873.Seq 
RTA00000185AF.n.9.1 
90.C3.sp6:130885.Seq 
RTA00000185AF.n.l2.1 
90.D3.sp6:130897.Seq 
RTA00000186AF.a.6.1 
90.E3.sp6:130909.Seq 
RTA00000186AF.b.21.1 
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Clone Name 

M00001617C:E02 

M00001619C:F12 

M00001619C:F12 

MQ0001621C:C08 

M00001621C:C08 

M00001621C:C08 

M00001621C:C08 

M00001623D:F10 

M00001623D:F10 

M00001632D:H07 

M00001632D:H07 

M00001632D:H07 

M00001632D:H07 

M00001644C:B07 

M00001644C:B07 

M00001644C:B07 

M00001645A:C12 

M00001645A:C12 

M00001645A:C12 

M00001645A:C12 

M00001648C:A01 

M00001648C:A01 

M00001657D:C03 

M00OO1657D:C03 

M00001657D:F08 

M00001657D:F08 

M00001662C:A09 

M00001662C:A09 

M00001663A:E04 

M00001663A:E04 

M00001669B:F02 

M00001669B:F02 

M00001670C:H02 

M00001670C:H02 

M00001673C.H02 

M00001673C:H02 

M00001675A.-C09 

M00001675A:C09 

M00001675A:C09 

M00001676B:F05 

M00001676B:F05 

M00001676B:F05 

M00001677D:A07 

M00001677D:A07 

M00001677D:A07 

M00001678D:F12 

M00001678D.-F12 

M00001679A:F10 

M00001679A:F10 

M00001679B:F01 

M00001679B:F01 

M00001680D:F08 



Cluster ID 

17004 

40314 

40314 

40044 

40044 

40044 

40044 

13913 

13913 



39171 

39171 

39171 

19267 

19267 

19267 

19267 

4665 

4665 

23201 

23201 

76760 

76760 

23218 

23218 

35702 

35702 

6468 

6468 

14367 

14367 

7015 

7015 

8773 

8773 

8773 

11460 

11460 

11460 

7570 

7570 

7570 

4416 

4416 

26875 

26875 

6298 

6298 

10539 



Sequence Name 

90.F3.sp6:130921.Seq 

90.G3.sp6:130933.Seq 

RTA00000186AF.C.15.1 

RTA00000186AF.d.l.l 

RTA00000 1 86AF.d. 1 . 1 .Seq_THC232899 

90.H3.sp6:130945.Seq 

122.El.sp6:132121.Seq 

RTA00000186AF.e.6.1 

90.A4.sp6:130862.Seq 

RTA00000 1 86AF.h. 14.1 .Seq_THC 1 1 2525 

RTA00000186AF.h.l4.1 

90.E4.sp6:130910.Seq 

176.A3.sp6:134514.Seq 

RTA00000186AF.1.7.1 

90.F4.sp6:130922.Seq 

217.A12.sp6:139369.Seq 

RTA00000 1 86AF.1. 1 2. 1 .Seq_THC 1 78 1 83 

176.G3.sp6:134586.Seq 

RTA00000186AF.1.12.1 

90.G4.sp6:130934.Seq 

90.H4.sp6:130946.Seq 

RTA00000186AF.m.3.1 

RTA00000187AF.a.l4.1 

90.B5.sp6:130875.Seq 

90.C5.sp6:130887.Seq 

RTA00000187AF.a.l5.1 

RTA000O0187AR.C.5.2 

90.D5.sp6:130899.Seq 

90.E5.sp6:130911.Seq 

RTA00000187AR.C.15.2 

90.F5.sp6:130923.Seq 

RTA00000187AF.d.l5.1 

90.G5.sp6:130935.Seq 

RTA00000187AF.e.8.1 

90.H5.sp6:130947.Seq 

RTA00000187AF.f.l8.1 

RTA00000187AF.f.24.1 

90.A6.sp6:130864.Seq 

RTAOOOOO 1 87AF.f.24. 1 .Seq_THC220002 

RTA00000187AF.g.l2.1 

90.B6.sp6:130876.Seq 

219.F2.sp6:139035.Seq 

90.D6.sp6:130900.Seq 

RTAOOOOO 1 87 AF.g.24.1 

RTA000001 87AF.g.24. 1 .Seq_THC 1 68636 

90.E6.sp6:130912.Seq 

RTA00000187AF.h.l3.1 

RTAOOOOO 187AF.i. 1.1 

90.A7.sp6:130865.Seq 

90.B7.sp6:130877.Seq 

RTA00000187AR.i.l0.2 

90.F7.sp6:130925.Seq 
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Clone Name 
M0000l68OD:FO8 
M00001680D:F08 
M00001682C:BI2 
M00001682C:B12 
M00001682C:B12 
M00001688C:F09 
M00001688C:F09 
M00001693C:G01 
M00001693C:G01 
M00001716D:H05 
M00001716D:H05 
M00003741D:C09 
M00003741D:C09 
M00003747D:C05 
M00003747D:C05 
M00003747D:C05 
M00003747D:C05 
M00003754C:E09 
M00003754C:E09 
M00003761D.A09 
M00003761D:A09 
M00003761D:A09 
M00003762C:B08 
M00003762C:B08 
M00003762C:B08 
M00003763A:F06 
M000O3763A:F06 
MO00O3774C:A03 
M00003774C:A03 
M00003774C:A03 
M00003784D:D12 
M00003784D:D12 
M000O3839A:D08 
MO00O3839A:DO8 
M000O3851B:D08 
M00003851B:D08 
M00003851B:D10 
M00003851B:D10 
M00003853A:D04 
M00003853A:D04 
M00003853A:F12 
M00003853A:F12 
M00003856B:C02 
M00003856B:C02 
M00003857A:H03 
MO00O3857A:HO3 
M00003857A:H03 



Cluster ID 

10539 

10539 

17055 

17055 

17055 

5382 

5382 

4393 

4393 

67252 

67252 

40108 

40108 

11476 

11476 

11476 

11476 



17076 

17076 

17076 

3108 

3108 

67907 

67907 

67907 



7798 
7798 



13595 

13595 

5619 

5619 

10515 

10515 

4622 

4622 

4718 

4718 

4718 



Sequence Name 

219.F6.sp6:139039.Seq 

RTA00000187AF.I.7.1 

90.G7.sp6:130937.Seq 

RTA00000187AF.m.3.1 

176.D6.sp6:l 34553. Seq 

90.A8.sp6:130866.Seq 

RTA00000187AF.m.23.2 

RTAOO00Ol87AF.n.l7.1 

90.B8.sp6:130878.Seq 

RTA00000187AF.O.6.1 

90.C8.sp6:130890.Seq 

90.D8.sp6:130902.Seq 

RTA00000187AF.O.24.1 

RTA00000187AF.p.l9.1 

90.E8.sp6:130914.Seq 

RTA000001 87AF.p. 1 9. 1 .Seq_THC 1 08482 

219.H8.sp6:139065.Seq 

90.F8.sp6:130926.Seq 

RTA00000188AF.b.l2.1 

RTAOO000188AF.d.ll.l 

90.H8.sp6:130950.Seq 

RTA000001 88AF.d. 11.1 .Seq_THC2 12094 

RTAOOOOO 1 88AF.d.2 1 . 1 .Seq_THC208760 

90.A9.sp6:130867.Seq 

RTA00000188AF.d.21.1 

RTA00000188AF.d.24.1 

90.B9.sp6:130879.Seq 

RTA00000188AF.g.l 1 .1 .Seq_THC 123222 

RTA00000188AF.g.ll.l 

90.C9.sp6: 130891. Seq 

RTA00000188AF.i.8.1 

90.D9.sp6:130903.Seq 

RTA00000189AF.C.18.1 

90.A10.sp6:130868.Seq 

90.D10.sp6:130904.Seq 

RTA00000189AF.f.7.1 

90.E10.sp6:130916.Seq 

RTA00000189AF.f.8.1 

90.F10.sp6:130928.Seq 

RTA00000189AF.f.l7.1 

90.G10.sp6:130940.Seq 

RTAOOOOO 189AF.f. 18.1 

90.H10.sp6:130952.Seq 

RTAOOOOO 1 89 AF.g. 1.1 

90.Bll.sp6:130881.Seq 

RTAOOOOO 1 89AF.g.5. 1 .Seq_THC 1 961 02 

RTA00000189AF.g.5.1 
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Clone Name 

M00003867A:D10 

M00003867A:D10 

M00003871C:E02 

M00003875C:G07 

M00003875C:G07 

M00003875D:D1 1 

M00003875D:D1 1 

M00003876D:E12 

M00003876D:E12 

M00003906C:E10 

M00003906C:E10 

M00003907D:A09 

M00003907D:A09 

M00003907D:A09 

M00003907D:H04 

M00003907D:H04 

M00003909D:C03 

M00003909D:C03 

M00003968B:F06 

M00003968B:F06 

M00003970C:B09 

M00003970C:B09 

M00003970C:B09 

M00003974D:E07 

M00003974D:E07 

M00003974D:E07 

M00003974D:H02 

M00003974D:H02 

M00003974D:H02 

M00003981A:E10 

M00003981A:E10 

M00003982C:C02 

M00003982C:C02 

M00003982C:C02 

M00004028D:C05 

M00004028D:C05 

M00004035C:A07 

M00004035C:A07 

M00004035D:B06 

M00004035D:B06 

M00004072A:C03 

M00004072A:C03 

M00004081C:D10 

M00004081C:D10 

M00004086D:G06 

M00004086D:G06 

M00004105C:A04 

M00004105C:A04 

M00004171D:B03 

M00004171D:B03 

M00004185C:C03 

M00004185C:C03 



Cluster ID 



4573 
8479 
8479 



7798 

7798 

9285 

9285 

39809 

39809 

39809 

16317 

16317 

8672 

8672 

24488 

24488 

40122 

40122 

40122 

23210 

23210 

23210 

23358 

23358 

23358 

3430 

3430 

2433 

2433 

2433 

40073 

40073 

37285 

37285 

17036 

17036 



15069 

15069 

9285 

9285 

7221 

7221 

4908 

4908 

11443 

11443 



Sequence Name 

90.Cll.sp6:130893.Seq 

RTA00000189AF.h.l7.1 

RTA00000189AF.j.l2.1 

90.Gll.sp6:130941.Seq 

RTA00000189AF.j.22.1 

90.Hll.sp6:130953.Seq 

RTA00000189AF.j.23.1 

90.A12.sp6:130870.Seq 

RTA00000189AF.k.l2.1 

90.H12.sp6:130954.Seq 

RTA00000190AF.d.7.1 

99.Al.sp6:131230.Seq 

RTA00000 1 90AF.e.3 . 1 .Seq_THC 1 502 1 7 

RTA00000190AF.e.3.1 

99.Bl.sp6:131242.Seq 

RTA00000190AF.e.6.1 

RTA00000190AF.f.ll.l 

99.Cl.sp6:131254.Seq 

RTA00000190AF.n.l6.1 

99.C2.sp6:131255.Seq 

RTA00000190AF.n.23.1 

RTA00000190AF.n.23.1.Seq_THC109227 

99.D2.sp6:131267.Seq 

RTA00000190AF.O.20.1 

RTAO0000 1 90AF.O.20. 1 .Seq_THC207240 

99.E2.sp6:131279.Seq 

RTA000001 90AF.O.2 1 . 1 .Seq_THC207240 

RTA00000190AF.O.21.1 

99.F2.sp6: 13 1291. Seq 

99.A3.sp6:131232.Seq 

RTA00000191AF.a.9.1 

RTA00000191AF.a.l5.2 

99.B3.sp6:131244.Seq 

RTAO00O0 1 91 AF.a. 1 5.2.Seq_THC79498 

RTA00000191AF.e.3.1 

99.E3.sp6:131280.Seq 

99.H3.sp6:131316.Seq 

RTA00000191AF.f.ll.l 

RTA00000191AF.f.l3.1 

99.A4.sp6:131233.Seq 

RTA00000191AF.j.9.1 

99.D4.sp6:131269.Seq 

99.F4.sp6:131293.Seq 

RTA00000191AF.1.6.1 

99.H4.sp6:131317.Seq 

RTA00000191AF.m.l8.1 

99.D5.sp6:131270.Seq 

RTA00000191AF.p.9.1 

RTA00000192AF.j.2.1 

99.F6.sp6:131295.Seq 

RTA00000192AF.1.13.2 

123.A8.sp6:132272.Seq 
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Clone Name 

M00004185C:C03 

M00004191D:B11 

M00004191D:B11 

M00004191D:B11 

M00004197D:H01 

M00004197D:H01 

M00004197D:H01 

M00004203B:C12 

M00004203B:C12 

M00004214C:H05 

M00004214C:H05 

M00004223D:E04 

M00004223D:E04 

M00004269D:D06 

M00004269D:D06 

M00004295D:F12 

M00004295D:F12 

M00004296C:H07 

M00004296C:H07 

M00004307C:A06 

M00004307C:A06 

M00004307C:A06 

M00004312A:G03 

M00004312A:G03 

M00004312A:G03 

M00004318C:D10 

M00004318C:D10 

M00004359B:G02 

M00004359B:G02 

M00004505D:F08 

M00004505D:F08 

M00004692A:H08 

M00004692A:H08 

M00004692A:H08 

M00005180C:G03 



Cluster ID 
11443 



8210 

8210 

8210 

14311 

14311 

11451 

11451 

12971 

12971 

4905 

4905 

16921 

16921 

13046 

13046 

9457 

9457 

9457 

26295 

26295 

26295 

21847 

21847 



Sequence Name 

99.A7.sp6:131236.Seq 

RTA00000192AF.m.l2.1 

99.B7.sp6:131248.Seq 

123.C8.sp6:132296.Seq 

99.C7.sp6:131260.Seq 

123.E8.sp6:132320.Seq 

RTA00000192AF.n.l3.1 

99.D7.sp6:131272.Seq 

RTA00000192AF.O.2.1 

177.D8.sp6:134747.Seq 

RTA00000192AF.p.l7.1 

RTA00000193AF.a.20.1 

99.B8.sp6:131249.Seq 

99.H8.sp6:131321.Seq 

RTA00000193AF.e.l4.1 

99.D9.sp6:131274.Seq 

RTA00000193AF.h.l5.1 

99.E9.sp6:131286.Seq 

RTA00000193AF.h.l9.1 

RTA00000193AF.i.l4.2 

99.F9.sp6:131298.Seq 

123.Dll.sp6.132311.Seq 

RTA00000193AF.i.24.2 

99.G9.sp6:131310.Seq 

RTA00000193AF.i.24.2.Seq_THC197345 

RTA00000193AF.j.9.1 

99.H9.sp6:131322.Seq 

RTA000001 93 AF.m.5.1 .Seq_THCl 733 1 8 

RTA00000193AF.m.5.1 

RTA00000194AF.b.l9.1 

99.H10.sp6:131323.Seq 

99.Bll.sp6:131252.Seq 

RTA00000194AF.C.24.1 

377.F4.sp6:141957.Seq 

RTA00000194AF.f.4.1 



136 



WO 99/33982 
cDNA Library ESI 6 - ATCC# 
Deposit Date - December 22, 1 998 



PCT/US98/27610 



Clone Name 

M00001346D:E03 

M00001350A:B08 

M00001350A:B08 

M00001357D.-D11 

M00001357D:D1 1 

M00001409C:D12 

M00001409C:D12 

M00001418B:F03 

M00001418B:F03 

M00001418B:F03 

M00001418D:B06 

M00001421C:F01 

M00001421C:F01 

M00001429B:A11 

M00001432C:F06 

M00001439C:F08 

M00001442C:D07 

M00001442C:D07 

M00001443B:F01 

M00001443B:F01 

M00001445A:F05 

M00001445A:F05 

M00001446A:F05 

M00001455A:E09 

M00001455A:E09 

M00001460A:F12 

M00001481D:A05 

M00001490B:C04 

M00001490B:C04 

M00001500C:E04 

M00001500C:E04 

M00001532B:A06 

M00001532B:A06 

M00001534A:F09 

M00001534A:F09 

M00001535A:B01 

M00001536A:C08 

M00001536A:C08 

M00001541A:F07 

M00001542B:B01 

M00001542B:B01 

M00001544A:E03 

M00001545A:C03 

M00001545A:C03 

M00001545A:C03 

M00001548A:H09 

M00001548A:H09 

M00001548A:H09 

M00001549A:B02 

M00001549A:B02 

M00001549A:D08 

MO0001552B:D04 



Cluster ID 
6806 



4059 
4059 
9577 
9577 
9952 
9952 
9952 
8526 
9577 
9577 
4635 

40054 
16731 
16731 



13532 

13532 

7801 

13238 

13238 

39498 

7985 

18699 

18699 

9443 

9443 

3990 

3990 

5321 

5321 

7665 

39392 

39392 

22085 



12170 

19255 

19255 

19255 

1058 

1058 

1058 

4015 

4015 

10944 

5708 



Sequence Name 

RTA00000177AF.g.l3.3 

80.H2.sp6:130293.Seq 

RTA00000177AF.i.6.2 

RTA0O000 1 77AF.n. 1 8.3 .Seq_THC 1 2305 1 

RTA00000177AF.n.l8.3 

RTA00000179AF.O.17.1 

80.E7.sp6:130262.Seq 

RTA00000180AF.C.20.1 

RTA000001 80AF.C.20. 1 .Seq_THC 1 62284 

80.E8.sp6:130263.Seq 

RTA00000180AF.d.l.l 

RTA00000180AF.d.23.1 

80.G8.sp6:130287.Seq 

RTA00000180AF.L20.1 

RTA00000180AF.L24.1 

RTA00000180AF.p.l0.1 

RTA00000181AF.a.20.1 

80.C10.sp6:130241.Seq 

80.D10.sp6:130253.Seq 

RTA00000181AF.b.7.1 

80.E10.sp6.130265.Seq 

RTA00000181AF.C.4.1 

RTA00000181AF.C.21.1 

RTA00000181AF.m.4.1 

RTA0OO00 1 8 1 AF.m.4. 1 .Seq_THC 1 4069 1 

RTA00000119AJ.20.1 

RTAOOOOO 182ARJ.2.1 

RTA00000182AF.m.l6.1 

89.D3.sp6:130705.Seq 

89.B4.sp6:130682.Seq 

RTA0OO0O183AF.C.1.1 

89.G6.sp6.i30744.Seq 

RTA00000183AF.j.ll.l 

89.B7.sp6:130685.Seq 

RTA00000183AF.k.8.1 

RTA00000134A.1.19.1 

89.G7.sp6:130745.Seq 

RTA00000134A.m.l6.1 

RTA00000135A.e.5.2 

RTA00000183AF.p.4.1 

89.F8.sp6:130734.Seq 

RTAOOOOO 125 A.h. 18.4 

RTA00000135A.m.l8.1 

184.Bl0.sp6:135547.Seq 

89.C9.sp6:130699.Seq 

RTAOOOOO 126A.e.20.3.Seq_THC2 1 7534 

RTA00000126A.e.20.3 

79.F6.sp6:130081.Seq 

RTAOOOOO 136A.e. 12.1 

79.G6.sp6:130093.Seq 

RTAOOOOO 126A.h. 17.2 

RTAOOOOO 184AF.g. 12.1 
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Clone Name 

M00001552B:D04 

M00001552D:A01 

M00001552D:A01 

M00001553D:D10 

M00001553D:D10 

M00001558A:H05 

M00001558A:H05 

M00001561A:C05 

M000O1561A:CO5 

M00001564A:B12 

M00001578B:E04 

M00001579D:C03 

M00001579D:C03 

M00001579D:C03 

M00001582D:F05 

M00001587A:B11 

M00001587A:B11 

M00001604A:F05 

M00001604A:F05 

M00001624A:B06 

M00001624A:B06 

M00001624A:B06 

M00001630B:H09 

M00001630B:H09 

M00001630B:H09 

MOO0O1651A:H01 

M00001651A:H01 

M00001677C:E10 

M00001679C:F01 

M00001679C:F01 

M00001679C:F01 

M00001686A:E06 

M00003796C:D05 

M00003796C:D05 

M00003826B:A06 

M00003826B:A06 

M00003833A:E05 

M00003837D:A01 

M00003837D:A01 

M00003846B:D06 

M00003846B:D06 

M00003879B:D10 

M00003879B:D10 

M00003879D:A02 

M00003879D:A02 

M00003891C:H09 

M00003891C:H09 

M00003912B:D01 

M00003912B:D01 

M00004072B:B05 

M00004081C:D12 

M00004111D:A08 



Cluster ID 
5708 



22814 
22814 



39486 

39486 

5053 

23001 

6539 

6539 

6539 

39380 

39380 

39391 

39391 

3277 

3277 

3277 

5214 

5214 

5214 



14627 

78091 

78091 

78091 

4622 

5619 

5619 

11350 

11350 

21877 

7899 

7899 

6874 

6874 

31587 

31587 

14507 

14507 



12532 
12532 
17036 
14391 
6874 



Sequence Name 

89.E10.sp6:130724.Seq 

89.F10.sp6:130736.Seq 

RTA00000184AF.g.22.1 

RTA00000184AF.h.l4.1 

89.All.sp6:130677.Seq 

RTA00000128A.C.20.1 

89. F12.sp6:130738.Seq 
RTA00000128A.m.22.2 
79.B8.sp6:130035.Seq 
RTA00000184AF.O.12.1 
RTA00000185AF.C.24.1 

90. Gl.sp6:130931.Seq 
173.A12.SP6:134080.Seq 
RTA00000185AF.d.ll.l 
RTA00000185AF.d.24.1 
RTA00000129A.e.24.1 
79.E8.sp6:130071.Seq 
RTA00000138A.C.3.1 
79.A9.sp6:130024.Seq 
RTA00000138A.1.5.1 
217.El.sp6:139406.Seq 
90.B4.sp6:130874.Seq 
90.D4.sp6:130898.Seq 
122.C2.sp6:132098.Seq 
RTA00000186AF.g.ll.l 
RTA00000186AF.n.7.1 
90.A5.sp6:130863.Seq 
RTA00000187AF.g.23.1 
90.C7.sp6:130889.Seq 
RTA00000187AFJ.6.1 
176.G5.sp6:134588.Seq 
RTA00000187AF.m.l5.2 
RTAOO0OO 1 88AF.1.9. 1 .Seq_THC 1 67845 
RTA00000188AF.1.9.1 
RTA00000189AF.a.24.2 
90.F9.sp6:130927.Seq 
RTA00000189AF.b.21.1 
90.H9.sp6:130951.Seq 
RTA00000189AF.C.10.1 
RTA00000189AF.e.9.1 
90.C10.sp6:130892.Seq 
RTA00000189AF.1.20.1 
90.C12.sp6:130894.Seq 
90.D12.sp6:130906.Seq 
RTA00000189AR.1.23.2 
90.G12.sp6:130942.Seq 
RTA00000189AF.p.8.1 
99.Dl.sp6:131266.Seq 
RTA00000190AF.g.2.1 
RTA00000191AF.j.l0.1 
RTA00000191AF.1.7.1 
RTA00000192AF.a.l4.1 
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Clone Name 


Cluster ID 


Seouence Name 


M00004111DA08 


6874 


99 F5 sd6 131294 Sea 


M00004121B:G01 




177.H4.sp6:134791.Seq 


M00004121BG01 




99 H5 so6i31318 Sea 


M00004121BG01 




RTA00000 1 92 AF.c.2 . 1 


M00004138BH02 


13272 


99 A6 sd6 131235 Sea 


M00004138BH02 


13272 


RTA00000 1 92 AF.e.3 . 1 


M00004151D:B08 


16977 


RTA00000 1 92 AF.g.3 . 1 


M00004I69CC12 


5319 


99 E6 sd6 13 1283 Sea 


M00004169CC12 


5319 


RTAOO000 1 92AF.i. 1 2. 1 


M00004I69CC12 


5319 


123 F7 sd6* 132331 Sea 




16392 


RTA00000192AF 1 1 1 

i\ 1 AVUUUv 1 7*>rM .1.1.1 


M00004183C-D07 


16392 


RTA00000192AF 1 1 1 Sea THC202071 


M00004230BC07 


7212 


RTA0O000 1 93 AF.b. 1 4. 1 


M00004230TVr07 




99 D8 snfi- 1 3 1 273 Sen 

77.lyO.jUU. U 1 Z. / J.OCU 






RTA00000193AF c 21 1 Sen THC2226n2 


M00004249DF10 




RTAOO000 1 93 AF c 2 1 1 


M0000427Sr-n 1 


16914 


99 AO <:n6* 1 3 1 238 Sen 


M0000427Sr'n 1 

IVlUuvUti / J^.V/ 1 1 


I U7 1 *T 


RTA00000193AF f S 1 

1\ 1 rtuUUUU 1 7 J Al .l.J.l 


M00004283RA04 


14286 


RTA00000193AF f 22 1 






RTAflflftOniQI AF a 9 1 
ix i /w/v/ui/u 1 7J/\r .g.Zr. i 


M00004327B:H04 




RTA00000193AF.j.20.1 


M00004377C:F05 


2102 


RTA00000193AF.n.7.1 


M00004384C:D02 




RTA00000193AF.n.l5.1 


M00004384C:D02 




RTA00000 193 AF.n. 15.1 .Seq_THC21 5687 


M00004461A:B08 




RTA00000194AR.a.l0.2 


M00004461A:B09 




RTA00000194AF.a.ll.l 


M00004691D:A05 




RTA00000194AF.C.23.1 


M00004896A:C07 




RTA00000194AF.d.l3.1 



The above material has been deposited with the American Type Culture Collection, 
Rockville, Maryland, under the accession number indicated. This deposit will be maintained 
under the terms of the Budapest Treaty on the International Recognition of the Deposit of 
Microorganisms for purposes of Patent Procedure. The deposit will be maintained for a 
period of 30 years following issuance of this patent, or for the enforceable life of the patent, 
whichever is greater. Upon issuance of the patent, the deposit will be available to the public 
from the ATCC without restriction. 

This deposit is provided merely as convenience to those of skill in the art, and is not 
an admission that a deposit is required under 35 U.S.C. §112. The sequence of the 
polynucleotides contained within the deposited material, as well as the amino acid sequence 
of the polypeptides encoded thereby, are incorporated herein by reference and are controlling 
in the event of any conflict with the written description of sequences herein. A license may 
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be required to make, use, or sell the deposited material, and no such license is granted 

hereby. 



Retrieval of Individual Clones from Deposit of Pooled Clones 
5 Where the ATCC deposit is composed of a pool of cDNA clones, the deposit was 

prepared by first transfecting each of the clones into separate bacterial cells. The clones 
were then deposited as a pool of equal mixtures in the composite deposit. Particular clones 
can be obtained from the composite deposit using methods well known in the art. For 
example, a bacterial cell containing a particular clone can be identified by isolating single 

10 colonies, and identifying colonies containing the specific clone through standard colony 
hybridization techniques, using an oligonucleotide probe or probes designed to specifically 
hybridize to a sequence of the clone insert (e.g., a probe based upon unmasked sequence of 
the encoded polynucleotide having the indicated SEQ ID NO). The probe should be 
designed to have a T m of approximately 80°C (assuming 2°C for each A or T and 4°C for 

15 each G or C). Positive colonies can then be picked, grown in culture, and the recombinant 
clone isolated. Alternatively, probes designed in this manner can be used to PCR to isolate a 
nucleic acid molecule from the pooled clones according to methods well known in the art, 
e.g., by purifying the cDNA from the deposited culture pool, and using the probes in PCR 
reactions to produce an amplified product having the corresponding desired polynucleotide 

20 sequence. 
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Table 1. Sequence identification numbers, cluster ID, sequence name, and clone name 



SEQIDNO: Cluster ID 


Sequence Name 


Clone Name 


1 


4635 


RTA00000180AF.L20.1 


M00001429B:A11 


2 




RTA00000185AF.n.l2.1 


M00001608D:A11 


3 


4622 


RTA00000187AF.m.l5.2 


M00001686A:E06 


4 


3706 


RTA00000191AF.L17.2 


M00004068B:A01 


5 


36535 


RTA00000181AF.f.5.1 


M00001449A:G10 


6 


3990 


RTA00000183AF.j.ll.l 


M00001532B:A06 


7 


5319 


RTA00000192AF.U2.1 


M00004169C:C12 


8 


36393 


RTA00000180AF.C.2.1 


M00001417A:E02 


9 


2623 


RTA00000183AF.a.6.1 


M00001497A:G02 


10 


7587 


RTA00000178AF.n.24.1 


M00001387B:G03 


11 


7065 


RTA00000137A.g.6.1 


M00001557A:D02 


12 


10539 


RTA00000187AF17.1 


M00001680D:F08 


13 


27250 


RTA00000181AF.g.l0.1 


M00001450A:D08 


14 


5556 


RTA00000179AF.n.l0.1 


M00001407B:D11 


15 




RTA00000192AF.m.l2.1 


M00004191D:B11 


16 


8761 


RTA00000184AF.k.l2.1 


M00001557D:D09 


17 


4622 


RTA00000189AF.g.l.l 


M00003856B:C02 


18 


11460 


RTA00000187AF.g.l2.1 


M00001676B:F05 


19 


16283 


RTA00000120A.O.20.1 


M00001467A:D08 


20 


3430 


RTA00000191AF.a.9.1 


M00003981A:E10 


21 


7065 


RTA00000184AFJ.21.1 


M00001557A:D02 


22 




RTA00000182AF.1.20.1 


M00001488B:F12 


23 




RTA00000123A.g.l9.1 


M00001531A:H11 


24 


16918 


RTA00000193AF.a.l6.1 


M00004223A:G10 


25 


16914 


RTA00000193AF.f.5.1 


M00004275C:C11 


26 


40108 


RTA00000187AF.O.24.1 


M00003741D:C09 


27 


14286 


RTA00000193AF.f.22.1 


M00004283B:A04 


28 


17004 


RTA00000186AF.D.21.1 


M00001617C:E02 


29 




RTA00000180AF.g.22.1 


M00001426B:D12 


30 


13272 


RTA00000192AF.e.3.1 


M00004138B:H02 


31 




RTA00000194AF.f.4.1 


M00005180C:G03 


32 


32663 


RTA00000118A.1.8.1 


M00001450A:A11 


33 




RTA00000180AF.a.9.1 


M00001414A:B01 


34 


5832 


RTA00000178AF.O.23.1 


M00001388D:G05 


35 


7801 


RTA00000181AF.C.21.1 


M00001446A:F05 


36 


76760 


RTA00000187AF.a.l5.1 


M00001657D:F08 


37 


40132 


RTA00000178AF.C.7.1 


M00001365C:C10 


38 




RTA00000183AF.e.l.l 


M00001505C:C05 


39 


4016 


RTA00000118A.C.4.1 


M00001395A:C03 


40 


5382 


RTA00000187AF.m.23 .2 


M00001688C:F09 
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WO 99/33982 
SEQIDNO: Cluster ID Sequence Name 



41 


5693 


RTA00000190AF.p.l7.2 


42 


307 


RTA00000136A.O.4.2 


43 


39833 


RTA00000178AF.I.23.1 


44 




RTA00000193AF.m.5.1 


45 


5325 


RTA00000191AF.O.6.1 


46 


5325 


RTA00000191AF.O.6.2 


47 


18957 


RTA00000190AR.m.9.1 


48 


39508 


RTA00000120A.O.2.1 


49 


22390 


RTA00000136A.j.l3.1 


50 


12170 


RTA00000125A.h.l8.4 


51 


4393 


RTA0OO'0O187AF.n.l7.1 


52 


19 


RTA00000182AF.b.7.1 


53 




RTA00000193AF.C.21.1 


54 


7899 


RTA00000189AF.C.10.1 


55 


40073 


RTA00000191AF.e.3.1 


56 


7005 


RTA00000179AF.O.22.1 


57 




RTA00000187AF.h.22.1 


58 


18957 


RTA00000190AF.m.9.2 


59 


18957 


RTA00000183AF.h.23.1 


60 


16283 


RTA00000182AF.C.22.1 


61 


6974 


RTA00000183AF.d.9.1 


62 


2623 


RTA00000183AF.b.l4.1 


63 


9105 


RTA00000191AF.a.21.2 


64 


13238 


RTA00000181AF.m.4.1 


65 


5749 


RTA00000185AF.3.19.1 


66 


6455 


RTA00000193AF.b.9.1 


67 


23001 


RTA00000185AF.C.24.1 


68 


6455 


RTA00000192AF.g.23.1 


69 


13595 


RTA00000189AF.f.8.1 


70 


39442 


RTA00000120A.O.21.1 


71 


17036 


RTA00000191AF.f.l3.1 


72 




RTA00000183AF.g.9.1 


73 


7005 


RTA00000181AF.k.24.1 


74 


6268 


RTA00000126A.O.23.1 


75 


16130 


RTA00000119A.C.13.1 


76 


23201 


RTA00000187AF.a.l4.1 


77 


5321 


RTA00000183AF.k.8.1 


78 


13157 


RTA00000186AF.a.6.1 


79 


2102 


RTA00000193AF.n.7.1 


80 


1058 


RTA00000126A.e.20.3 


81 


40392 


RTA00000180AF.j.8.1 


82 




RTA00000183AF.e.23.1 


83 


11476 


RTA00000187AF.p.l9.1 
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PCT/US98/27610 
Clone Name 

M00003978B:G05 

M00001552A:B12 

M00001378B:B02 

M00004359B:G02 

M00004093D:B12 

M00004093D:B12 

M00003958A:H02 

M00001467A:D04 

M00001551A:G06 

M00001544A:E03 

M00001693C:G01 

M00001463C:B11 

M00004249D:F10 

M00003837D:A01 

M00004028D:C05 

M00001410A:D07 

M00001679A:F06 

M00003958A:H02 

M00001528A:F09 

M00001467A:D08 

M00001504C:H06 

M00001500A:E11 

M00003983A:A05 

M00001455A:E09 

M00001571C:H06 

M00004229B:F08 

M00001578B:E04 

M00004157C:A09 

M00003851B:D10 

M00001467A:E10 

M00004035D:B06 

M00001513B:G03 

M00001454B:C12 

M00001551A:B10 

M00001453A:E11 

M00001657D:C03 

M00001534A:F09 

M00001614C:F10 

M00004377C:F05 

M00001548A:H09 

M00001429D:D07 

M00001506D:A09 

M00003747D:C05 



WO 99/33982 PCT/US98/2761 0 



SEQIDNO: Cluster ID 


Sequence Name 


Clone Name 


84 


3584 


RTA00000177AF.h.20.1 


M00001349B:B08 


85 


10470 


RTA0000O18OAF.f.l8.1 


M00001424B:G09 


86 


39425 


RTA00000133A.f.l.l 


M00001470A:C04 


87 


5175 


RTA00000184AF.f.3.1 


M00001550A:G01 


88 


13576 


RTA00000189AF.O.13.1 


M00003885C:A02 


89 


7665 


RTA00000134A.1.19.1 


M00001535A:B01 


90 


16927 


RTA00000177AF.h.9.3 


M00001348B.B04 


91 


6660 


RTA00000187AF.h.l5.1 


M00001679A:A06 


92 


2433 


RTA00000191AF.a.l5.2 


M00003982C:C02 


93 


5097 


RTA00000134A.k.l.l 


M00001534A:D09 


94 


21847 


RTA00000193AF.j.9.1 


M00004318C:D10 


95 


3277 


RTA00000138A.1.5.1 


M00001624A:B06 


96 


5708 


RTA00000184AF.g.l2.1 


M00001552B:D04 


97 


945 


RTA00000178AR.a.20.1 


M00001362C:H11 


98 


16269 


RTA00000178AF.p.l.l 


M00001389A:C08 


99 




RTA00000183AF.C.24.1 


M00001504A:E01 


100 


16731 


RTA00000181AF.a.20.1 


M00001442C:D07 


101 


12439 


RTA00000190AF.O.24.1 


M00003975A:G11 


102 


3162 


RTA00000177AF.j.l2.3 


M00001351B:A08 


103 




RTA00000194AF.b.l9.1 


M00004505D:F08 


104 




RTA00000193AF.n.l5.1 


M00004384C:D02 


105 




RTA00000186AF.n.7.1 


M00001651A:H01 


106 


10717 


RTA00000181AF.d.l0.1 


M00001447A:G03 


107 


4573 


RTA00000189AFJ.12.1 


M00003871C:E02 


108 




RTA00000186AF.h.l4.1 


M00001632D:H07 


109 


11443 


RTA00000192AF.1.13.2 


M00004185C:C03 


110 


5892 


RTA00000184AF.d.ll.l 


M00001548A:E10 


111 


3162 


RTA00000177AF.j.l2.1 


M00001351B:A08 


112 


10470 


RTA00000185AF.k.6.1 


M00001597D:C05 


113 


17055 


RTA00000187AF.m.3.1 


M00001682C:B12 


114 


2030 


RTA00000193AF.m.20.1 


M00004372A:A03 


115 


6558 


RTA00000184AF.m.21.1 


M00001560D:F10 


116 


23255 


RTA00000190AF.j.4.1 


M00003922A:E06 


117 


9577 


RTA00000179AF.O.17.1 


M00001409C:D12 


118 




RTA00000180AF.a.ll.l 


M00001414C:A07 


119 


8 


RTA00000181AF.e.l7.1 


M00001448D:C09 


120 


67907 


RTA00000188AF.g.ll.l 


M00003774C:A03 


121 


12081 


RTA00000133A.d.l4.2 


M00001469A:C10 


122 


2448 


RTA00000119AJ.21.1 


M00001460A:F06 


123 


3389 


RTA00000189AF.g.3.1 


M00003857A:G10 


124 


39174 


RTA00000124A.n.l3.1 


M00001541A:H03 


125 


24488 


RTA00000190AF.n.l6.1 


M00003968B:F06 


126 


8210 


RTA00000192AF.n.l3.1 


M00004197D:H01 
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SEQIDNO: Cluster ID Sequence Name 



127 




RTA00000135A.I.2.2 


128 


40455 


RTA00000190AF.m.l0.2 


129 


9577 


RTA00000180AF.d.23.1 


130 


13183 


RTA00000192AF.a.24.1 


131 


5214 


RTA00000186AF.g.ll.l 


132 


67252 


RTA00000187AF.O.6.1 


133 


3108 


RTA00000188AF.d.24.1 


134 


2464 


RTA00000178AF.n.l8.1 


135 


36313 


RTA00000181AF.e.23.1 


136 


23255 


RTA00000177AF.e.l4.3 


137 


7985 


RTA00000182AR.j.2.1 


138 


8286 


RTA00000183AF.O.1.1 


139 


22195 


RTA00000180AF.g.7.1 


140 


4573 


RTA00000184AF.h.9.1 


141 


26875 


RTA00000187AF.U.1 


142 


7187 


RTA00000177AF.i.8.2 


143 


86859 


RTA00000118A.p.8.1 


144 


4623 


RTA00000185AF.f.4.1 


145 




RTA00000121A.C.10.1 


146 


10185 


RTA00000183AF.d.5.1 


147 




RTA00000183AF.p.4.1 


148 


15069 


RTA00000191AF.1.6.1 


149 


39304 


RTA00000118A.j.21.1 


150 


8672 


RTA00000190AF.f.ll.l 


151 


13576 


RTA00000177AF.g.l6.1 


152 


6293 


RTAOOO0O185AF.e.ll.l 


153 


16977 


RTA00000192AF.g.3.1 


154 


5345 


RTA00000189AF.1.19.1 


155 


4905 


RTA00000193AF.e.l4.1 


156 


17036 


RTA00000191AF.j.l0.1 


157 


5417 


RTA00000191AF.h.l9.1 


158 


7172 


RTA00000178AF.f.9.1 


159 


40044 


RTA00000186AF.d.l.l 


160 


4386 


RTA00000184AF.j.4.1 


161 


40044 


RTA00000183AF.g.22.1 


162 


9685 


RTA00000183AF.C.11.1 


163 


22155 


RTA00000185AF.n.9.1 


164 


10515 


RTA00000189AF.f.l8.1 


165 


6539 


RTA00000185AF.d.ll.l 


166 


15066 


RTA00000180AF.e.24.1 


167 


4261 


RTA00000180AF.h.5.1 


168 


13864 


RTA00000125A.m.9.1 


169 


6539 


RTA00000189AF.d.22.1 
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PCT/US98/27610 
Clone Name 
M00001545A:B02 
M00003958C:G10 
M00001421C:F01 
M00004114C:F11 
M00001630B:H09 
M00001716D:H05 
M00003763A:F06 
M00001387A:C05 
M00001448D:H01 
M00001343D:H07 
M00001481D:A05 
M00001540A:D06 
M00001425B:H08 
M00001553B:F12 
M00001679A:F10 
M00001350A:H01 
M00001452A:B12 
M00001586C:C05 
M00001469A:A01 
M00001504C:A07 
M00001542B:B01 
M00004081C:D10 
M00001450A:A02 
M00003909D:C03 
M00001347A:B10 
M00001583D:A10 
M00004151D:B08 
M00003879B:C11 
M00004269D:D06 
M00004072B:B05 
M00004059A:D06 
M00001371C:E09 
M00001621C:C08 
M00001556B:C08 
M00001514C:D11 
M00001501D:C02 
MOOO016O8B:EO3 
M00003853A:F12 
M00001579D:C03 
M00001423B:E07 
M00001426D:C08 
M00001545A:D08 
M00003844G:B1 1 



WO 99/33982 PCT/US98/27610 



SEQ ID NO: Cluster ID 


Sequence Name 


Clone Name 


170 


11465 


RTA00000185AF.m.l9.1 


M00001607A:E11 


171 


3266 


RTA00000184AR.g.l.l 


M00001551C:G09 


172 


102 


RTA00000184AF.O.5.1 


M00001563B:F06 


173 


16970 


RTA00000181AR.1.18.2 


M00001452C:B06 


174 


12971 


RTA00000193AF.a.20.1 


M00004223D:E04 


175 


5007 


RTA00000177AF.g.2.1 


M00001346A:F09 


176 


3765 


RTA0OO00135A.d.l.l 


M00001541A:D02 


177 


11294 


RTA00000184AFJ.6.1 


M00001556B:G02 


178 


3681 


RTA00000131A.g.l5.2 


M00001449A:D12 


179 


9283 


RTA00000181AR.m.21.2 


M00001455D:F09 


180 


18699 


RTA00000182AF.m.l6.1 


M00001490B:C04 


181 


86110 


RTA00000181AF.f.l2.1 


M00001449C:D06 


182 


39648 


RTA00000178AR.I.8.2 


M00001383A:C03 


183 


7337 


RTA00000123A.b.l7.1 


M00001528A:C04 


184 


1334 


RTA00000178AF.j.7.1 


M0O0O1379A:AO5 


185 


17076 


RTA00000188AF.d.21.1 


M00003762C:B08 


186 


22794 


RTAO000O138A.b.5.1 


M00001601A:D08 


187 


39171 


RTA00000186AF.1.7.1 


M00001644C:B07 


188 


8551 


RTA00000179AF.p.21.1 


M00001412B:B10 


189 


5857 


RTA00000118A.g.l4.1 


M00001449A:A12 


190 


9443 


RTA00000183AF.C.1.1 


M00001500C:E04 


191 


9457 


RTA00000193AF.U4.2 


M00004307C-.A06 


192 


7206 


RTA00000182AF.O.15.1 


M00001494D:F06 


193 


22979 


RTA00000178AF.k.22.1 


M00001382C:A02 


194 


40455 


RTA00000190AR.m.l0.1 


M00003958C:G10 


195 


7221 


RTA00000191AF.p.9.1 


M00004105C:A04 


196 




RTA00000191AFJ.9.1 


M00004072A:C03 


197 


7239 


RTA00000126A.m.4.2 


M00001550A:A03 


198 


31587 


RTA00000189AF.1.20.1 


M00003879B:D10 


199 


16317 


RTA00000190AF.e.6.1 


M00003907D:H04 


200 


13576 


RTA00000189AR.O.13.1 


M00003885C:A02 


201 


5779 


RTA00000177AF.g.l4.3 


M00001346D:G06 


202 


6124 


RTA00000191AR.e.2.3 


M00004028D:A06 


203 


9952 


RTA00000180AF.C.20.1 


M00001418B:F03 


204 




RTA00000188AF.i.8.1 


M00003784D:D12 


205 


5779 


RTA00000177AF.g.l4.1 


M00001346D:G06 


206 


39490 


RTA00000128A.b.4.1 


MOOOO1557A:F03 


207 


4416 


RTA00000187AF.h.l3.1 


M00001678D:F12 


208 


4009 


RTA00000179AF.e.20.1 


M00001396A:C03 


209 


5336 


RTAO0O0O183AF.b.l3.1 


M00001500A:C05 


210 


39186 


RTA00000121A.p.l5.1 


M00001512A:A09 


211 


40122 


RTA00O0O190AF.n.23.1 


M00003970C:B09 


212 


12532 


RTA00000190AF.g.2.1 


M00003912B:D01 
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SEQIDNO: Cluster ID 


Sequence Name 


213 


8078 


RTA00000177AR.1.13.1 


214 


3900 


RTA00000190AF.g.l3.1 


215 


7589 


RTA00000120A.p.23.1 


216 


8298 


RTA00000127A.d.l9.1 


217 


4443 


RTA00000177AF.b.20.4 


21S 


26295 


RTAOOOOO 1 93 AF.i.24.2 


219 


3389 


RTA00000183AF.m.l9.1 


220 


7015 


RTA00000187AF.f.l8.1 


221 


8526 


RTA00000180AF.d.l.l 


222 


4665 


RTA00000186AF.m.3.1 


223 


1399 


RTA00000129A.O.10.1 


224 


9244 


RTA00000127A.1.3.1 


225 




RTA00000179AF.j.l3.1 


226 


82498 


RTA00000118A.m.l0.1 


227 


35702 


RTA00000187AR.C.15.2 


228 


38759 


RTA00000120A.m.l2.3 


229 


39648 


RTA00000178AF.1.8.1 


230 


19105 


RTA00000133A.e.l5.1 


231 


85064 


RTA00000131A.m.23.1 


232 


9285 


RTAOOOOO 191 AF.m. 18.1 


233 


9285 


RTA00000190AF.d.7.1 


234 


39391 


RTA00000138A.C.3.1 


235 




RTAOOOOO 178AF.d.20.1 


236 


39498 


RTAOOOOO 119AJ.20.1 


237 


7798 


RTAOOOOO 189AF.k. 12.1 


238 


7798 


RTAOOOOO 189AF.C. 18.1 


239 


19829 


RTAOOOOO 125A.h.24.4 


240 




RTAOOOOO 188AF.d. 11.1 


241 


4275 


RTAOOOOO 120A.j. 14.1 


242 


22113 


RTAOOOOO 125A.C.7.1 


243 


40314 


RTAOOOOO 186AF.C. 15.1 


244 


10944 


RTAOOOOO 126A.h. 17.2 


245 


39809 


RTA00000190AF.e.3.1 


246 


22085 


RTA00000135A.e.5.2 


247 


19255 


RTA00000135A.m.l8.1 


248 


14311 


RTAOOOOO 1 92 AF.o.2.1 


249 


8479 


RTAOOOOO 189AFJ.22.1 


250 




RTA00000189AFJ.23.1 


251 


4193 


RTA00000184AF.e.l3.1 


252 


22814 


RTA00000184AF.h.l4.1 


253 


39563 


RTAOOOOO 179AF.k.20.1 


254 


39420 


RTA00000134A.O.23.1 


255 


11589 


RTAOOOOO 177AF.b. 17.4 
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Clone Name 

M00001353A:G12 

M00003914C:F05 

M00001468A:F05 

M00001553A:H06 

M00001341A:E12 

M00004312A:G03 

M00001537B:G07 

M00001673C:H02 

M00001418D:B06 

M00001648C:A01 

M00001604A:B10 

M00001556A:C09 

M00001400B:H06 

M00001450A:B12 

M00001663A:E04 

M00001467A:B07 

M000O1383A:CO3 

M00001469A:H12 

M00001452A:F05 

M00004086D:G06 

M00003906C:E10 

M00001604A:F05 

M00001368D:E03 

M00001460A:F12 

M00003876D:E12 

M00003839A:D08 

M00001544A:G02 

M00003761D:A09 

M00001466A:E07 

M00001542A:A09 

M00001619C:F12 

M00001549A:D08 

M00003907D:A09 

M00001541A:F07 

M00001545A:C03 

M00004203B:C12 

MOO0O3875C:GO7 

M000O3875D:Dl 1 

M00001549B:F06 

M000O1553D:D10 

M00001402A:E08 

M00001537A:F12 

M00001340D:F10 
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SEQ TD NO: Cluster ID 


Sequence Name 


256 


4937 


RTA00000191AF.p.21.1 


257 


39412 


RTA00000133A.k.l7.1 


258 


4837 


RTA00000185AR.k.3.2 


259 


13046 


RTA00000193AF.h.l9.1 


260 


4141 


RTA00000177AF.p.20.3 


261 


38085 


RTA00000123A.e.l5.1 


262 




RTA00000189AF.p.8.1 


263 


11451 


RTA00000192AF.p.l7.1 


264 


14507 


RTA00000189AR.1.23.2 


265 


40054 


RTA00000180AF.p.l0.1 


266 


39423 


RTA00000134A.k.22.1 


267 


39453 


RTA00000135A.g.ll.l 


268 


10751 


RTA00000187AF.k.7.1 


269 


10751 


RTA00000187AF.k.6.1 


270 


78091 


RTA00000187AF.j.6.1 


271 


39539 


RTA00000127A.i.21.1 


272 




RTA00000182AF.1.15.1 


273 




RTA00000194AF.d.l3.1 


274 




RTA00000128A.C.20.1 


275 


9283 


RTA00000181AR.m.22.2 


276 


39168 


RTA00000121A.1.10.1 


277 


39458 


RTA00000126A.p.l5.2 


278 


14391 


RTA00000177AF.m.l7.3 


279 


39195 


RTA00000137A.C.16.1 


280 


7212 


RTA00000193AF.b.l4.1 


281 


4015 


RTA00000136A.e.l2.1 


282 


12977 


RTA00000189AF.j.l9.1 


283 




RTA00000178AF.m.l3.1 


284 


14391 


RTA00000191AF.1.7.1 


285 




RTA00000194AF.C.23.1 


286 




RTA00000181AF.b.7.1 


287 


8358 


RTA00000183AF.i.5.1 


288 


1267 


RTA00000125A.O.5.1 


289 




RTA00000189AF.f.7.1 


290 


16347 


RTA00000184AF.e.l5.1 


291 


7899 


RTA00000193AF.E.17.1 


292 


2379 


RTA00000178AF.a.6.1 


293 


39478 


RTA0O000133A.J.5.1 


294 


39392 


RTA00000134A.m.l6.1 


295 


5053 


RTA00000184AF.O.12.1 


296 


16999 


RTA00000185AF.k.9.1 


297 


39180 


RTA00000126A.n.8.2 


298 


1037 


RTA00000121A.f.8.1 



PCT/US98/27610 
Clone Name 

M00004108A:E06 

M00001511A:H06 

M00001597C:H02 

M00004296C:H07 

M00001361A:A05 

M00001531A:D01 

M00003891C:H09 

M00004214C:H05 

M00003879D:A02 

M00001439C:F08 

M00001535A:F10 

M00001542A:E06 

M00001679D:D03 

M00001679D:D03 

M00001679C:F01 

M00001555A:B02 

M00001487B:H06 

M00004896A:C07 

M00001558A:H05 

M00001455D.F09 

M00001507A:H05 

M00001552A:D11 

M00001355B:G10 

M00001555A:C01 

M00004230B:C07 

M00001549A:B02 

M00003875B:F04 

M00001384B:A11 

M00004081C:D12 

M00004691D:A05 

M00001443B:F01 

M00001528B:H04 

M00001546A:G11 

M00003851B:D08 

M00001549C:E06 

M00004223B:D09 

M00001361D:F08 

M00001471A:B01 

M00001536A:C08 

M00001564A:B12 

M00001598A:G03 

M00001551A:F05 

M00001470A:B10 
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SEQ ID NO: Cluster ED 


Sequence Name 


Clone Name 


299 


6867 


RTA00000178AF.e.l2.1 


M00001370A:C09 


300 


10539 


RTA00000183AF.a.24.1 


M00001499B:A11 


301 


41633 


RTA00000118A.g.l6.1 


M00001449A:B12 


302 


23218 


RTA00000187AR.C.5.2 


M00001662C:A09 


303 


39380 


RTA00000129A.e.24.1 


M00001587A:B11 


304 




RTA00000185AF.d.24.1 


M00001582D:F05 


305 




RTA00000177AF.O.4.3 


M00001358C:C06 


306 


6974 


RTA00000184AF.a.l5.1 


M00001544B:B07 


307 




RTA00000185AF.g.ll.l 


M00001590B:F03 


308 


15855 


RTA00000184AF.j.l.l 


M00001556A:H01 


309 


84328 


RTA00000118A.p.l0.1 


M00001452A:B04 


310 


10145 


RTA00000120A.g.l2.1 


M00001465A:B11 


311 


39805 


RTA00000177AF.C.21.3 


M00001342B:E06 


312 




RTA00000187AF.h.23.1 


M00001679A:F06 


313 


6298 


RTA00000187AR.1.10.2 


M00001679B:F01 


314 


14367 


RTA00000187AF.e.8.1 


M00001670C:H02 


315 




RTA00000193AF.C.22.1 


M00004249D:G12 


316 


16921 


RTA00000183AF.k.6.1 


M00001534A:C04 


317 


1577 


RTA00000184AF.i.23.1 


M00001556A:F11 


318 


8773 


RTA00000187AF.f.24.1 


M00001675A:C09 


319 




RTA00000194AF.a.ll.l 


M00004461A:B09 


320 


39886 


RTA00000178AF.j.24.1 


M00001380D:B09 


321 


13532 


RTA00000181AF.C.4.1 


M00001445A:F05 


322 




RTA00000193AF.d.2.1 


M00004251C:G07 


323 


5257 


RTA00000192AF.O.1 


M00004146C:C11 


324 


9061 


RTA00000191AR.e.ll.2 


M00004031A:A12 


325 


19267 


RTA00000186AF.I.12.1 


M00001645A:C12 


326 


20212 


RTA00000134A.1.22.1 


M00001535A:C06 


327 


16653 


RTA000O0181AF.k.5.3 


M00001453C:F06 


328 


16985 


RTA00000177AF.h.l0.1 


M00001348B:G06 


329 


12977 


RTA00000189AR.j.l9.1 


M00003875B:F04 


330 


9061 


RTA00000191AR.e.ll.3 


M0000403IA:A12 


331 




RTA00000194AR.a.l0.2 


M00004461A:B08 


332 


6468 


RTA00000187AF.d.l5.1 


M00001669B:F02 


333 


16392 


RTA00000192AF.1.1.1 


M00004183C:D07 


334 


14627 


RTA00000187AF.g.23.1 


M00001677C:E10 


335 


6583 


RTA00000179AF.d.l3.1 


M00001394A:F01 


336 


6806 


RTA00000177AF.g.l3.3 


M00001346D:E03 


337 


9635 


RTA00000137A.e.23.4 


M00001557A:F01 


338 


689 


RTA00000181AR.1.22.1 


M00001454D:G03 


339 


4119 


RTA00000183AF.k.l6.1 


M00001534C:A01 


340 


8952 


RTA00000183AF.h.l5.1 


M00001518C:B11 


341 


2379 


RTA00000192AF.p.8.1 


M00004212B:C07 
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SEQIDNO: Cluster ID Sequence Name 



342 


39486 


RTA00000128A.m.22.2 


343 


21877 


RTA00000189AF.b.21.1 


344 


6874 


RTA00000192AF.a.l4.1 


345 


6874 


RTA00000189AF.e.9.1 


346 


37285 


RTA00000191AF.f.ll.l 


347 




RTA00000193AF.j.20.1 


348 


7674 


RTA00000118A.g.9.1 


349 


2797 


RTA00000180AF.L19.1 


350 




RTA00000184AF.g.22.1 


351 


7802 


RTA000OO185AF.n.5.1 


352 


16921 


RTA000OO193AF.h.l5.1 


353 


11494 


RTA00000192AF.j.6.1 


354 


17062 


RTA00000177AF.b.8.4 


355 


16245 


RTA00000177AF.k.9.3 


356 


83103 


RTA00000119A.e.24.2 


357 


4309 


RTA00000186AF.e.22.1 


358 


13072 


RTA00000181AR.m.5.2 


359 


4059 


RTA00000177AF.n.l8.3 


360 


5178 


RTA00000178AF.n.l0.1 


361 


1120 


RTA00000118A.p.l5.3 


362 


6420 


RTA00000183AF.d.ll.l 


363 


13913 


RTA00000186AF.e.6.1 


364 




RTA00000192AF.C.2.1 


365 


3956 


RTA00000183AF.g.3.1 


366 


14364 


RTA00000183AF.g.l2.1 


367 


6880 


RTA00000191AF.m.20.1 


368 


84182 


RTA00000180AF.h.l9.1 


369 


2790 


RTA00000177AF.e.2.1 


370 


4561 


RTA00000184AF.i.21.1 


371 


8847 


RTAO000O180AF.b.l6.1 


372 


56020 


RTA00000193AF.g.2.1 


373 


1531 


RTA00000119A.O.3.1 


374 


6420 


RTA00000177AF.f.l0.3 


375 




RTA00000188AF.b.l2.1 


376 




RTA00000180AF.k.24.1 


377 




RTA00000184AF.a.8.1 


378 


2696 


RTA00000134A.m.l3.1 


379 


260 


RTA00000185AR.i.l2.2 


380 


11350 


RTA00000189AF.a.24.2 


381 


2428 


RTA00000123A.1.21.1 


382 


4313 


RTA00000122A.n.3.1 


383 




RTA00000184AF.p.3.1 


384 


697 


RTA00000188AF.d.6.1 



PCT/US98/27610 
Clone Name 
M00001561A:C05 
M00003833A:E05 
M00004111D:A08 
M00003846B:D06 
M00004035C:A07 
M00004327B:H04 
M00001416A:H01 
M00001429A:H04 
M00001552D:A01 
M00001608A:B03 
M00004295D:F12 
M00004172C:D08 
M00001340B.-A06 
M00001352A:E02 
M00001454A:A09 
M00001624C:F01 
M00001455B:E12 
M00001357D:D11 
M00001386C:B12 
M00001452A:D08 
M00001504D:G06 
M00001623D:F10 
M00004121B:G01 
M00001512D:G09 
M00001513C:E08 
M00004087D:A01 
M00001428A:H10 
M00001343C:F10 
M00001555D:G10 
M00001416B:H11 
M00004285B:E08 
M00001461A:D06 
M00001345A:E01 
M00003754C:E09 
M00001432C:F06 
M000O1544A:E06 
M000O1536A:B07 
M00001594B:H04 
M00003826B:A06 
M00001533A:C11 
M00001517A:B07 
M00001566B:D11 
M00003759B:B09 



149 



WO 99/33982 
SEQIDNO: Cluster ID Sequence Name 



385 


5619 


RTA00000188AF.I.9.1 


386 


4568 


RTA00000122A.d.l5.3 


387 




RTA00000177AF.i.6.2 


388 


5622 


RTA00000178AF.a.U.l 


389 


7514 


RTA00000184AF.k.21.1 


390 


5619 


RTA00000189AF.f.l7.1 


391 


7570 


RTA00000187AF.g.24.1 


392 


23358 


RTA00000190AF.O.21.1 


393 


23210 


RTA00000190AF.O.20.1 


394 


5192 


RTA00000184AF.k.2.1 


395 


13538 


RTA00000180AF.3.24.1 


396 




RTA00000189AF.h.l7.1 


397 




RTA00000192AF.O.11.1 


398 




RTA00000184AF.I.11.1 


399 


4718 


RTA00000189AF.g.5.1 


400 


14929 


RTA00000177AF.m.l.2 


401 


4908 


RTA00000192AF.j.2.1 


402 




RTA00000178AF.k.l6.1 


403 




RTA00000194AF.C.24.1 


404 


17732 


RTA00000178ARJ.2.2 


405 


17062 


80.Al.sp6:130208.Seq 


406 


11589 


80.Bl.sp6:130220.Seq 


407 


4443 


80.Cl.sp6:130232.Seq 


408 


39805 


80.Dl.sp6:130244.Seq 


409 


2790 


80.El.sp6:130256.Seq 


410 


23255 


80.Fl.sp6:130268.Seq 


411 


6420 


80.Gl.sp6:130280.Seq 


412 


5007 


80.Hl.sp6:130292.Seq 


413 


13576 


80.D2.sp6:130245.Seq 


414 


16927 


80.E2.sp6:130257.Seq 


415 


16985 


80.F2.sp6:130269.Seq 


416 


3584 


80.G2.sp6: 13028 l.Seq 


417 




80.H2.sp6:130293.Seq 


418 


7187 


80.A3.sp6:130210.Seq 


419 


16245 


80.D3.sp6:130246.Seq 


420 


8078 


80.E3.sp6:130258.Seq 


421 


14929 


80.F3.sp6:130270.Seq 


422 


14391 


80.G3.sp6:130282.Seq 


423 


4141 


80.B4.sp6:130223.Seq 


424 


2379 


80.C4.sp6:130235.Seq 


425 


5622 


80.D4.sp6:130247.Seq 


426 


945 


80.E4.sp6:130259.Seq 


427 


40132 


80.F4.sp6:130271.Seq 
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PCT/US98/27610 
Clone Name 
M00003796C:D05 
M00001513A:B06 
M00001350A:B08 
M00001362B:D10 
M00001558B:H11 
M00003853A:D04 
M00001677D:A07 
M00003974D:H02 
M00003974D:E07 
M00001557B:H10 
M00001415A:H06 
M00003867A:D10 
M00004205D:F06 
M00001559B:F01 
M00003857A:H03 
MOO0O1353D.D10 
M00004171D:B03 
M00001381D:E06 
M00004692A:H08 
M00001376B:G06 
M00001340B:A06 
M00001340D:F10 
M00001341A:E12 
M00001342B:E06 
M00001343C:F10 
M00001343D:H07 
M00001345A:E01 
M00001346A:F09 
M00001347A:B10 
M00001348B.-B04 
M00001348B:G06 
M00001349B:B08 
M00001350A:B08 
M00001350A:H01 
M00001352A:E02 
M00001353A:G12 
M00001353D:D10 
M00001355B:G10 
M00001361A:A05 
M00001361D:F08 
M00001362B:D10 
M00001362C:H11 
M00001365C:C10 



WO 99/33982 



SEQ ID NO: Cluster ID 


Sequence Name 


428 




80.G4.sp6:130283.Seq 


429 


6867 


80.H4.sp6:130295.Seq 


430 


7172 


80.A5.sp6:130212.Seq 


431 


17732 


80.B5.sp6:130224.Seq 


432 


39833 


80.C5.sp6:130236.Seq 


433 


1334 


80.D5.sp6:130248.Seq 


434 


39886 


80.E5.sp6:130260.Seq 


435 




80.F5.sp6:130272.Seq 


436 


22979 


80.G5.sp6:130284.Seq 


437 


39648 


80.H5.sp6:130296.Seq 


438 




80.B6.sp6:130225.Seq 


439 


5178 


80.C6.sp6:130237.Seq 


440 


2464 


80.D6.sp6:130249.Seq 


441 


7587 


80.E6.sp6:130261.Seq 


442 


5832 


80.F6.sp6:130273.Seq 


443 


16269 


80.G6.sp6:130285.Seq 


444 


6583 


80.H6.sp6:130297.Seq 


445 


4009 


80.A7.sp6:130214.Seq 


446 




80.B7.sp6:130226.Seq 


447 


39563 


80.C7.sp6:130238.Seq 


448 


5556 


80.D7.sp6:130250.Seq 


449 


9577 


80.E7.sp6:130262.Seq 


450 


7005 


80.F7.sp6:130274.Seq 


451 


8551 


80.G7.sp6:130286.Seq 


452 




80.H7.sp6:130298.Seq 


453 




80.A8.sp6:130215.Seq 


454 


13538 


80.B8.sp6:130227.Seq 


455 


8847 


80.C8.sp6:130239.Seq 


456 


36393 


80.D8.sp6:130251.Seq 


457 


9952 


80.E8.sp6:130263.Seq 


458 


9577 


80.G8.sp6:130287.Seq 


459 


15066 


80.H8.sp6:130299.Seq 


460 


10470 


80.A9.sp6:130216.Seq 


461 


22195 


80.B9.sp6:130228.Seq 


462 




80.C9.sp6:130240.Seq 


463 


4261 


80.D9.sp6:130252.Seq 


464 


84182 


80.E9.sp6:130264.Seq 


465 


40392 


80.H9.sp6:130300.Seq 


466 


16731 


80.C10.sp6:130241.Seq 


467 




80.D10.sp6:130253.Seq 


468 


13532 


80.E10.sp6:130265.Seq 


469 


8 


80.H1 0.sp6:l 30301 .Seq 


470 


36313 


80.All.sp6:130218.Seq 



151 



PCT/US98/27610 
Clone Name 

M00001368D:E03 

M00001370A:C09 

M00001371C:E09 

M00001376B:G06 

M00001378B:B02 

M00001379A:A05 

M00001380D:B09 

M00001381D:E06 

M00001382C:A02 

M00001383A:C03 

M00001384B:A11 

M00001386C:B12 

M00001387A:C05 

M00001387B:G03 

M00001388D:G05 

M00001389A:C08 

M00001394A:F01 

M00001396A:C03 

M00001400B:H06 

M00001402A:E08 

M00001407B:D11 

M00001409C:D12 

M00001410A:D07 

M00001412B:B10 

M00001414A:B01 

M00001414C:A07 

M00001415A:H06 

M00001416B:H11 

M00001417A:E02 

M00001418B:F03 

M00001421C:F01 

M00001423B:E07 

M00001424B:G09 

M00001425B:H08 

M00001426B:D12 

M00001426D:C08 

M00001428A:H10 

M00001429D:D07 

M00001442C:D07 

M00001443B:F01 

M00001445A:F05 

M00001448D:C09 

M00001448D:H01 



WO 99/33982 PCT7US98/2761 0 



SEQIDNO: Cluster ID 


Sequence Name 


Clone Name 


471 


5857 


80.Bll.sp6:130230.Seq 


M00001449A:A12 


472 


41633 


80.Cll.sp6:130242.Seq 


M00001449A:B12 


473 


36535 


80.Dll.sp6:130254.Seq 


M00001449A:G10 


474 


86110 


80.Ell.sp6:130266.Seq 


M00001449C:D06 


475 


32663 


8O.Fll.sp6:130278.Seq 


M00001450A:A11 


476 


27250 


80.Gll.sp6:130290.Seq 


M00001450A:D08 


477 


16970 


80.Hll.sp6:130302.Seq 


M00001452C:B06 


478 


16130 


80.A12.sp6:130219.Seq 


M00001453A:E11 


479 


16653 


80.B12.sp6:130231.Seq 


M00001453C:F06 


480 


7005 


80.C12.sp6:130243.Seq 


M00001454B:C12 


481 


13072 


80.F12.sp6:130279.Seq 


M00001455B:E12 


482 


9283 


80.G12.sp6:130291.Seq 


M00001455D:F09 


483 


23255 


100.Cl.sp6:131446.Seq 


M00001343D:H07 


484 


13576 


100.El.sp6:131470.Seq 


M00001347A:B10 


485 


7187 


100.C2.sp6:131447.Seq 


M00001350A:H01 


486 


14391 


100.E3.sp6:131472.Seq 


M00001355B:G10 


487 


945 


100.E4.sp6:131473.Seq 


M00001362C:H11 


488 


7172 


100.A5.sp6:131426.Seq 


M00001371C:E09 


489 


39648 


100.A6.sp6:131427.Seq 


M00001383A:C03 


490 


84182 


100.G9.sp6:131502.Seq 


M00001428A:H10 


491 


8 


100.Bll.sp6:131444.Seq 


M00001448D:C09 


492 


36535 


100.Dll.sp6:131468.Seq 


M00001449A.-G10 


493 


82498 


100.Fll.sp6:131492.Seq 


M00001450A:B12 


494 


16970 


100.C12.sp6:131457.Seq 


M00001452C:B06 


495 


16130 


100.D12.sp6:131469.Seq 


M00001453A:E11 


496 


7005 


121.Dl.sp6:131917.Seq 


M00001454B:C12 


497 




121.G6.sp6:131958.Seq 


M00001506D:A09 


498 


18957 


121.F7.sp6:131947.Seq 


M00001528A:F09 


499 


40044 


122.El.sp6:132121.Seq 


M00001621C:C08 


500 


5214 


122.C2.sp6:132098.Seq 


M00001630B:H09 


501 


6660 


122.B5.sp6:132089.Seq 


M00001679A:A06 


502 


13183 


123.D5.sp6:132305.Seq 


M00004114C:F11 


503 


6455 


123.E7.sp6:132319.Seq 


M00004157C:A09 


504 


5319 


123.F7.sp6:132331.Seq 


M00004169C:C12 


505 


11443 


123.A8.sp6:132272.Seq 


M00004185C:C03 


506 




123.C8.sp6:132296.Seq 


M00004191D:B11 


507 


8210 


123.E8.sp6:132320.Seq 


M00004197D:H01 


508 


9457 


123.Dll.sp6:132311.Seq 


M00004307C:A06 


509 


6420 


172.El.sp6:133925.Seq 


M00001345A:E01 


510 


16245 


172.D2.sp6:133914.Seq 


M00001352A.-E02 


511 


8078 


172.C3.sp6:133903.Seq 


M00001353A:G12 


512 


14929 


172.D3.sp6:133915.Seq 


M00001353D:D10 


513 


14391 


172.H3.sp6:133963.Seq 


M00001355B:G10 
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SEQ ID NO: Cluster ID 


Sequence Name 


514 


6583 


172.B8.sp6:133896.Seq 


515 


4009 


172.D8.sp6:133920.Seq 


516 




172.B9.sp6:133897.Seq 


517 




176.A3.sp6:134514.Seq 


518 


19267 


176.G3.sp6:134586.Seq 


519 


78091 


176.G5.sp6:134588.Seq 


520 


17055 


176.D6.sp6:134553.Seq 


521 


6539 


176.D9.sp6:134556.Seq 


522 




177.H4.sp6:134791.Seq 


523 


5257 


177.F5.sp6:134768.Seq 


524 


11494 


177.E6.sp6:134757.Seq 


525 




177.G7.sp6:134782.Seq 


526 


11451 


177.D8.sp6:134747.Seq 


527 


9283 


173.D2.SP6:134106.Seq 


528 


16283 


173.F3.SP6:134131.Seq 


529 


10539 


173.B5.SP6:134085.Seq 


530 


6420 


173.F5.SP6:134133.Seq 


531 


3956 


173.H5.SP6:134157.Seq 


532 




173.G7.SP6:134147.Seq 


533 


1577 


173.C9.SP6:134101.Seq 


534 


9635 


173.D9.SP6:134113.Seq 


535 


5192 


173.E9.SP6:134125.Seq 


536 


6539 


173.A12.SP6:134080.Seq 


537 


945 


180.C2.sp6:135940.Seq 


538 


7005 


180.H5.sp6:136003.Seq 


539 


39304 


180.G9.sp6:135995.Seq 


540 


27250 


180.B10.sp6:135936.Seq 


541 


35555 


184.A5.sp6:135530.Seq 


542 


19255 


184.B10.sp6:135547.Seq 


543 


6268 


184.C12.sp6:135561.Seq 


544 


3277 


217.El.sp6:139406.Seq 


545 


39171 


217.A12.sp6:139369.Seq 


546 


11460 


219.F2.sp6:139035.Seq 


547 


10539 


219.F6.sp6:139039.Seq 


548 


11476 


219.H8.sp6:139065.Seq 


549 


4016 


79.Al.sp6:130016.Seq 


550 


7674 


79.Cl.sp6:130040.Seq 


551 


3681 


79.El.sp6:130064.Seq 


552 


39304 


79.Fl.sp6:130076.Seq 


553 


82498 


79.Gl.sp6:130088.Seq 


554 


84328 


79.A2.sp6:130017.Seq 


555 


86859 


79.B2.sp6:130029.Seq 


556 


1120 


79.C2.sp6:130041.Seq 



153 



PCT7US98/27610 
Clone Name 

M00001394A:F01 

M00001396A:C03 

M00001400B:H06 

M00001632D:H07 

M00001645A:C12 

M00001679C:F01 

M00001682C:B12 

M00003844C:B11 

M00004121B:G01 

M00004146C:C11 

M00004172C:D08 

M00004205D:F06 

M00004214C:H05 

M00001455D:F09 

M00001467A:D08 

M00001499B:A11 

M00001504D:G06 

M00001512D:G09 

M00001544A:E06 

M00001556A:F11 

M00001557A:F01 

M00001557B:H10 

M00001579D:C03 

M00001362C:H11 

M00001410A:D07 

M00001450A:A02 

M00001450A:D08 

M00001528A:C04 

M00001545A:C03 

M00001551A:B10 

M00001624A:B06 

M00001644C-.B07 

M00001676B:F05 

M00001680D:F08 

M00003747D:C05 

M00001395A:C03 

M00001416A:H01 

M00001449A:D12 

M00001450A:A02 

M00001450A:B12 

M00001452A:B04 

M00001452A:B12 

M00001452A:D08 



WO 99/33982 



SEQ ID NO: Cluster ED 


Sequence Name 


557 


85064 


79.D2.sp6:130053.Seq 


558 


83103 


79.G2.sp6:130089.Seq 


559 


10145 


79.F3.sp6:130078.Seq 


560 


16283 


79.H3.sp6:130102.Seq 


561 


4568 


79.D4.sp6:130055.Seq 


562 


4313 


79.F4.sp6:130079.Seq 


563 


2428 


79.A5.sp6:130020.Seq 


564 


39423 


79.C5.sp6:130044.Seq 


565 


39174 


79.E5.sp6:130068.Seq 


566 


22113 


79.F5.sp6:130080.Seq 


567 


19829 


79.H5.sp6:130104.Seq 


568 


13864 


79.B6.sp6:130033.Seq 


569 


1058 


79.F6.sp6:130081.Seq 


570 


4015 


79.G6.sp6:130093.Seq 


571 


39180 


79.A7.sp6:130022.Seq 


572 


307 


79.C7.sp6:130046.Seq 


573 


39458 


79.D7.sp6:130058.Seq 


574 


39490 


79.G7.sp6:130094.Seq 


575 


39486 


79.B8.sp6:130035.Seq 


576 


39380 


79.E8.sp6:130071.Seq 


577 


1399 


79.G8.sp6:130095.Seq 


578 


39391 


79.A9.sp6:130024.Seq 


579 


6268 


79.G9.sp6:130096.Seq 


580 




377.F4.sp6:141957.Seq 


581 


2448 


89.Al.sp6:130667.Seq 


582 


1531 


89.Cl.sp6:130691.Seq 


583 


19 


89.Dl.sp6:130703.Seq 


584 


38759 


89.Fl.sp6:130727.Seq 


585 


39508 


89.Gl.sp6:130739.Seq 


586 


16283 


89.Hl.sp6:130751.Seq 


587 


39442 


89.A2.sp6:130668.Seq 


588 


7589 


89.B2.sp6:130680.Seq 


589 




89.C2.sp6:130692.Seq 


590 


12081 


89.D2.sp6:130704.Seq 


591 


19105 


89.E2.sp6:130716.Seq 


592 


1037 


89.F2.sp6:130728.Seq 


593 


39425 


89.G2.sp6:130740.Seq 


594 


39478 


89.H2.sp6:130752.Seq 


595 




89.B3.sp6:130681.Seq 


596 




89.C3.sp6:130693.Seq 


597 


18699 


89.D3.sp6:130705.Seq 


598 


7206 


89.E3.sp6:130717.Seq 


599 


2623 


89.F3.sp6:130729.Seq 



154 



PO7US98/27610 
Clone Name 
M00001452A:F05 
M00001454A:A09 
M00001465A:B11 
M00001467A:D08 
MOOOO1513A:B06 
M00001517A:B07 
M00001533A:C11 
M00001535A:F10 
M00001541A:H03 
M00001542A:A09 
M00001544A:G02 
M00001545A:D08 
M00001548A:H09 
M00001549A:B02 
MOO001551A:FO5 
M00001552A:B12 
M00001552A:D11 
M00001557A:F03 
M00001561A:C05 
M00001587A:B11 
M00001604A:B10 
M00001604A:F05 
M00001551A:B10 
M00004692A:H08 
M00001460A:F06 
M00001461A:D06 
M00001463C:B11 
M00001467A:B07 
M00001467A:D04 
M00001467A:D08 
M00001467A:E10 
M00001468A:F05 
M00001469A:A01 
M00001469A:C10 
M00001469A:H12 
M00001470A:B10 
M00001470A:C04 
M00001471A:B01 
M00001487B:H06 
M00001488B:F12 
M00001490B:C04 
M00001494D:F06 
M00001497A.G02 



WO 99/33982 



SEQ ID NO: Cluster ID 


Sequence Name 


600 


10539 


89.G3.sp6:130741.Seq 


601 


5336 


89.H3.sp6:130753.Seq 


602 


2623 


89.A4.sp6:130670.Seq 


603 


9443 


89.B4.sp6:130682.Seq 


604 


9685 


89.C4.sp6:130694.Seq 


605 




89.D4.sp6:130706.Seq 


606 


10185 


89.E4.sp6:130718.Seq 


607 


6974 


89.F4.sp6:130730.Seq 


608 


6420 


89.G4.sp6:130742.Seq 


609 




89.H4.sp6:130754.Seq 


610 




89.A5.sp6:130671.Seq 


611 


39168 


89.B5.sp6:130683.Seq 


612 


39412 


89.C5.sp6:130695.Seq 


613 


39186 


89.D5.sp6:130707.Seq 


614 


3956 


89.E5.sp6:130719.Seq 


615 




89.F5.sp6: 13073 l.Seq 


616 


14364 


89.G5.sp6:130743.Seq 


617 


40044 


89.H5.sp6:130755.Seq 


618 


8952 


89.A6.sp6:130672.Seq 


619 


35555 


89.B6.sp6.130684.Seq 


620 


18957 


89.C6.sp6:130696.Seq 


621 


8358 


89.D6.sp6:130708.Seq 


622 


38085 


89.E6.sp6:130720.Seq 


623 




89.F6.sp6:130732.Seq 


624 


3990 


89.G6.sp6:130744.Seq 


625 


16921 


89.H6.sp6:130756.Seq 


626 


5321 


89.B7.sp6:130685.Seq 


627 


4119 


89.C7.sp6:130697.Seq 


628 


20212 


89.E7.sp6:130721.Seq 


629 


2696 


89.F7.sp6:130733.Seq 


630 


39392 


89.G7.sp6:130745.Seq 


631 


39420 


89.H7.sp6:130757.Seq 


632 


3389 


89.A8.sp6:130674.Seq 


633 


8286 


89.B8.sp6:130686.Seq 


634 


3765 


89.C8.sp6:130698.Seq 


635 


39453 


89.E8.sp6:130722.Seq 


636 




89.F8.sp6:130734.Seq 


637 




89.H8.sp6:130758.Seq 


638 


6974 


89A9.sp6:130675.Seq 


639 




89.B9.sp6:130687.Seq 


640 


19255 


89.C9.sp6:130699.Seq 


641 


1267 


89.D9.sp6: 13071 l.Seq 


642 


5892 


89.E9.sp6:130723.Seq 



155 



PCT/US98/27610 
Clone Name 
M00001499B:A11 
M00001500A-.C05 
M00001500A:E11 
M00001500C.E04 
M00001501D:C02 
MOO0015O4A:E01 
M00001504C:A07 
M00001504C:H06 
M00001504D:G06 
M00001505C:C05 
M00001506D:A09 
M00001507A:H05 
M00001511A:H06 
M00001512A:A09 
M00001512D:G09 
M00001513B:G03 
M00001513C:E08 
M00001514C:D11 
M00001518C:B11 
M00001528A:C04 
M00001528A:F09 
M00001528B:H04 
MOO001531A:D01 
M00001531A:H11 
M00001532B:A06 
M00001534A:C04 
M00001534A:F09 
M00001534C:A01 
M00001535A:C06 
M00001536A:B07 
M00001536A:C08 
M00001537A:F12 
M00001537B:G07 
M00001540A:D06 
M00001541A:D02 
M00001542A:E06 
M00001542B:B01 
M00001544A:E06 
M00001544B:B07 
M00001545A:B02 
M00001545A:C03 
M00001546A:G11 
M00001548A:E10 



WO 99/33982 
SEQIDNO: Cluster ID Sequence Name 



643 


4193 


89.G9.sp6:130747.Seq 


644 


16347 


89.H9.sp6:130759.Seq 


645 


7239 


89.A10.sp6:130676.Seq 


646 


5175 


89.B10.sp6:130688.Seq 


647 


22390 


89.C10.sp6:130700.Seq 


648 


3266 


89.D10.sp6:130712.Seq 


649 


5708 


89.E10.sp6:130724.Seq 


650 




89.F10.sp6:130736.Seq 


651 


8298 


89.G10.sp6:130748.Seq 


652 


4573 


89.H10.sp6:130760.Seq 


653 


22814 


89.All.sp6:130677.Seq 


654 


39539 


89.Bll.sp6:130689.Seq 


655 


39195 


89.Cll.sp6:130701.Seq 


656 


4561 


89.Dll.sp6:130713.Seq 


657 


9244 


89.Ell.sp6:130725.Seq 


658 


1577 


89.Fll.sp6:130737.Seq 


659 


4386 


89.Hll.sp6: 130761. Seq 


660 


11294 


89.A12.sp6:130678.Seq 


661 


5192 


89.D12.sp6:130714.Seq 


662 


8761 


89.E12.sp6:130726.Seq 


663 




89.F12.sp6:130738.Seq 


664 


7514 


89.G12.sp6:130750.Seq 


665 




89.H12.sp6:130762.Seq 


666 


6558 


90.Al.sp6:130859.Seq 


667 


102 


90.Bl.sp6: 130871. Seq 


668 




90.Dl.sp6:130895.Seq 


669 


5749 


90.El.sp6:130907.Seq 


670 


6539 


90.Gl.sp6:130931.Seq 


671 


6293 


90.A2.sp6:130860.Seq 


672 




90.C2.sp6:130884.Seq 


673 


260 


90.D2.sp6:130896.Seq 


674 


4837 


90.E2.sp6:130908.Seq 


675 


10470 


90.F2.sp6:130920.Seq 


676 


16999 


90.G2.sp6:130932.Seq 


677 


22794 


90.H2.sp6:130944.Seq 


678 


11465 


90. A3 .sp6: 130861. Seq 


679 


7802 


90.B3.sp6: 130873. Seq 


680 


22155 


90.C3.sp6:130885.Seq 


681 




90.D3.sp6:130897.Seq 


682 


13157 


90.E3.sp6:130909.Seq 


683 


17004 


90.F3.sp6:130921.Seq 


684 


40314 


90.G3.sp6:130933.Seq 


685 


40044 


90.H3.sp6:130945.Seq 



PCT/US98/27610 
Clone Name 

M00001549B:F06 

M00001549C:E06 

MO00O1550A:A03 

M00001550A:G01 

M00001551A:G06 

MO00O1551C:G09 

M00001552B:D04 

M00001552D:A01 

MO0OO1553A:HO6 

M00001553B:F12 

M00001553D.D10 

M00001555A:B02 

M00001555A:C01 

M00001555D:G10 

M00001556A:C09 

M00001556A:F11 

M00001556B:C08 

M00001556B:G02 

M00001557B:H10 

M00001557D:D09 

M00O01558A:H05 

M00001558B:H11 

M00001559B:F01 

M00001560D:F10 

M00001563B:F06 

M00001566B-.D11 

M00001571C:H06 

M00001579D:C03 

M0O001583D:AtO 

M00001590B:F03 

M00001594B:H04 

M00001597C:H02 

M00001597D:C05 

M00001598A:G03 

M00001601A:D08 

M00001607A:E11 

M00001608A:B03 

M00001608B:E03 

M00001608D:All 

M00001614C:F10 

M00001617C:E02 

M00001619C:F12 

M00001621C:C08 



WO 99/33982 



SEQIDNO: Cluster ID 


Sequence Name 


686 


13913 


90.A4.sp6:130862.Seq 


687 


3277 


90.B4.sp6:130874.Seq 


688 


4309 


90.C4.sp6:130886.Seq 


689 


5214 


90.D4.sp6:130898.Seq 


690 




90.E4.sp6: 13091 0.Seq 


691 


39171 


90.F4.sp6:130922.Seq 


692 


19267 


90.G4.sp6:130934.Seq 


693 


4665 


90.H4.sp6:130946.Seq 


694 




90.A5.sp6:130863.Seq 


695 


23201 


90.B5.sp6:130875.Seq 


696 


76760 


90.C5.sp6:130887.Seq 


697 


23218 


90.D5.sp6:130899.Seq 


698 


35702 


90.E5.sp6:130911.Seq 


699 


6468 


90.F5.sp6:l 30923. Seq 


700 


14367 


90.G5.sp6:130935.Seq 


701 


7015 


90.H5.sp6:130947.Seq 


702 


8773 


90.A6.sp6:130864.Seq 


703 


11460 


90.B6.sp6:130876.Seq 


704 


7570 


90.D6.sp6:130900.Seq 


705 


4416 


90.E6.sp6:130912.Seq 


706 


6660 


90.F6.sp6:130924.Seq 


707 




90.H6.sp6:130948.Seq 


708 


26875 


90.A7.sp6:130865.Seq 


709 


6298 


90.B7.sp6:130877.Seq 


710 


78091 


90.C7.sp6:130889.Seq 


711 


10751 


90.D7.sp6: 130901. Seq 


712 


10539 


90.F7.sp6:130925.Seq 


713 


17055 


90.G7.sp6:130937.Seq 


714 


5382 


90.A8.sp6:130866.Seq 


715 


4393 


90.B8.sp6:130878.Seq 


716 


67252 


90.C8.sp6:130890.Seq 


717 


40108 


90.D8.sp6:130902.Seq 


718 


11476 


90.E8.sp6:130914.Seq 


719 




90.F8.sp6:130926.Seq 


720 


697 


90.G8.sp6:130938.Seq 


721 




90.H8.sp6:130950.Seq 


722 


17076 


90.A9.sp6:130867.Seq 


723 


3108 


90.B9.sp6:130879.Seq 


724 


67907 


90.C9.sp6: 130891. Seq 


725 




90.D9.sp6:130903.Seq 


726 


11350 


90.F9.sp6:130927.Seq 


727 


7899 


90.H9.sp6: 130951. Seq 


728 


7798 


90.A10.sp6:130868.Seq 



157 



PCT/US98/27610 
Clone Name 

M00001623D:F10 

M00001624A:B06 

M00001624C:F01 

M00001630B:H09 

M00001632D.H07 

M00001644C:B07 

M00001645A:C12 

M00001648C:A01 

M00001651A:H01 

M00001657D:C03 

M00001657D:F08 

M00001662C:A09 

M00001663A:E04 

M00001669B:F02 

M00001670C:H02 

M00001673C:H02 

M00001675A:C09 

M00001676B:F05 

M00001677D:A07 

M00001678D:F12 

M00001679A:A06 

M00001679A:F06 

M00001679A:F10 

M00001679B:F01 

M00001679C:F01 

M00001679D:D03 

M00001680D:F08 

M00001682C:B12 

M00001688C:F09 

M00001693C:G01 

M00001716D:H05 

M00003741D:C09 

M00003747D:C05 

M00003754C:E09 

M00003759B:B09 

M00003761D:A09 

M00003762C:B08 

M00003763A:F06 

M00003774C:A03 

M00003784D:D12 

M00003826B:A06 

M00003837D:A01 

M00003839A:D08 



WO 99/33982 
SEQ ID NO: Cluster ID Sequence Name 



729 


6539 


90.B10.sp6:130880.Seq 


730 


6874 


90.C10.sp6:130892.Seq 


731 




9O.D10.sp6:13O904.Seq 


732 


13595 


90.E10.sp6:130916.Seq 


733 


5619 


90.F10.sp6:130928.Seq 


734 


10515 


90.G10.sp6:130940.Seq 


735 


4622 


90.H10.sp6:130952.Seq 


736 


3389 


90.All.sp6:130869.Seq 


737 


4718 


90.Bll.sp6:130881.Seq 


738 




90.Cll.sp6:130893.Seq 


739 


12977 


90.Fll.sp6:130929.Seq 


740 


8479 


90.Gll.sp6:130941.Seq 


741 




90.Hll.sp6:130953.Seq 


742 


7798 


90.A12.sp6:130870.Seq 


743 


5345 


90.B12.sp6:130882.Seq 


744 


31587 


90.C12.sp6:130894.Seq 


745 


14507 


90.D12.sp6:130906.Seq 


746 


13576 


90.F12.sp6:130930.Seq 


747 




90.G12.sp6.130942.Seq 


748 


9285 


90.H12.sp6:130954.Seq 


749 


39809 


99.Al.sp6:131230.Seq 


750 


16317 


99.Bl.sp6:131242.Seq 


751 


8672 


99.Cl.sp6:131254.Seq 


752 


12532 


99.Dl.sp6:131266.Seq 


753 


3900 


99.El.sp6:131278.Seq 


754 


23255 


99.Fl.sp6:131290.Seq 


755 


24488 


99.C2.sp6:131255.Seq 


756 


40122 


99.D2.sp6:131267.Seq 


757 


23210 


99.E2.sp6:131279.Seq 


758 


23358 


99.F2.sp6: 13 1291. Seq 


759 


3430 


99.A3.sp6:131232.Seq 


760 


2433 


99.B3.sp6:131244.Seq 


761 


9105 


99.C3.sp6:131256.Seq 


762 


6124 


99.D3.sp6:131268.Seq 


763 


40073 


99.E3.sp6:131280.Seq 


764 


37285 


99.H3.sp6:131316.Seq 


765 


17036 


99.A4.sp6:131233.Seq 


766 


3706 


99.C4.sp6:131257.Seq 


767 




99.D4.sp6:131269.Seq 


768 


15069 


99.F4.sp6:131293.Seq 


769 


9285 


99.H4.sp6:131317.Seq 


770 


6880 


99.A5.sp6:131234.Seq 


771 


5325 


99.C5.sp6:131258.Seq 



158 



PCT/US98/27610 
Clone Name 

M00003844C:B11 

M00003846B:D06 

M00003851B:D08 

M00003851B:D10 

M00003853A:D04 

M00003853A:F12 

M00003856B:C02 

M00003857A:G10 

M00003857A:H03 

M00003867A:D10 

M00003875B:F04 

M00003875C:G07 

M00003875D:D11 

M00003876D:E12 

M00003879B:C11 

M00003879B:D10 

M00003879D:A02 

M00003885C:A02 

M00003891C.H09 

M00003906C:E10 

M00003907D:A09 

M00003907D:H04 

M00003909D:C03 

M00003912B:D01 

M00003914C:F05 

M00003922A:E06 

M00003968B:F06 

M00003970C:B09 

M00003974D:E07 

M00003974D:H02 

M000O3981A:E10 

M00003982C:C02 

M00003983A:A05 

M00004028D:A06 

M00004028D:C05 

M00004035C:A07 

M00004035D:B06 

M00004068B:A01 

M000O4072A:CO3 

M00004081C:D10 

M00004086D:G06 

M00004087D:A01 

M00004093D:B12 



WO 99/33982 PCT/US98/27610 



SEQ ID NO: Cluster ID 


Sequence Name 


Clone Name 


772 


7221 


99.D5.sp6:131270.Seq 


M00004105C:A04 


773 


4937 


99.E5.sp6:131282.Seq 


M00004108A:E06 


774 


6874 


99.F5.sp6:131294.Seq 


M00004111D:A08 


775 


13183 


99.G5.sp6:131306.Seq 


M00004114C:F11 


776 




99.H5.sp6:131318.Seq 


M00004121B:G01 


111 


13272 


99.A6.sp6:131235.Seq 


M00004138B:H02 


778 


5257 


99.B6.sp6:131247.Seq 


M00004146C:C11 


779 


6455 


99.D6.sp6: 131 271. Seq 


M00004157C:A09 


780 


5319 


99.E6.sp6:131283.Seq 


M00004169C:C12 


781 


4908 


99.F6.sp6:131295.Seq 


M00004171D:B03 


782 


11494 


99.G6.sp6:131307.Seq 


M00004172C:D08 


783 


11443 


99.A7.sp6:131236.Seq 


M00004185C:C03 


784 




99.B7.sp6:131248.Seq 


M00004191D:B11 


785 


8210 


99.C7.sp6:131260.Seq 


M00004197D:H01 


786 


14311 


99.D7.sp6:131272.Seq 


M00004203B:C12 


787 




99.E7.sp6:131284.Seq 


M00004205D:F06 


788 


12971 


99.B8.sp6:131249.Seq 


M00004223D:E04 


789 


6455 


99.C8.sp6: 13 1261. Seq 


M00004229B:F08 


790 


7212 


99.D8.sp6:131273.Seq 


M00004230B:C07 


791 


4905 


99.H8.sp6: 13 1321. Seq 


M00004269D:D06 


792 


16914 


99.A9.sp6:131238.Seq 


M00004275C:C11 


793 


16921 


99.D9.sp6:131274.Seq 


M00004295D:F12 


794 


13046 


99.E9.s P 6:131286.Seq 


M00004296C:H07 


795 


9457 


99.F9.sp6:131298.Seq 


M00004307C:A06 


796 


26295 


99.G9.sp6:131310.Seq 


M00004312A:G03 


797 


21847 


99.H9.sp6:131322.Seq 


M00004318C:D10 


798 




99.H10.sp6: 13 1323. Seq 


M00004505D:F08 


799 




99.Bll.sp6: 13 1252.Seq 


M00004692A:H08 


800 




99.Dll.sp6:131276.Seq 


M00005180C:G03 


801 


39304 


RTA000001 1 8AJ.2 1 . 1 .Seq_THC 1 5 1 859 




802 


2428 


RTAOOOOO 123 A.1.2 1 . 1 .Seq_THC205063 




803 


1058 


RTA00000126A.e.20.3.Seq_THC217534 




804 


5097 


RTAOOOOO 134A.k. 1 . 1 .Seq_THC2 1 5869 




805 


20212 


RTAOOOOO 134A.1.22. 1 .Seq_THC 1 28232 




806 


23255 


RTAOOOOO 1 77AF.e. 1 4.3 .Seq_THC228776 




807 


2790 


RTAOOOOO 1 77AF.e.2. 1 .Seq_THC22946 1 




808 


6420 


RTAOOOOO 1 77 AF.f. 1 0.3 .Seq_THC226443 




809 


4059 


RTAOOOOO 1 77AF.n. 1 8.3.Seq_THC 1 2305 1 




810 




RTAOOOOO 1 79AF.j .13.1 .Seq_THC 1 05720 




811 


9952 


RTAOOOOO 1 80AF.C.20. 1 .Seq_THC 1 62284 




812 


13238 


RTAOOOOO 1 8 1 AF.m.4. 1 .Seq_THCl 4069 1 




813 


9685 


RTAOOOOO 1 83AF.C. 11.1 .Seq_THC 1 09544 




814 




RTA00000183AF.c.24.1.Seq_THC125912 
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815 


6420 


RTA00000 1 83 AF.d. 1 1 . 1 .Seq_THC226443 


816 


6974 


RTA000001 83AF.d.9.1 .Seq_THC223 129 


817 


40044 


RTA000001 83AF.g.22.1 .Seq_THC232899 


818 




RTA000001 83 AF.g.9. 1 .Seq_THC 1 98280 


819 


5892 


RTA000001 84AF.d. 11.1 .Seq_THC 161 896 


820 


40044 


RTA000001 86AF.d. 1 . 1 .Seq_THC232899 


821 




RTA000001 86AF.h. 14. 1 .Seq_THC 1 12525 


822 


19267 


RTA00000 1 86AF.1. 12.1 .Seq_THC 1 78 1 83 


823 


8773 


RTA00000 1 87AF.f.24. 1 .Seq_THC220002 


824 


7570 


RTA000001 87AF.g.24.1 .SeqJTHC 1 68636 


825 


11476 


RTA00000 1 87 AF.p. 1 9. 1 .SeqJTHC 1 08482 


826 




RTA000001 88 AF.d. 1 1 .1 .Seq_THC2 12094 


827 


17076 


RTA000001 88 AF.d.2 1 . 1 .Seq_THC208760 


828 


697 


RTA000001 88AF.d.6. 1 .Seq_THCl 78884 


829 


67907 


RTA000001 88AF.g. 11.1 .SeqJTHC 123222 


830 


5619 


RTA00000188AF.I.9.1.Seq_THC167845 


831 


4718 


RTAOOOOO 1 89AF.g.5. 1 .SeqJTHC 1 96 1 02 


832 


39809 


RTAOOOOO 1 90AF.e.3 . 1 .SeqJTHC 1 502 1 7 


833 


23255 


RTA000001 90AFJ.4. 1 .Seq_THC228776 


834 


40122 


RTAOOOOO 1 90AF.n.23. 1 .SeqJTHC 1 09227 


835 


23210 


RTA00000190AF.o.20.1.Seq_THC207240 


836 


23358 


RTAOOOOO 1 90AF.O.2 1 . 1 .Seq_THC207240 


837 


5693 


RTAOOOOO 190 AF.p. 1 7.2.Seq_THC 1 733 1 8 


838 


2433 


RTAOOOOO 1 9 1 AF.a. 1 5.2.Seq_THC79498 


839 


5257 


RTAOOOOO 1 92AF.0. 1 .Seq_THC2 1 3 833 


840 


16392 


RTAOOOOO 1 92AF.1. 1 . 1 .Seq_THC20207 1 


841 




RTAOOOOO 1 93 AF.c.2 1 . 1 .Seq_THC222602 


842 


26295 


RTAOOOOO 1 93 AF.i.24.2.Seq_THC 1 97345 


843 




RTA000001 93 AF.m.5. 1 .SeqJTHC 173318 


844 




RTAOOOOO 1 93 AF.n. 1 5.1 .Seq_THC2 1 5687 
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1111111 
III!!!! 


Nearest 
Neighbor 
(BlastN vs. 
Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


1 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


2 


! <NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


3 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


4 


<NONE> 


<NONE> 


<NONE> 


BAR3_CHITE 


BALBIANI RING 
PROTEIN 3 
PRECURSOR>PIR2:S 
08167 Balbiani ring 3 
protein - midge 
(Chironomus 
tentans)>GP:CTBR3_l 
L^tenians Daioiani ring 
3 (BR3) gene 


1 


5 


<NONE> 


<NONE> 


<NONE> 


CYAA PODA 
N 


ADENYLATE 
CYCLASE (EC 
4.6.1.1) (ATP 
PYROPHOSPHATE- 
LYASE) (ADENYLYL 
CYCLASE)>PIR2:JC4 
747 adenylate cyclase 
(EC 4.6.1.1)- 
Podospora 

anserina>GP:PANADC 
Y_l Podospora 
anserina adenyl cyclase 
gene, exons 1-4 


1 


6 


<NONE> 


<NONE> 


<NONE> 


VP03 HSVSA 


PROBABLE 
MEMBRANE 
ANTIGEN 3 
(TEGUMENT 
PROTEIN)>PIR2:C36 
806 hypothetical 
protein ORF3 - 
saimiriine herpesvirus 1 
(strain 

11)>GP:HSGEND_3 
Herpesvirus saimiri 
complete genome 
DNA; ORF 03; 
similarity to ORF 75 
and EBV BNRF1 


0.97 
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Nearest 
Neighbor 
(BlastN vs. 
Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


7 


<NONE> 


<NONE> 


<NONE> 


ATFCA2J8 


Arabidopsis thaliana 
DNA chromosome 4, 
ESSA I contig 
fragment No; 2; 
Hydroxyproline-rich 
glycoprotein homolog; 
Similarity to 
hydroxyproline-rich 
glycoprotein precursor- 
common tobacco 


0.93 


8 


<NONE> 


<NONE> 


<NONE> 


DHAL ASPN 
G 


ALDEHYDE 
DEHYDROGENASE 
(EC 1.2.1.3) 
(ALDDH)>GP:ASNA 
LDAA_1 Aspergillus 
niger aldehyde 
dehydrogenase (aldA) 
gene, complete cds 


0.9 


9 


<NONE> 


<NONE> 


<NONE> 


NCU50264J 


Neurospora crassa two- 
component histidine 
kinase (nik-1) gene, 5* 
region and partial cds 


0.86 


10 


<NONE> 


<NONE> 


<NONE> 


NEUG BOVI 
N 


NEUROGRANIN 
(P17)(B-50 
IMMUNOREACTIVE 
C-KINASE 

SUBSTRATE) j 
(BICKS) 

(FRAGMENT)>PIR2: 
A3 9034 neurogranin - 
bovine (fragment) 


0.82 


11 


<NONE> 


<NONE> 


<NONE> 


HUMBYSTIN 
_1 


Homo sapiens bystin 
mRNA, complete cds 


0.81 


12 


<NONE> 


<NONE> 


<NONE> 


BTBMP1_1 


Bos taurus BMP1 gene, 
partial sequence; Bone 
morphogenetic protein 
1 


0.69 


13 


<NONE> 


<NONE> 


<NONE> 


TCCYSPROT 
_1 


T;congolense mRNA 
for (prepro) cysteine 
proteinase 


0.56 


14 


<NONE> 


<NONE> 


<NONE> 


P60_LISIV 


PROTEIN P60 
PRECURSOR 
(INVASION- 
ASSOCIATED 
PROTEIN)>GP:LISIA 
PRELBJ Listeria 


0.15 
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Nearest 
Neighbor 
(BlastN vs. 
Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 












ivanovii extracellular 
protein homologue 
(iap) gene, complete 
cds 




15 


<NONE> 


<NONE> 


<NONE> 


HEX_ADE31 


HEXON PROTEIN 
(LATE PROTEIN 2) 
(FRAGMENT)>PIR2: 
S3 72 17 hexon protein - 
human adenovirus 3 1 
(fragment)>GP:HSAT3 
1H_1 H;sapiens 
adenovirus type 31 
hexon gene; Hexon 
protein; Internal 
fragment containing 
hypervariable regions 


0.15 


16 


<NONE> 


<NONE> 


<NONE> 


HSU77493J 


Human Notch2 mRNA, 
partial cds; 
Transmembrane 


0.13 


17 


<NONE> 


<NONE> 


<NONE> 


CYB_PARTE 


CYTOCHROME B 
(EC 

1.10.2.2)>PIR2:S07743 
cytochrome b - 
Paramecium tetraurelia 
mitochondrion 
(SGC6)>GP:MIPAGE 
N_ 19 Paramecium 
aurelia mitochondrial 
complete genome ; 
Apocytochrome b (AA 
1-391) 


0.078 


18 


<NONE> 


<NONE> 


<NONE> 


HUMERB27 
1 


Human c-erbB-2 gene, 
exon 7; C-erb-2 protein 


0.054 


19 


<NONE> 


<NONE> 


<NONE> 


DMTRXIII_2 


D;melanogaster DNA 
for trxl and trxll genes; 
Trithorax protein trxl; 
Trithorax; 

putative>GP:DMTTHO 
RAXJ2 

D;melanogaster DNA 
for (putative) trithorax 
protein; Predicted 
trithorax protein 


0.047 



163 



WO 99/33982 



Table 2 



PCMJS98/27610 





Nearest 
Neighbor 
(BlastN vs. 
Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


20 


<NONE> 


<NONE> 


<NONE> 


CELB0281_5 


Caenorhabditis elegans 
cosmid B0281; Similar 
to reverse transcriptases 


0.043 


21 


<NONE> 


<NONE> 


<NONE> 


MOTY VIBP 
A 


SODIUM-TYPE 
FLAGELLAR 
PROTEIN MOTY 
PRECURSOR>GP:VP 
U06949_4 Vibrio 
parahaemolyticus 
BB22 RNase T (mt) 
gene and flagellar 
motor component 
(motY) gene, complete 
cds 


0.041 


22 


<NONE> 


<NONE> 


<NONE> 


A56263 


beta-galactosidase (EC 
3.2.1.23) isozyme 12- 
Arthrobacter sp. (strain 
B7)>GP:ASU17417_1 
Arthrobacter sp; beta- 
galactosidase gene, 
complete cds 


0.04 


23 


<NONE> 


<NONE> | 


<NONE> 


GSA_PSEAE 


GLUTAMATE-1- 
SEMIALDEHYDE 
2, 1 -AMINOMUTASE 
(EC 5.4.3.8) (GSA) 
(GLUTAMATE- 1 - 
SEMIALDEHYDE 
AMINOTRANSFERA 
SE) (GSA- 
AT)>PIR2:S57898 
glutamate 1- 
semialdehyde 2,1- 
aminomutase - 
Pseudomonas 
aeruginosa>GP:PAHE 
ML__1 P;aeruginosa 
hemL gene; Glutamate 
1-sem 


0.038 


24 * 


<NONE> 


<NONE> 


<NONE> 


S16323 


hypothetical protein - 
Arabidopsis 
thaliana>GP:ATHBl_l 
A;thaliana homeobox 
gene Athb-l mRNA; 
Open reading frame 


0.035 
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Nearest 
Neighbor 
(BlastN vs. 
Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


25 


<NONE> 


<NONE> 


<NON£> 


IRS1 JIAT 


INSULIN RECEPTOR 
SUBSTRATE- 
1>PIR2:S 16948 
hypothetical protein 
IRS-1 - 

rat>GP:RNIRSHRMJ 
R;Norvegicus IRS-1 
mRNA for insulin- 
receptor; During insulin 
stimulation, undergoes 
tyrosine 

phosphorylation and 
binds 

phosphatidyl inositol 3- 
kinase 


0.027 


26 


<NONE> 


<NONE> 


<NONE> 


CEM02G9_2 


Caenorhabditis eiegans 
cosmid M02G9; 
M02G9;1; Similar to 
keratin like protein; 
cDNA EST 
yk308gl 1;5 comes 
from this gene; cDNA 
EST yk208ell;5 comes 
from this gene; cDNA 
EST yk208ell;3 comes 


0.0088 


27 


<NONE> 


<NONE> 


<NONE> 


S75490_3 


competence region: 
iga=IgA protease, 
cornA=transformation 
competence [Neisseria 
gonorrhoeae, MS11, 
Genomic, 3 genes, 
zooh nij 


0.0041 


28 


<NONE> 


<NONE> 


<NONE> 


EXTN TOBA 
C 


EXTENSIN 
PRECURSOR (CELL 
WALL 

HYDROXYPROLINE- 
RICH 

GLYCOPROTEIN)^! 
R2:S06733 
hydroxyproline-rich 
glycoprotein precursor 
- common 

tobacco>GP:NTEXTJ 
Tobacco HRGPnt3 
gene for extensin; 
Extensin (A A 1-620) 


0.0025 
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111111 

.'v.f 


Nearest 
Neighbor 
(BlastN vs. 
Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


29 


<NONE> 


<NONE> 


<NONE> 


HPCEGS 1 


HeDatitis C virus 
complete genome 
sequence; Polyprotein 


0.0014 


30 


<NONE> 


<NONE> 


<NONE> 


HHVBC_4 


Human hepatitis virus 
(genotype C, HMA) 
preSl,preS2, S, C, X, 
antigens, core antigen, 
X protein and 
polymerase 


0.00093 


31 


<NONt> 


<NUNt> 


<NONE> 


HSLTGFBP4 
1 


Homo sapiens mRNA 
for latent transforming 
growth factor-beta 
binding protein-4; 
Latent TGF-beta 
binding protein-4 


O.OUOol 




<NUINt> 


<NU!Nd> 


<NUNb> 




transposase - 
Synechocystis sp. (PCC 
6803)>GP:D90909JO 
8 Synechocystis sp; 
PCC6803 complete 
genome, 11/27, 
1311235- 1430418; 
Transposase; 
ORF ID:slr2062 


U.UUUj 1 


33 








VJlxlN IVUJUo 

E 


riP A XJT II FMQ 
VJKAJNULIINo 

PRECURSOR 
(ACROGRANIN)>GP: 
MUSAPJ Mouse gene 
for acrogranin 
precursor, complete cds 


v.UUUZZ 


34 


<NONE> 


<NONE> 


<NONE> 


CA21 MOUS 
E 


PROCOLLAGEN 
ALPHA 2(1) CHAIN 
PRECURSOR>PlR2 : A 
43291 collagen alpha 
2(1) chain precursor - 
mouse>GP:MMCOLl 
A2J Mouse COL1A2 
mRNA for pro-alpha- 
2(1) collagen 


0.00016 


35 i 


<NONE> 


<NONE> 


<NONE> 


MMMHC29N 
7_2 


Mus musculus major 
histocompatibility 
locus class III 
region:butyrophilin-like 
protein gene, partial 
cds;Notch4,PBX2, 
RAGE, lysophatidic 
acid acyl transferase- 


8.00E-05 
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Nearest 
Neighbor 
(BlastN vs. 
Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 












alpha, palmitoyl- 




36 


<NONE> 


<NONE> 


<NONE> 


NFH_RAT 


NEUROFILAMENT 
TRIPLET H PROTEIN 
(200 KD 

NEUROFILAMENT 
PROTEIN) (NF-H) 
(FRAGMENT) 


2.40E-05 


37 


<NONE> 


<NONE> 


<NONE> 


HUMVWFM 
1 


Human von Willebrand 
factor mRNA, 3* end; 
Von Willebrand factor 
prepropeptide 


1.70E-05 


38 


<NONE> 


<NONE> 


<NONE> 


CGHU2E 


collagen alpha 2(XI) 
chain - human 
(fragment) 


2.00E-06 


39 


<NONE> 


<NONE> 


<NONE> 


A61183 


hypothetical protein 
(sdsB region) - 
Pseudomonas sp. 


4.90E-08 


40 








I MoL i cAj 

T 


HVPHTHPTIf AT 71 1 

KD PROTEIN IN j 
DSK2-CAT8 
INTERGENIC 
REGION>PIR2:S5458 
5 hypothetical protein 
YMR278w - yeast 
(Saccharomyces 
cerevisiae)>GP:SC802 
1X_4 S;cerevisiae 
chromosome XIII 
cosmid 8021; 
Unknown; YM8021;04, 
unknown, len: 622, 
CAI: 0;16, 




41 


<NONE> 


<NONE> 


<NONE> 


MTCY2 10_31 


Mycobacterium 
tuberculosis cosmid 
Y2 10; Unknown; 
MTCY210;31, 
unknown, len: 299 aa, 
slight similarity to 
carboxykinases 


3.10E-10 
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Nearest 
Neighbor 
(BiastN vs. 
Gen bank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


ID 






p 

VALUE 






p 

VALUE 


42 


<NONE> 


<NONE> 


<NONE> 


CEC01G10_5 


Caenorhabditis elegans 
cosmid C01G10, 
complete sequence; 
C01G10;8;CDNA EST 
CEMSC45R comes 
from this 

gene>GP:CEC01G10_ 
5 Caenorhabditis 
elegans cosmid 
C01G10; C01G10;8; 
CDNA EST 
CEMSC45R comes 
from this gene 


2.30E-12 


43 


<NONE> 


<NONE> 


<NONE> 


HSU15779J 


Human p70 (ST5) 
mRNA, alternatively 
spliced, complete cds; 
Differentially 
expressed; alternatively 
spliced 


9.50E-14 


44 


<NONE> 


<NONE> 


<NONE> 


MTCY210_31 


Mycobacterium 
tuberculosis cosmid 
Y210; Unknown; 
M I CYZ1U;3 1, 
unknown, len: 299 aa, 
slight similarity to 
carboxykinases 


1.70E-17 


4S 
tj 


\JSJ 1 tVJJ 


PI i r tvnctf* 1 i 1 1 m 

Ul\siy UDLC1IU111 

discoideum PrlA 
(prlA) mRNA, 
partial cds. 


1 i 




Oanin rprin PPARR 

VJCLl 1 1U 1 CI 1U irAIxD 

gene, partial cds; 
Nuclear receptor C 
domain 


0.95 


46 


Z92832 


Caenorhabditis 
elegans DNA *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
F31D4; HTGS 
phase 1 . 


1 


U93472_l 


Danio rerio PPARB 
gene, partial cds; 
Nuclear receptor C 
domain 


0.94 


47 


L36557 


Oryza sativa 
(clone pRG3) 
repetitive 
element. 


1 


HSU61262J 


Human neogenin 
mRNA, complete cds 


0.89 
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Nearest 






Nearest 








Neighbor 






Neighbor 








(BiastN vs. 






(BlastX vs. 








Genbank) 






Non- 
Redundant 
Proteins) 






SEQ 


ACCESSION 


DESCRIPTION 


P 


ACCESSION 


DESCRIPTION 


P 


ID 






VALUE 






VALUE 


48 


AF005898 


Homo sapiens 

Na,K-ATPase 

beta-3 subunit 

pseudogene, 

complete 

sequence. 


1 


LRP1_CHICK 


LOW-DENSITY 
LIPOPROTEIN 
RECEPTOR- 
RELATED PROTEIN 
1 PRECURSOR (LRP) 
(ALPHA-2- 
MACROGLOBULIN 
KbLbr 1 UK) 
(A2MR)>PIR2:A53102 
LDL receptor-related 
protein / alpha-2- 
macroglobulin receptor 
precursor - 

chicken>GP:GGLRPA 
2MR 1 G;gallus 
mRNAforLRP/alp 


0.85 


49 


U 18795 


Saccharomyces 
cerevisiae 
chromosome V 
cosmids 9669, 
8334,8199, and 
lambda clone 

1 1 cc\ 

1 loU. 


1 


NKC1 SQUA 
C 


BUMETANIDE- 
SENSITIVE SODIUM- 
(POTASSIUM)- 
CHLORIDE ! 
COTRANSPORTER 2 
(NA-K-CL 

b Y Mr UK J JbK )>rli\Z. 

A53491 bumetanide- 
sensitive Na-K-Cl 
cotransporter - spiny 

dOgilSn>Or:oANK.L.L 

M Squalus acanthias 
bumetanide-sensitive 
Na-K-Cl cotransport 
protein (NKCC 


0.73 






Hnmn cjinipnc 

HTGS phase 1, 
54 unordered 


1 


RXFN CI OR 

6 


BOTULINUM 
NEUROTOXIN TYPE 
E, NONTOXIC 


0.71 






pieces. 






COMPONENT>GP:C 












LOENT120J 
C;botulinum gene for 
nontoxic component of 
progenitor toxin, 
complete cds 
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Nearest 
Neighbor 
(BlastN vs. 
Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 

51 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


AC002345 


*** 

SEQUENCING 
IN PROGRESS 
*** Genomic 
sequence from 
Human 17; 
HTGS phase 1, 
10 unordered 
pieces. 


1 


P3K2JDICDI 


PHOSPHATIDYLINO 
SITOL 3-KINASE 2 
(EC 2.7.1.137) (PI3- 
KINASE) (PTDINS-3- 
KINASE) 

(PI3K)>GP:DDU23477 
_1 Dictyostelium 
discoideum 
phosphatidyliriositol- 
4,5-diphosphate 3- 
kinase (PIK2) mRNA, 
complete cds 


0.58 


52 


XI 4253 


Human mRNA 
for cripto protein. 


1 


155651 


noradrenaline 
transporter - 
bovine>GP:BTU09198 
_1 Bos taurus 
noradrenaline 
transporter mRNA, 
complete cds 


0.55 


53 


U23516 


Caenorhabditis 
elegans cosmid 
B0416. 


1 


169024 


MHC sex-limited 
protein - mouse 
(fragment)>GP:MUSM 
HC4AD_1 Mouse class 
III H2-Slp sex-limited 
protein gene, exons 1, 2 
and 3; MHC sex- 
limited protein 


0.47 


54 


AB006698 


Arabidopsis 
man ana genomic 
DNA, 

chromosome 5, 
PI clone: 
MCL19. 


1 


S81293J 


LI {insertion sequence, 
provirus/ [numan 
papillomavirus type 6b 
HPV6b, KP4, Genomic 
Mutant, 121 nt]; 
Authors note this 
reading frame results 
from a 454 bp deletion 
and resulting 


0.25 


55 


K03458 


Human 

immunodeficienc 
y virus type 1, 
isolate Zaire 6, 
vif, tat, rev, env, 
nef genes and 3' 
LTR. 


1 


SI 3383 


hydroxyproline-rich 
glycoprotein - sorghum 


0.24 
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ID 


ACCiL&MUN 


a^jy tut isviu 


Jr 

VALUE 






r 

VALUE 


56 


B26794 


T1016TR TAMU 
Arabidopsis 
thaliana genomic 
clone T1016. 


1 


RK34 PORP 
U 


CHLOROPLAST 50S 

RIBOSOMAL 

PROTErN 

L34>PIR2:S73I11 

ribosomal protein L34 - 

red alga (Porphyra 

purpurea) 

chloroplast>GP:PPU38 
804_4 Porphyra 
purpurea chloroplast 
genome, complete 
sequence; 50S 
ribosomal protein L34 


0.021 


57 


Z98950 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
507115; HTGS 
phase 1. 


1 


D41132 


collagen-related protein 
4 - Hydra 
magnipapillata 
(fragment)>PIR2:S219 
32 mini-collagen - 
Hydra 

sp.>GP:HSNCOL4 1 
HydraN-COL4 
mRNA for mini- 
collagen; No start 
codon 


0.02 


58 


U57057 


Human WD 
protein IR10 
mRNA, complete 
cds. 


1 i 


DMU15602_1 


Drosophila 
melanogaster (zeste- 
white 4) mRNA, 
complete cds; Similar 
to C; elegans B0464;4 
gene product, Swiss- 
Prot Accession Number 
Q03562 


0.019 


59 


U57057 


Human WD 
protein IR10 
mRNA, complete 
cds. 


1 


X A/~\l IPC 

CR2_MOUSE 


COMPLEM ENT 
RECEPTOR TYPE 2 
PRECURSOR (CR2) 
(COMPLEMENT C3D 
RECEPTOR)>PIR2:A4 
3526 complement 
C3d/Epstein-Barr virus 
receptor 2 precursor - 
mouse>GP:MUSCR2A 
A_l Murine 
complement receptor 
type 2 (CR2) mRNA, 
complete cds; 
Complement receptor 
type 


U.UU74 
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Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
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ID 
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P 
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DESCRIPTION 


P 
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j 60 


B65337 


CIT-HSP- 
202IH21.TF 
CIT-HSP Homo 
sapiens genomic 
clone 2021H21. 


i 1 


A38096 


perlecan precursor - 
human>GP:HUMHSP 
G2B1 Human heparan 
sulfate proteoglycan 
(HSPG2) mRNA, 
complete cds 


0.0051 


61 


U84722 


Human vascular 
endothelial 
cadherin mRNA, 
complete cds. 


1 


HSTAFII13J 


H;sapiens mRNA for 
TAFII135;Subunitof 
RNA polymerase II 
transcription factor 
TFIID 


0.0012 


i 62 


L41493 


Avian rotavirus 
(strain turkey 1) 
genomic segment 
4 outer capsid 
protein (VP8*) 
gene. 


1 


Y328 MYCP 
N 


HYPOTHETICAL 
PROTEIN MG328 
HOMOLOG>PIR2:S73 
693 MG328 homolog 
P01_orfl033- 
Mycoplasma 
pneumoniae (ATCC 
29342) 

(SGC3)>GP:MPAE000 
035_2 Mycoplasma 
pneumoniae irom Doses 
442306 to 452472 
(section 35 of 63) of 
the complete genome; 
MG328 homolog, 


0.00015 


63 1 


D63139 


Aeromonas sp. 
gene for 
chitinase, 
complete and 
partial cds. 


1 


MTCY16B7_3 


Mycobacterium 
tuberculosis cosmid 
SCY16B7; Unknown; 
MTCY16B7;03, 
initiation factor, len: 
900, similar at C- 
terminai half to eg 
IF2__BACSU P17889 
initiation factor if-2 
(716 aa), fasta 


6.30E-05 
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P 
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P 
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64 


J04974 


Human alpha-2 
type XI collagen 
mRNA 
(COL11A2). 


1 


GDF6J30VI 
N 


GROWTH/DIFFEREN 
TIATION FACTOR 
GDF-6 PRECURSOR ( 
(CARTILAGE- 
DERIVED 
MORPHOGENETIC 
PROTEIN 2) (CDMP- 
2) 

(FRAGMENT)>PIR2: 
B55452 cartilage- 
derived morphogenetic 
protein 2 precursor - 
bovine 

(fragment)>GP:BTU 1 3 
66 1_1 Bos taurus 
cartilage-derived morp 


1.00E-05 
















AC002394 


Homo sapiens 
Chromosome 16 
BAC clone 
CIT987-SKA- 
211C6 --complete 
genomic 
sequence, 
complete 
sequence. 


1 


CELC14F11_ 

0 


Caenorhabditis elegans 
cosmid C14r 1 1 ; 
Similar to aspartate 
aminotransferase; 
coded for by C; elegans 
cDNA CEMSF95FB; 
coded for by C; elegans 
cDNA yk41e4;3; coded 
for by C; elegans 


4.60E-06 


66 


AB002312 


Human mRNA 

-fXr VTA A(\11A 
TOr ftJAAUJ 14 

gene, partial cds. 


1 


NAT1_YEAS 

T 
I 


N-TERMINAL 

Att I i LI KAiNsr t,t\ 

ASE1 (EC 2.3.1.88) 
(AMINO-TERMINAL, 
ALPHA- AMINO, 
ACETYLTRANSFER 
ASE1) 


1.00E-09 


67 


AC003085 


Human BAC 
clone RG094H21 
from 7q21-q22, 
complete 
sequence. 


1 


DP19_CAEEL 


DPY-19 

PROTEIN>PIR2:S446 
29 £22b7. 10 protein - 
Caenorhabditis 
elegans>GP:CELF22B 
7_9 C;aenorhabditis 
elegans (Bristol N2) 
cosmid F22B7; 
Putative 


4.20E-11 


68 


X55026 


P.anserina 
complete 
mitochondrial 
genome. 


1 


NAT1 YEAS 
T 


N-TERMINAL 
ACETYLTRANSFER 
ASE1 (EC 2.3.1.88) 
(AMINO-TERMINAL, 
ALPHA- AMINO, 
ACETYLTRANSFER 


8.40E-12 
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ACCESSION 
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P i 
VALUE 












ASE1) 




69 


Z95399 


Caenorhabditis 
elegans DNA *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
Y39B6; HTGS 
phase 1 . 


1 


CER06B9_5 


Caenorhabditis elegans 
cosmid R06B9, 
complete sequence; 
R06B9;b; Protein 
predicted using 
Genefinder; 
preliminary prediction 


1.50E-24 


70 


AC002339 


Arabidopsis 

thaliana 

chromosome II 

BACT11A07 

genomic 

sequence, 

complete 

sequence. 


0.99 


POLG BVDV 
S 


GENOME 

POLYPROTEIN>PIRl 
:A44217 genome 
polyprotein - bovine 
viral diarrhea virus 
(strain SD- 

1)>GP:BVDP0LYPR 
0_1 Bovine viral 
diarrhea virus 
polyprotein RNA, 
complete cds; Putative 


1 


71 


Y08559 


B.subtilis urease 
operon and 
downstream 
DNA. 


0.99 


LRJMZAEEL 


LOW-DENSITY 
LIPOPROTEIN 
RECEPTOR- 
RELATED PROTEIN 
PRECURSOR 
(LRP)>PIR2:A47437 
LDL-receptor-related 
protein - 
Caenorhabditis 
elegans>GP:CEF29Dl 
1_2 Caenorhabditis 
elegans cosmid 
F29D1L complete 
sequence; F29D11;1; 
Protein predicted using 
Genefi 


1 
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72 


U67548 


Methanococcus 
jannaschii from 
bases 9862 19 to 
996377 (section 
90 of 150) of the 
complete 
genome. 


0.99 


YB60 YEAS 
T 


HYPOTHETICAL 16.3 
KD PROTEIN IN 
DUR1,2-NGR1 
INTERGENIC 
REGION>PIR2:S4608 
4 probable membrane 
protein YBR210w- 
yeast (Saccharomyces 
cerevisiae)>GP:SCYB 
R210W_1 S;cerevisiae 
chromosome II reading 
frame ORP YBR210w 


1 


73 


U51645 


Plasmodium 
falciparum 
cytidine 
triphosphate 
synthetase gene, 
complete cds. 


0.99 


HPSVRPLJ 


Sin Nombre virus (NM 
H10)RNAL segment 
encoding RNA 
polymerase (L protein), 
complete cds; Viral 
RNA polymerase (L 
protein); 

Putative>GP:HPSVRP 
LA 1 Sin Nombre 
virus (NM Rl 1) RNA 
L segment encoding 
RNA polymerase (L 
protein), complete cds; 
Vir 


0.99 


74 


Z49889 


Caenorhabditis 
elegans cosmid 
T06H11, 
complete 
sequence. 


0.99 


MUSHDPRO 
BJ 


Mouse alternatively 
spliced HD protein 
mRNA, complete cds 


0.021 


75 


Z69374 


Human DNA 
sequence from 
cosmid LI 74G8, 
Huntington's 
Disease Region, 
chromosome 
4pl6.3 contains a 
pair of ESTs. 


0.99 


NCPR YEAS 
T 


NADPH- 

C YTOCHROME P450 
REDUCTASE (EC 
1.6.2.4) (CPR) 


0.017 ' 
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76 


Z35847 


S.cerevisiae 
chromosome II 
reading frame 
ORF YBL086c. 


0.99 


CYPA CAEE 
L 


PEPTIDYL-PROLYL 

CIS-TRANS 

ISOMERASE10(EC 

5.2.1.8) (PPIASE) 

(ROTAMASE) 

(CYCLOPHILIN- 

in^>np*PFT R07s? a 

Caenorhabditis elegans 
cosmid B0252; Similar 
to peptidyl-prolyl cis- 
trans isomerase 
(PPIASE) 

(CYCLOPHILIN)>GP: 
CEU34954J 
Caenorhabditis el 


0.0044 


77 


LJJJJU 


RattiiQ norvpcririic 

glutathione S- 
transferase Yb3 
subunit gene, 
comDlete cds 


0 99 

V.77 


PFT R 148 1 


f'npnrirhahHitic plponnc 
v^acnui iiauuiiJ o cicgaiio 

cosmid R148; Contains 
similarity to drosophila 
DNA-binding protein 
K10 fNIDe8148V 
coded for by C; elegans 
cDNA ykll8ell;5; 
coded for by C; elegans 
cDNA 




78 


Y00324 


Chicken 

vitellogenin gene 
3' flanking 
region. 


0.99 I 


A56922 


transcription factor shn 
- fruit fly (Drosophila 
melanogaster) 


0.0023 


79 


M32659 


D.melanogaster 
Shabl 1 protein 
mRNA, complete 
cds. 


0.99 


OMU25146J 


Oncorhynchus mykiss 
recombination 
activating protein 2 
gene, partial cds 


0.0017 


80 


Z69880 


H.sapiens 
SERCA3 gene 
(partial). 


0.99 


M84D DRO 
ME 


MALE SPECIFIC 
SPERM PROTEIN 
MST84DD>PIR2:S257 
75 testis-specific 
protein Mst84Dd - fruit 
fly (Drosophila 
meianogaster)>GP:DM 
MST84D_4 
D;melanogaster 
Mst84Da, Mst84Db, 
Mst84Dc and Mst84Dd 
genes for put; sperm 
protein 


0.0011 
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Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
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P 
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81 


M991 66 


Escherichia coli 
Tip repressor 
binding protein 
(wrbA) gene, 
complete cds. 


0.99 


MTU88962J 


Mycobacterium 
tuberculosis unknown 
protein gene, partial cds 


6.50E-07 


82 


X99257 


R.norvegicus 
mRNA for lamin 
C2. 


0.99 


MIU68729J 


Meloidogyne incognita 
cuticle preprocollagen 
(col-2) mRNA, 
complete cds; Putative 


1.60E-09 


83 


AC002432 


Human BAC 
clone RG317G18 
from 7q31, 
complete 
sequence. 


0.98 


1FMDC 


Foot and mouth disease 
virus type c-s8cl, chain 
C - foot and mouth 
disease virus type c- 
s8cl expressed in 
hamster kidney cells 


0.14 


84 


Z34799 


Caenorhabditis 
elegans cosmid 
F34D10, 
complete 
sequence. 


0.98 


MMU57368J 


Mus musculus EGF 
repeat transmembrane 
protein mRNA, 
complete cds; Notch 
like repeats; notch 2 


0.0028 


85 


B 15207 


344E15.TV 
CIT978SKA1 
Homo sapiens 

344E15. 


0.98 


POLG HCVJ 
6 


GENOME 
POLYPROTEIN 
(CONTAINS: CAPSID 
PROTFIN C (CORE 
PROTEIN); MATRIX 
PROTEIN 
(ENVELOPE 
PROTEIN M); MAJOR 
ENVELOPE 
PROTEIN E; 
NONSTRUCTURAL 
PROTEINS NS1.NS2, 
NS4A AND NS4B; 
HELICASE(NS3); 
RNA-DIRECTED 
RNA POLYMERASE 
(EC 2.7.7.48) 
(NS5))>PI 


0.00083 
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86 


AC002412 


*** 

SEQUENCING 
IN PROGRESS 
*** Human 
Chromosome X; 
HTGS phase 1,2 
unordered pieces. 


0.98 


KDG1 ARAT 
H 


DIACYLGLYCEROL 
KINASE 1 (EC 
2.7.1.107) 
(DIGLYCERIDE 
KINASE) (DGK 1) 
(DAG KINASE 
1)>PIR2:S71467 

Hi ncvlolvppml Vitijicp 

(EC 2.7.1.107) 
ATDGK1 - 
Arabidopsis 
thaliana>GP'ATHATD 
GK1_1 Arabidopsis 
thaliana mRNA for 
diacylglycerol kinase, 
complete c 


0.00024 


87 


X57010 


Human COL2A1 
gene for collagen 
II alpha 1 chain, 
exons E2-E15. 


0.98 


D80005J 


Human mRNA for 
KIAA0183 gene, 
partial cds 


5.90E-10 


88 


M83093 


Neurospora 

dependent protein 
kinase (cot-1) 
gene, complete 
cds. 


0.98 


YA53_SCHP 
O 


HYPOTHETICAL 24.2 
KD PROTFIN 
C13A11.03IN 
CHROMOSOME 
I>GP:SPAC13A11_3 
S;pombe chromosome 1 
cosmid cl3Al 1; 
Unknown; 
SPAC13A11;03, 
unknown, len: 210 


3.00E-22 


89 


U96271 


Helicobacter 
pylori heat shock 
protein 70 
(hsp70) gene, 
complete cds. 


0.97 


SLMEN6J 


S;latifolia mRNA for 
Men-6 

protein>GP:SLMEN6_ 
1 S;latifolia mRNA for 
Men-6 protein 


0.43 


90 


U49944 


Caenorhabditis 
elegans cosmid 
C39E6. 


0.97 


RON HUMA 
N 


MACROPHAGE 
STIMULATING 
PROTEIN RECEPTOR 
PRECURSOR (EC 
2.7.1.1 12)>PIR2:I38 18 
5 protein-tyrosine 
kinase (EC 2.7.1.112), 
receptor type ron - 
human>GP:HSRONJ 
H;sapiens RON mRNA 
for tyrosine kinase; 
Putative 


0.034 
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91 


Y09255 


B.cereus dnal 
gene, partial. 


0.97 


CELT05C1_5 


Caenorhabditis elegans 
cosmid T05C1; Coded 
for by C; elegans 
cDNA yk30f6;3; coded 
for by C; elegans 
cDNA yk34fl0;3 


0.00043 


92 


AC002413 


*** 

^FOUFNOING 
IN PROGRESS 
*** Human 
Chromosome X; 
HTGS phase 1,2 
unordered pieces. 


0.96 


CELC44E4_5 


Caenorhabditis elegans 

similarity to the 
drosophila hyperplastic 
disc protein 
(GB:L 14644); coded 
for by C; elegans 
cDNA yk49h6;5; coded 
for by C; elegans 
cDNA 


1 


93 j 


U41625 


Caenorhabditis 
elegans cosmid 
K03A1. 


0.96 


HMGC HUM 
AN 


HIGH MOBILITY 
GROUP PROTEIN 
HMGI- 

C>PIR2:JC2232 high 
mobility group I-C 
phosphoprotein - 
human>GP:HSHMGIC 

frS 1 Human hioh- 

mobility group 
phosphoprotein isoform 
I-C (HMGIC)gene, 
exon 

5>GP:HSHMGICP_1 
H;sapiens mRNA for 
HMGI-C 

protein>GP:HSHMGIC 


1 


94 


Z82202 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
34P24; HTGS 
phase 1 . 


0.96 


YTH3 CAEE 
L 


HYPOTHETICAL 75.5 
KD PROTEIN 
C14A4.3 IN 
CHROMOSOME 
II>GP:CEC14A4J3 
Caenorhabditis elegans 
cosmid C14A4, 
complete sequence; 
C14A4;3; Weak 
similarity with a B; 
Flavum translocation 
protein (Swiss Prot 
accession number 
P38376) 


0.73 
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95 


AL008734 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
324M8; HTGS 
phase 1. 


0.96 


S25299 


extensin precursor 
^cione i om - 
tomato>GP:TOMEXT 
ENB_1 L;esculentum 
extensin (class II) gene, 
complete cds 


0.0004 


yo 


L15388 


Human G 
protein-coupled 
receptor kinase 
(GRK5) mRNA, 
complete cds. 


0.96 


HUMCOL7A1 
X_l 


Homo sapiens (clones: 
CW52-2, CW27-6, 
CW15-2, CW26-5, 11- 
67) collagen type VII 
intergenic region and 
(LajL/A l ) gene, 
complete cds 


4.60E-06 


97 


X97384 


A.thaliana atran3 
gene. 


0.95 


<NONE> 


<NONE> 


<NONE> 


98 


M62505 


Human C5a 
anaphylatoxin 
receptor mRNA, 
complete cds. 


0.95 ; 


RIPB_BRYDI 


RIBOSOME- 
INACTIVATING 
PROTEIN BRYODIN 
(RRNA N- 

GLYCOSIDASE)(EC 
3.2.2.22) 

(FRAGMENT)>PIR2: 
S16491 rRNAN- 
glycosidase (EC 
3.2.2.22) bryodin-red 
bryony (fragment) 


0.83 


99 


D28778 


Cucumber mosaic 
virus KNA I tor 
la, complete 
sequence. 


0.95 


POLS_RUBV 
M 


STRUCTURAL 
rULYrKU 1 blN 
(CONTAINS: 
NUCLEOCAPSID 
PROTEIN C; 
MEMBRANE 
GLYCOPROTEINS El 
AND 

E2)>PIR1:GNWVR3 
structural polyprotein - 
rubella virus (strain 
M33)>GP:TORUB24S 
_1 Rubella virus 24S 
subgenomic mRNA for 
structural proteins El, 
E2 and C; 


0.00037 


100 


AF016202 


Homo sapiens 
immunoglobulin 
heavy chain 
CDR3 gene, 


0.93 


HSU79716J 


Human reelin (RELN) 
mRNA, complete cds 


1 
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partial cds. 










101 


Z68303 


Caenorhabditis 
elegans cosmid 
ZK809, complete 
sequence. 


0.93 1 


HS5HT4SAR__ 
1 


H;sapiens mRNA for 
serotonin 4SA receptor 
(5-HT4SA-R) 


0.87 


102 


X03049 


E. coli DNA 
sequene 5' to 
origin of 
replication oriC. 


0.93 


S37594 


mucin - human 
(fragment) 


0.0019 


103 


M32659 


D.melanogaster 
Shabll protein 
mRNA, complete 
cds. 


0.93 


S38480 


nonstructural protein - 
rubella 

virus>GP:RVM33NP_ 
1 Rubella virus M33 
RNAfora 

nonstructural protein; 
Nonstructural protein 
genes 


2.30E-06 


104 


D88687 


Human mRNA 
for KM- 102- 
derived 
reductase-like 
factor, complete 
cds. 


0.93 


BAT3_HUMA 
N 


LARGE PROLINE- 
RICH PROTEIN BAT3 
(HLA-B- 
ASSOCIATED 
TRANSCRIPT 
3)>PIR2:A35098 MHC 
class III 

histocompatibility 
antigen HLA-B- 
associated transcript 3 - 
human>GP:HUMBAT 
3AJ Human HLA-B- 
associated transcript 3 
(BAT3) mRNA, 
complete 

cds>GP:HUMBAT3 


8.70E-07 


105 


D 16847 


Mouse mRNA for 
stromal cell 
derived protein- 1, 
complete cds. 


0.93 


S52796 


prpL2 protein - human 
(fragment)>GP:HSPRP 
L2_l H;sapiens mRNA 
for PRPL-2 protein 


3.20E-08 
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DESCRIPTION 


P 
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P 
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106 


D90915 


Synechocystis sp. 

PCC6803 

complete 

genome, 17/27, 

2137259- 

2267259. 


0.92 


YEK9 YEAS 
T 


HYPOTHETICAL 53.9 
KD PROTEIN IN 
AFG3-SEB2 
INTERGENIC 
REGION>PIR2:S5047 
7 hypothetical protein 
YER019w- yeast 
(Saccharomyces 
cerevisiae)>GP:SCE95 
37_20 Saccharomyces 
cerevisiae chromosome 
Vcosmids 9537, 9581, 
9495, 9867, and 
lambda clone 5898 


5.90E-05 


107 


AJ001101 


Mus musculus 
mRNA for 
gClqBPgene. 


0.92 


DMU58282J 


Drosophila 
melanogaster Bowel 
(bowl) mRNA, 
complete cds; 
Transcription factor; 
C2H2 zinc finger 
protein; zinc fingers 
have extensive 
sequence similarity to 
Drosophila odd- 

orwi JJfJtLi 


3.50E-05 


108 


X57108 


Human gene for 
cerebroside 
sulfate activator 
protein, exons 10- 
14. 


0.92 


S69032 


hypothetical protein 
YPR144c- yeast 
(Saccharomyces 
cerevisiae)>GP:YSCP9 
659_17 Saccharomyces 
cerevisiae chromosome 
XVI cosmid 9659; 
Yprl44cp; Weak 
similarity near C- 
terminus to RNA 
Polymerase beta 
subunit (Swiss Prot; 
accession number 
P11213) 


4.30E-21 
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Neighbor 
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P 
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P 


ID 






VALUE 






VALUE 


109 


D14635 


Caenorhabditis 
elegans DNA for 
EMB-5. 


0.91 


YM13 YEAS 
T 


PUTATIVE ATP- 
DEPENDENT RNA 
HELICASE 
YMR128W>PIR2:S53 
058 probable 
membrane protein 
YMR128w- yeast 
(Saccharomyces 
cerevisiae)>GP:SC955 
3_4 S;cerevisiae 
chromosome XI 11 
cosmid 9553; 
unicnown, YiviyDDj,u% 
probable ATP- 
dependent RNA 
helicase, len: 


0.69 


110 


B55500 


CIT-HSP- 
387J2.TFB CIT- 
HSP Homo 
sapiens genomic 
clone 387J2. 


0.91 


U97553_79 


Murine herpesvirus 68 
strain WUMS, 
complete genome; 
Unknown 


0.00016 


111 


X03049 


E. coli DNA 
sequene 5' to 
origin of 
replication oriC. 


0.9 


POL_MLVAV 


POL POLYPROTEIN 
(PROTEASE (EC 
i.4.23.-;, iCbvbKob 
TRANSCRIPTASE 
(EC 2.7.7.49); 
RIBONUCLEASEH 
(EC 

3.1.26.4))>PIR1:GNM 

\7CWf r\ni nnlvnmtptn - 
VVJ V pUl ysJiyyn UIC1U 

AKV murine leukemia 
virus 


0.0019 


112 


U91327 


Human 

\*1 11 \Jll J VJiUill w 

12pl5BAC clone 
CIT987SK-99D8 
complete 
sequence. 


0.89 


JC5568 


serine protease (EC 

J. f III - JC1 1 Cl C 1 Cl 

marcescens 


1 


113 


X 13295 


Rat mRNA for 
alpha-2u 
globulin-related 
protein. 


0.89 


MNGPOLYJ 


Mengo virus 
polyprotein genome, 
complete cds withe 
repeats 


1 


114 


Z78415 


Caenorhabditis 
elegans cosmid 
C17G1, complete 
sequence. 


0.89 


AB000121J 


Mouse mRNA for 
TBPIP, complete cds; 
TBP1 interacting 
protein 


0.39 
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P 
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ACCESSION 
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P 

VALUE 


115 


AC002308 


♦ ♦ ♦ 

SEQUENCING 
IN PROGRESS 
*** Human 
Chromosome 
22ql 1 BAC 
Clone 1000e4; 
HTGS phase 1, 
26 unordered 
pieces. 


0.88 


YLK2 CAEE 
L 


HYPOTHETICAL 
122.7 KD PROTEIN 
D 1044.2 IN 
CHROMOSOME 
III>GP:CELD1044_4 
Caenorhabditis elegans 
cosmid D1044 


0.0037 


116 


AC002073 


Human PAC 
clone DJ515N1 
from 22qlL2- 
q22, complete 
sequence. 


0.88 


S28499 


probable finger protein 
- rat>GP:RNZFP_l 
R;norvegicus mRNA 
for putative zinc finger 
protein 


1.10E-31 


117 


Z83848 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
57A13;HTGS 
phase 1. 


0.87 


NDL DROM 
E 


SERINE PROTEASE 
NUDEL PRECURSOR 
(EC 3.4.21.- 
)>PIR2:A57096 nudel 
protein precursor - fruit 
fly (Drosophila 
melanogaster)>GP:DM 
U29 153_1 Drosophila 
melanogaster nudel 
(ndl) mRNA, complete 
cds; Serine protease; 
Soma dependent gene 
required matern 


1 


118 




C o t* n r\ rli 0 Y\/\ 1 1 i c 

^acnornauuiub 
elegans cosmid 
K06A1. 


ft R7 


t\ r uxj iuo_j 


I_I r\rn r\ conipnc i^llfO 
nuiuu oapiCIlS UllVX 

kinase (CLK2), 
propinl, cotel, 
glucocerebrosidase 
(GBA), and metaxin 
genes, complete cds; 
metaxin pseudogene 
and glucocerebrosidase 
pseudogene; and 
thrombospondin3 
(THBS3) 


0 21 


119 


Z68181 


H.vulgaris 
mRNA for 
elongation factor 
EF1 -alpha. 


0.87 


RABCY450C 
J 


Rabbit cytochrome P- 
450 gene, clone pP- 
450PBc3, 3' end 


0.14 
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Proteins) 
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ACCESSION 


DESCRIPTION 


P 


ACCESSION 
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P 


ID 
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VALUE 


120 


AC000033 


Homo sapiens 
chromosome 9, 
complete 
sequence. 


0.87 


VWF CANF 
A 


VON WILLEBRAND 
FACTOR 

PRECURSOR>GP:DO 
GVWGJ Canis 
familiaris von 
Willebrand factor 
mRNA, complete cds 


0.036 






i^ae no ma ou i u s 
elegans cosmid 
K06A1. 


U.50 




K^j\i" i — cysidiin-reidicu 
protein [rats, Wistar 
albino, mRNA Partial, 
213 nt];Cystat in- 
related protein; 

ivlCLIlUU. IsUIll/CpiUal 

translation supplied by 
author; This sequence 
comes from Fig; 


O fid 


122 


Z89651 


F.rubripes GSS 
sequence, clone 
090I24cD5. 


0.86 


CPU65981J 


Cryptosporidium 
parvum P-ATPase gene 
(CppA-El)gene, 
complete cds; Putative 
calcium-ATPase 


0.6 


123 


Z94055 


Human DNA 
sequence from 
PAC24M15on 
chromosome 1. 
Contains 
tenascin-R 

^resir IlllJlJ, Co 1 . 


0.86 


GLTB SYNY 
3 


FERREDOXIN- 
DEPENDENT 
GLUTAMATE 
SYNTHASE 1 (EC 
1.4.7.1) (FD- 
GOGAT)>P1R2:S6022 

O giUlCUUalv 2>j 11 LI loot 

(ferredoxin) (EC 
1.4.7.1) gltB- 
Synechocystis sp. (PCC 
6803)>GP:D90902_66 
Synechocystis sp; 
PCC6803 complete 
genome, 4/27, 402290- 
524345; Gluta 


0.03 


124 


Z49250 


Human DNA 
sequence from 
cosmid HW2, 
Huntington's 
Disease Region, 
chromosome 
4pl6.3. 


0.86 


TRSCAPSID 
1 


Tobacco ringspot virus 
capsid protein gene, 
complete cds 


3.00E-06 
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P 
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Z92855 


Caenorhabditis 
elegans DNA *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
Y48C3; HTGS 
phase 1. 


0.84 


AE000809_8 


Methanobacterium 
thermoautotrophicum 
from bases 161632 to 
172569 (section 15 of 
148) of the complete 
genome; Aspartyl-. 
tRNA synthetase; 
Function Code:10;07- 
Metabolism of 


1 


126 


AC002340 


*** 

SEQUENCING 
IN PROGRESS 
*** Arabidopsis 
thaliana TAMU' 
BAC T11J7 
genomic 
sequence near 
marker 'm283'; 
HTGS phase 1,2 
unordered pieces. 


0.83 


CET01E8_3 


Caenorhabditis elegans 
cosmidT01E8, 
complete sequence; 
T01E8;3; Similar to 1- 
phosphatidylinositol- 
4,5-bisphosphate 
phosphodiesterase; 
cDNA EST CEESG02F 
comes from this gene; 


0.86 


127 


AL008716 


Human DNA 
sequence *** 
SEQUENCING 

TXT PRfVlRFQQ 
11N rKUUi\Xloo 

*** from clone 
206C7; HTGS 
phase 1. 


0.83 j 


HIVU51189_5 


HIV-1 clone 93th253 
from Thailand, 
complete genome; Tat 
protein 


0.86 


128 


AC002340 


*** 

SEQUENCING 
IN PROGRESS 
*** Arabidopsis 
thaliana TAMU' 
BAC Til J7 
genomic 
sequence near 
marker 'm283'; 
HTGS phase 1,2 
unordered pieces. 


0.83 


S60257 


meltrin alpha - 
mouse>GP:MUSMAB 
_1 Mouse mRNA for 
meltrin alpha, complete 
cds 


0.0013 
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s 


EQ 
ID 
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DESCRIPTION 


P 
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P 

VALUE 


129 


Z83848 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
57A13;HTGS 
phase I. 


0.82 


AROl PNEC 
A 


PENTAFUNCTIONAL 
AROM 

POLYPEPTIDE 
(CONTAINS: 3- 
DEHYDROQUINATE 
SYNTHASE (EC 
4.6.1.3), 3- 

DEHYDROQUINATE 
DEHYDRATASE (EC 
4.2.1.10) (3- 
DEHYDROQUINASE) 
, SHIKIMATE 5- 
DEHYDROGENASE 
(EC 1.1.1.25), 
SHIKIMATE KINASE 
(EC 2.7.1.71), AND 
EPSP SYNTHASE (E 


0.0098 


130 


AF029308 


Homo sapiens 
chromosome 9 
duplication of the 
T cell receptor 
beta locus and 
tripsinogen gene 
families. 


0.8 


CELZK84J 


Caenorhabditis elegans 
cosmid ZK84; Final 
exon in repeat region; 
similar to long tandem 
repeat region of 
sialidase 

(SP:TCNA TRYCR, 
P23253)and 
neurofilament H 
protein; coded for by C; 
elegans 


2.00E-08 


131 


AC002458 


Human BAC 
clone RG098M04 
from 7q2I-q22, 
complete 
sequence. 


0.78 


IGF2_PIG 


INSULIN-LIKE 
GROWTH FACTOR II 

VJ A Vv » T A A A A £A\ \^ A IV A A 

PRECURSOR (IGF- 
II)>GP:SSIGF2_1 
S;scrofa mRNA IGF2 
for insulin-like-growth 
factor 2; Insulin- like- 
growth factor 2 
preproprotein 


0.44 


132 


Z83843 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
368A4; HTGS 
phase 1. 


0.78 


PAR51AJ 


P;tetraurelia51A 
surface protein gene, 
complete cds 


0.0014 
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P 
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133 


X03021 


Human gene for 
granulocyte- 
macrophage 
colony 

stimulating factor 
(GM-CSF). 


0.78 


CEF57B1_3 


Caenorhabditis elegans 
cosmid F57B1, 
complete sequence; 
F57B1;3; Protein 
predicted using 
Genefinder; similar to 
collagen 


2.20E-05 


134 


Z74825 


S.cerevisiae 
chromosome XV 
reading frame 
ORF YOL083w. 


0.77 


SYLM SCHP 
O 


PUTATIVE LEUCYL- 
TRNA 

SYNTHETASE, 
MITOCHONDRIAL 
PRECURSOR (EC 
6.1.1.4) (LEUCINE-- 
TRNA 

LIGASE)>PIR2:S6248 
6 hypothetical protein 
SPAC4G8.09 - fission 
yeast 

(Schizosaccharomyces 
pombe)>GP:SPAC4G8 
_9 S;pombe 
chromosome I cosmid 
c4G8; Unknown; 


0.96 


135 


Z74825 


S.cerevisiae 
chromosome XV 
reading frame 
ORF YOL083w. 


0.77 


RNU59809J 


Rattus norvegicus 
mannose 6- 
phosphate/insul in-like 
growth factor 11 
receptor (M6P/IGF2r) 
mRNA, complete cds; 
Also termed IGF- 
II/Man 6-P receptor, 
MPR CI-MPR 


0.01 


136 


U80445 


Caenorhabditis 
elegans cosmid 
C50F2. 


0.76 


S28499 


probable finger protein 
- rat>GP:RNZFPJ 
Rjnorvegicus mRNA 
for putative zinc finger 
protein 


1.10E-31 


137 


Z78545 


Caenorhabditis 
elegans cosmid 
M03B6, complete 
sequence. 


0.75 ! 


RRU73586J 

- 


Rattus norvegicus 
Fanconi anemia group 
C mRNA, complete 
cds; Fanconi anemia 
group C protein; 
Similar to human FAC 
protein, GenBank 
Accession Numbers 
X66893 andX66894 


0.023 
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138 


Z97630 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 

phase 1. 


0.74 


HSMSHREC 
A J 


H;sapiens mRNA for 
MSH receptor; Author- 
given protein sequence 
is in conflict with the 
conceptual translation 


1 0.036 


139 


AF007269 


Arabidopsis 
thaliana BAC 
1G002N01. 


0.71 


HSU95090J 


Homo sapiens 
chromosome 1 9 cosmid 
F 19541, complete 
sequence, r iyj*n_i, 
Hypothetical (partial) 
protein similar to 
proline oxidase 


0.16 


140 


AC002393 


Mouse 

r> A P0JMI-T19 
d/w^z. o't n i z 

Chromosome 6, 

complete 

sequence. 


0.7 


RNLTBP2J 


Rattus norvegicus 
rni\iN/\ lur l, i or-z jik.c 
protein; Latent TGF- 
beta binding protein-2 
like protein 


4.40E-05 


1/11 


D1 CTTO 


lAAflQ TV 

CIT978SKA1 
Homo sapiens 
genomic clone A- 
344G08. 


u.o / 


n\yfQP\/I 0 O 
UlVloC V L,Z_Z 


urosopnua 

melanogaster sevenless 
mRNA; Put; sevenless 
protein (AA 1-2510) 




; 142 


D13748 


Human mRNA 
for eukaryotic 
initiation factor 
4AL 


0.66 


MMU53563_1 


Mus musculus Brgl 
mRNA, partial cds; N- 
terminal region of the 
protein 


0.00016 


143 


S45791 


band 3-related 
protein=renal 
anion exchanger 
AE2 homolog 
[rabbits, New 
Zealand White, 
ileal epithelial 
cells, mRNA, 
jvoh nij. 


0.66 


POLS RUBV 
R 


STRUCTURAL 
POLYPROTEIN 
(CONTAINS: 
NUCLEOCAPSID 
PROTEIN C; 
MEMBRANE 
GLYCOPROTEINS El 
AND 

structural polyprotein - 
rubella virus (strain 
RA27/3 

vaccine)>GP:RUBCE2 
1 1 Rubella virus 
RA27/3 RNA for 
capsid, E2 and El 
proteins; Poly 


5.60E-05 


144 


M22462 


Chicken protein 
p54 (ets-1) 


0.66 


HSHP8PROT 
J 


H;sapiens mRNA for 
HP8 protein; HP8 


2.00E-06 
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mRNA, complete 
cds. 






peptide 




145 


U27999 


Human clone 
pDEL52A13 
HLA-C region 
cosmid 52 
genomic survey 
sequence. 


0.65 


CA18 HUMA 
N 


COLLAGEN ALPHA 
l(VIII) CHAIN 
PRECURSOR 
(ENDOTHELIAL 
CULLAubN)>rlR2:S 
15435 collagen alpha 
l(VHI) chain precursor 

human>GP:HSCOL8A 
i_i Human uuloA i 
mRNA for alpha 
l(VIII) collagen 


5.70E-06 


146 


M54787 


N.crassa mating 
type a-1 protein 
(mt a-1) gene, 
exons 1-3. 


0.64 


150717 


vacuolar H+-ATPase A 
subunit - chicken 
(fragment)>GP:GGU22 
078_1 Gallus gallus 
vacuolar H+-ATPase A 
subunit gene, partial 
cds 


0.0046 


147 


AC002094 


Genomic 
sequence from 
Human 17, 
complete 
sequence. 


0.63 


PVPVA1J 


P;vivax pval gene 


0.1 


148 


in?7fii 

UJZ l\Ji 


nacmopniius 
influenzae from 
bases 165345 to 
176101 (section 
16 of 163) of the 
complete 
genome. 


U.Oj 


17 A D t~l I_I A CT 

fAnu HAbJ 
N 


J-UXUACYL-[ALYL- 
CARR1ER PROTEIN] 
REDUCTASE (EC 
1.1.1.100) (3- 
KETOAC YL- ACYL 
CARRIER PROTEIN 
REDUCTASE)>PIR2: 
D64051 3-oxoacyl- 
acyl-carrier-protein] 
reductase (EC 
1.1.1.100)- 
Haemophilus 
influenzae (strain Rd 
KW20)>GP:HIU32701 
_7 Haemophilus 


2.00b- 12 


149 


Z37159 


T.brucei serum 
resistance 
associated (SRA) 
mRNA for VSG- 
ike protein. 


0.61 


<NONE> 


<NONE> 


<NONE> 
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150 


ArUZ/oOD 


Mus musculus 
Major 

Histocompatibilit 
y Locus class II 
region. 


U.Ol 


A>OM4 


chromokinesin - 
chicken>GP:GGU1830 
9_1 Gallus gallus 
chromokinesin mRNA, 
complete cds 




151 


U40938 


Caenorhabditis 
elegans cosmid 
D1009. 


0.61 


YA53 SCHP 
O 


HYPOTHETICAL 24.2 
KD PROTEIN 
C13A11.03IN 
CHROMOSOME 
I>GP:SPAC13A11_3 
S;pombe chromosome I 
cosmid cl3All; 
Unknown; 
SPAC13A11;03, 
unknown, len:210 


1.90E-24 


152 


116670 


Sequence 1 from 
patent US 
5476781. 


0.59 


CELF21 F8 J7 


Caenorhabditis elegans 
cosmid F21F8; Similar 
to eukaryotic aspartyl 
proteases 


0.39 


153 


Z84468 


Human DNA 
sequence *** 
SEQUENCING 

TXT DD/""W"lDUOO 

*** from clone 
299D3;HTGS 
phase 1 . 


0.59 


CLG1 YEAS 
T 


CYCLIN-LIKE 
PROTEIN 

CLG1>PIR2:S37607 
cycl in-like protein 
YGL215w- yeast 
(Saccharomyces 
cerevisiae)>GP:SCYG 
L215W__1 Sjcerevisiae 
chromosome VII 
reading frame ORF 
YGL215w>GP:YSCC 
LG1CPRJ 
Saccharomyces 
cerevisiae cyclin-like 
protein (CLGl)gene 


0.0015 


154 


U00054 


Caenorhabditis 
elegans cosmid 
K07E12. 


0.57 


<NONE> 


<NONE> 


<NONE> 


155 


M21207 


Synthetic SV40 T 
antigen mutant 
pseudogene, 3 f 
end. 


0.57 


1CJL2 


cathepsin L (EC 
3.4.22.15) mutant 
(F(78P)L, C25S, 
T110A,E176G, 
D178G), fragment 2- 
human ( 


0.43 
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156 


AF020282 


Dictyostelium 
discoideum 
DG2033 gene, 
partial cds. 


0.56 


AC002 125_4 


Homo sapiens DNA 
from chromosome 19- 
cosmid F25965, 
genomic sequence, 
complete sequence; 
F25965J; 

Hypothetical 35;3 kDa 
protein similar to 
GTPase-activating 
proteins and orO from 


0.6 


157 


| M86352 


Stigmatella 
aurantiaca reverse 
transcriptase (163 
RT) gene, 
complete cds. 


0.56 


AC002398_4 


Human DNA from 
chromosome 19- 
speciflc cosmid 
F25965, genomic 
sequence, complete 
sequence; F25965_3; 
Hypothetical 96 kDa 
human protein similar 
to alpha chimaerin; 
Hypothetical 
protem>GP:AC002398 
_4 Human DNA from 
chromosome 19- 
specific cosmi 


4.50E-06 


158 


AC003101 


*** 

SEQUENCING 
IN PROGRESS 
*** Homo 
sapiens 

chromosome 17, 
clone 

HRPC41C23; 
HTGS phase 1, 
33 unordered 
pieces. 


0.54 


<NONE> 


<NONE> 


<NONE> 


159 


B12117 


F5L15-T7 IGF 
Arabidopsis 
thaliana genomic 
clone F5L15. 


0.54 


CEF32H2__5 


Caenorhabditis elegans 
cosmid F32H2, 
complete sequence; 
F32H2;5; Similarity to 
Chicken fatty acid 
synthase (SW:P12276); 
cDNAESTykl6c2;5 
comes from this gene; 
cDNA EST ykll3h6;5 
comes 


1 
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Nearest 
Neighbor 
(BlastN vs. 
Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


160 


AE000664 


Mus musculus 
TCR beta locus 
from bases 
250554 to 501917 
(section 2 of 3) of 
the complete 
sequence. 


0.54 


CET01G9_6 


Caenorhabditis elegans 
cosmid TO 1G9, 
complete sequence; 
T01G9;4; CDNA EST 
yk29b7;5 comes from 
this gene 


0.84 


. 161 


B12117 


F5L15-T7IGF 
Arabidopsis 
thaliana genomic 
clone F5L15. 


0.54 


A39718 


nicotinic acetylcholine 
receptor alpha chain - 
marbled electric ray 
(fragments) 


0.27 ; 


162 


Z71261 


Caenorhabditis 
elegans cosmid 
F21C3, complete 
sequence. 


0.5 


KDGE DRO 
ME 


EYE-SPECIFIC 
DIACYLGLYCEROL 
KINASE (EC 
2.7.1.107) (RETINAL 
DEGENERATION A 
PROTEIN) 
(DIGLYCERIDE 
KINASE) 

(DGK)>GP:DRODAG 
K_J r ruit tly mRNA 
for diacylglycerol 
kinase, complete cds 


4.60E-05 


163 


M61831 


Human S- 
aoenosyinomocys 
teine hydrolase 
(AHCY)mRNA, 
complete cds. 


0.49 


P2C2_ARAT 

14 


PROTEIN 

(EC 3.1.3.16) 
(PP2C)>PIR2:S55457 
phosphoprotein 
phosphatase (EC 
3.1.3.16) 2C- 
Arabidopsis 
thaliana>GP:ATHPP2 
CA_1 Arabidopsis 
thaliana mRNA for 
protein phosphatase 2C 


5.60E-08 


164 


U42608 


Glycine max 
clathrin heavy 
chain mRNA, 
complete cds. 


0.48 


<NONE> 


<NONE> 


<NONE> 
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Nearest 
Neighbor 
(BlastN vs. 
Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

*7 A ¥ FTP 

VALUE 


165 


Z93042 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
6B17; HTGS 
phase 1 . 


0.47 


PYRD BACS 

! U 


DIHYDROOROTATE 

DEHYDROGENASE 

(EC 1.3.3.1) 

(DIHYDROOROTATE 

OXIDASE) 

(DHODEHASE)>PIRl 

:H39845 

dihydroorotate oxidase 
(EC 1.3.3.1) -Bacillus 
subtilis>GPN:BSUB00 
09_25 Bacillus subtilis 
complete genome 
(section 9 of 21): from 
1598421 to 1807200; 


0.002 


166 


AC000044 


Human 
Chromosome 
22ql3 Cosmid 
Clone p76el0, 
complete 
sequence. 


0.47 


MATK MAR 
PO 


PROBABLE INTRON 
MATURASE>PIR2:A 
05034 hypothetical 
protein 370i - liverwort 
(Marchantia 
polymorpha) 
chloroplast>GP:CHMP 
XX_21 Liverwort 
Marchantia 

polymorpha chloroplast 
genome DNA; 
ORF370i 


0.0011 ; 


167 


X51508 


Rabbit mRNA for 
aminopeptidase N 
(partial). 


0.47 


S45361 


LRR47 protein - fruit 
fly (Drosophila 
melanogaster)>GP:DM 
LRR47J 

D;melanogaster mRNA 

IOr LKK4 / 


5.30E-07 


168 


Z67035 


Rsapiens DNA 
segment 
containing (CA) 
repeat; clone 
AFM323yfl; 
single read. 


0.45 


JQ2246 


22.5K cathepsin D 
inhibitor protein 
precursor - 

potato>GP:POTCATH 
D_l Potato cathepsin D 
inhibitor protein 
mRNA, complete cds 


0.79 


169 


Z93042 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
6B17; HTGS 
phase 1. 


0.44 


SMU31768J 


Schistosoma mansoni 
elastase gene, 3045 bp 
clone, complete cds 


0.0022 
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Nearest 
Neighbor 
(BlastN vs. 
Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DJbiSCKlr I ION 


V 

VALUE 


170 


LI 1172 


Plasmodium 
falciparum RNA 
polymerase I 
gene, complete 
cds. 


0.43 


HUMPKD1G0 
8_1 


Homo sapiens 
polycystic kidney 
disease (PKDl)gene, 
exons 43-46; Polycystic 
kidney disease 1 
protein 


1 


171 


Z95889 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
211A9; HTGS 
phase 1. 


0.43 


A09811J 


R;norvegicus mRNA 
for BRL-3A binding 
protein; Author-given 
protein sequence is in 
conflict with the 
conceptual translation 


0.00083 


172 


U32772 


Haemophilus 
influenzae from 
bases 954819 to 
966363 (section 
87 of 163) of the 
complete 
genome. 


0.43 


YPT2 CAEE 
L 


HYPOTHETICAL 21.6 
KD PROTEIN 
F37A4.2 IN 
CHROMOSOME 
III>PIR2:S44639 
F37A4.2 protein - 
Caenorhabditis 
elegans>GP:CELF37A 
4_8 Caenorhabditis 
elegans cosmid F37A4 


2.50E-28 


173 


Z99281 


Caenorhabditis 
elegans cosmid 
Y57G11C, 
complete 
sequence. 


0.42 


PTU19464J 


Paramecium tetraurelia 
outer arm dynein beta 
heavy chain gene, 
complete cds 


1 


174 


X04571 


Human mRNA 
for kidney 
epidermal growth 
factor (EGF) 
precursor. 


0.42 


YEK9 YEAS 
T 


HYPOTHETICAL 53.9 
KD PROTEIN IN 
AFG3-SEB2 
INTERGENIC 
REGION>PIR2:S5047 
7 hypothetical protein 
YER019w- yeast 
(Saccharomyces 
cerevisiae)>GP:SCE95 
37__20 Saccharomyces 
cerevisiae chromosome 
Vcosmids 9537, 9581, 
9495, 9867, and 
lambda clone 5898 


0.99 



195 



WO 99/33982 



Table 2 



PCT/US98/27610 





Nearest 






Nearest 








Neighbor 
(BlastN vs. 
Genbank) 






Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 






SEQ 


ACCESSION 


DESCRIPTION 


P 


ACCESSION 


DESCRIPTION 


P 


ID 






VALUE 






VALUE 


175 


U32772 


Haemophilus 
influenzae from 

11141 UWl 1 a *4A W 11 Will 

bases 9548 19 to 
966363 (section 
87 of 163) of the 
complete 
genome. 


0.41 


YPT2_CAEE 
L 


HYPOTHETICAL 21.6 
KD PROTEIN 
F37A4.2 IN 
CHROMOSOME 
III>PIR2:S44639 
F37A4.2 protein - 
Caenorhabditis 
elegans>GP:CELF37A 
4_8 Caenorhabditis 
elegans cosmid F37A4 


7.80E-21 


176 


AC002053 


Human 
Chromosome 
9p22 Cosmid 
Clone 92f5, 
complete 
sequence. 


0.4 


HSU33837J 


Human glycoprotein 
receptor gp330 
precursor, mRNA, 
complete cds 


1 


177 


U88309 


Caenorhabditis 
T23B3. 


0.4 


DROMTTGN 
C 1 


Drosophila 
m p! anop'S^ter 

mitochondrial 
cytochrome c oxidase 
subunit I (COI) gene, 5* 
end, Trp-, Cys-, and 
Tyr-tRNA genes, 
NADH dehydrogenase 
subunit 2 (ND2) gene, 
3' end 


0.99 


178 


M34025 


Human fetal Ig 
heavy chain 
variable region 
(clone M44) 
mRNA, partial 
cds. 


0.39 


DNA2 YEAS 
T 


DNA REPLICATION 
HELICASE 
DNA2>P1R2:S48904 
probable purine 
nucleotide-binding 
protein YHR164c- 
yeast (Saccharomyces 
cerevisiae)>GPN:YSC 
H9986J 
Saccharomyces 
cerevisiae chromosome 
VIII cosmid 9986; 
Dna2p: DNA 
replication helicase; 
YHR1640GP: 


1 


179 


AC002395 


Homo sapiens; 
HTGS phase 1, 
127 unordered 
pieces. 


0.39 


VV_MUMPE 


NONSTRUCTURAL 
PROTEIN V 
(NONSTRUCTURAL 
PROTEIN NS1) 


0.11 
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Nearest 
Neighbor 
(BlastN vs. 
Gen bank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


180 


AC003101 


*** . 

SEQUENCING 
IN PROGRESS 
*** Homo 
sapiens 

chromosome 1 7 
clone 

HRPC41C23; 
HTGS phase 1, 
33 unordered 
pieces. 


0.39 


YLK2 CAEE 
L 


HYPOTHETICAL 
122.7 KD PROTEIN 
D1044.2 IN 
CHROMOSOME 
III>GP:CELD1044_4 

CapnorhahHitis plppans 

cosmid D1044 


0.0001 


181 


Z54335 


Human DNA 
sequence from 
cosmid L17A9 
Huntington's 
Disease Region, 
chromosome 
4pl6.3. Contains 
VNTR and a CpG 
island. 


0.39 


HUMNFAT3 
A_l 


Homo sapiens NF-AT3 
mRNA, complete cds 


1.60E-06 


182 


U95743 


Homo saniens 
chromosome 16 
BAC clone 
CIT987-SK65D3, 
complete 
sequence. 


0.38 


CEZC434 6 


Caenorhabditis eleeans 

VUvlivl VlvgUJU 

cosmid ZC434, 
complete sequence; 
ZC434;6; CDNA EST 
CEESO02F comes 
from this gene; cDNA 
EST CEESS60F comes 
from this gene 


0.18 


183 


AC001229 


Sequence of BAC 

F5I14from 

Arabidopsis 

thaliana 

chromosome 1, 

complete 

sequence. 


0.34 


HSOCAMJ 


H;sapiens mRNA for 
immunoglobulin-like 
domain-containing 1 
protein 


0.051 


184 


X01703 


Human gene for 
alpha-tubulin (b 
alpha 1). 


0.33 


NTC3 MOUS 
E 


NEUROGENIC 
LOCUS NOTCH 3 
PROTEIN>PIR2:S453 
06 notch 3 protein - 
mouse>GP:MMNOTC 
_1 M;musculus mRNA 
for Notch 3 


0.012 
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Nearest 
Neighbor 
(BlastN vs. 
Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


185 


Z82189 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
170A21;HTGS 

piidbc 1 . 


0.31 


LG106J 


Lemna gibba 
negatively light- 
regulated mRNA 
(Lgl06); Second 
longest ORF (2) 


0.27 


186 


Z98051 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
501A4; HTGS 
phase 1 . 


0.3 


S34960 


NADH dehydrogenase 
(ubiquinone) (EC 
1.6.5.3) chain 5- 
Crithidia oncopelti 
mitochondrion 
(SGC6)>GP:MICOCN 
NR_3 Crithidia 
oncopelti mitochondrial 
ND4,ND5,COI, 12S 

riDUbUIIlal rvi>/\ gCIlcb 

for NADH 

dehydrogenase subunit 
4/5, cytochrome 
oxidase subun 


0.25 


187 


Z98749 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
449017; HTGS 
phase 1. 


0.3 j 


SCKC_LEIQ 
H 


CHARYBDOTOXIN 
(CHTX) (CHTX- 
LQ1)>PIR2:A60963 
charybdotoxin 1 - 
scorpion (Leiurus 
quinquestriatus)>3D:2 
CRD Charybdotoxin 
(nmr, 12 structures) - 
scorpion (Leiurus 
quinquestriatus) 


0.12 


188 


X96763 


C.albicans CDC4 
gene. 


0.29 


CECC4_1 


Caenorhabditis elegans 
cosmid CC4, complete 
sequence; CC4;a; 
Protein predicted using 
Genefinder; 
preliminary prediction 


1.30E-17 
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Nearest 
Neighbor 
(BlastN vs. 
Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SE< 
ID 


3 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


189 


U38804 


Porphyra 

purpurea 

chloroplast 

genome, 

complete 

sequence. 


0.28 


HIVHCDR3C 
J 


Human 

immunodeficiency 
virus type 1 heavy- 
chain complemetarity- 
determining region 3 
mRNA (clone 11), 
partial cds; Heavy- 
chain complementarity- 
determining region 3 
(CDR3) from HIV 
gpl20- 

>GP:HIVHCDR3I_1 
Human 

immunodeficiency 
virus type 1 he 


1 


190 


U20657 


Human ubiquitin 
protease (Unph) 
proto-oncogene 
mRNA, complete 
cds. 


0.28 


HSU20657J 


Human ubiquitin 
protease (Unph) proto- 
oncogene mRNA, 
complete cds 


5.60E-12 


191 


AC002037 


Human 

Chromosome 1 1 

Overlapping 

Cosmids 

cSRL72g7 and 

cSRL140b8, 

complete 

sequence. 


0.27 


VRP1 YEAS 
T 


VERPROLIN>GP:SC 
VERPRLJ 
S;cerevisiae (A364) 
gene for verprolin 


2.00E-11 


192 


U58748 


Caenorhabditis 
elegans cosmid 
ZK180. 


0.27 


EXLP TOBA 
C 


PISTIL-SECIFIC 
EXTENSIN-LIKE 
PROTEIN 
PRECURSOR 
(PELP)>PIR2:JQ1696 
pistil extensin-like 
protein precursor (clone 
pMG15)- common 
tobacco>GP:NTPMG 1 
5_1 N;tabacum mRNA 
for pistil extensin like 
protein 


4.10E-12 


193 


Z68013 


Caenorhabditis 
elegans cosmid 
W02H3, 
complete 
sequence. 


0.26 


<NONE> 


<NONE> 


<NONE> 
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Nearest 






Nearest 








Neighbor 
(BlastN vs. 
Genbank) 






Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 






SEQ 


ACCESSION 


DESCRIPTION 


P 


ACCESSION 


DESCRIPTION 


P 


ID 






VALUE 






VALUE 


194 


AF017042 


Dictyostelium 
discoideum LTR- 
retrotransposon 
Skipper, partial 
genomic 
sequence, 5' end. 


0.26 


SPBC31F10 1 
4 


S;pombe chromosome 
II cosmid c31F10; 
Hypothetical protein; 
SPBC31F10;l4c, 
unknown, len:1586aa, 
some similarity eg; to 
YJR140C, 
YJ9H_YEAST, 
P47171, involved in 
cell cycle regulation 


1 


195 


B03174 


cSRL-16e2-u 
cSRL flow sorted 
Chromosome 1 1 
specific cosmid 
Homo sapiens 
genomic clone 
cSRL-16e2. 


0.26 


CELC30E1J7 


Caenorhabditis elegans 
cosmid C30E1 


0.38 


196 


X70810 


E.gracilis 
chloroplast 
complete 
genome. 


0.25 


CEK10H10_8 


Caenorhabditis elegans 
cosmid K10H10, 
complete sequence; 
K10H10;k; Protein 
predicted using 
Genefinder; 
preliminary prediction 


0.98 


197 


U80024 


Caenorhabditis 
elegans cosmid 
C18B10. 


0.25 


MMAF001794 
_1 


Mus musculus Treacher 
Collins Syndrome 
protein (Tcofl)mRNA, 
complete cds; Putative 
nucleolar 

phosphoprotein; similar 
to Homo sapiens 
Treacher Collins 
syndrome TCOF1 
protein 

encoded>GP:MMAF00 
1794_1 Mus musculus 
Treacher Collins 
Syndrome p 


0.017 
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Nearest 
Neighbor 
(BlastN vs. 
Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


198 


AC000591 


Drosophila 
melanogaster 
(subclone 9 j*3 
from PI DS01486 
(D32)) DNA 
sequence, 
complete 
sequence. 


0.25 


YHGE ECOL 
I 


HYPOTHETICAL 64.6 
KD PROTEIN IN 
MRCA-PCKA 
INTERGENIC 
REGION 

(F574)>PIR2:E65135 
hypothetical 64.6 kD 
protein in mrcA-pckA 
intergenic region - 
Escherichia coli (strain 
K- 

12)>GP:ECAE000415_ 
7 Escherichia coli , 
mrcA, yrfE, yrfF, yrfG, 
yrfH, yrfl 


0.00068 














199 


AC000591 


Drosophila 
melanogaster 
(subclone 9_g3 
from PI DS01486 
(D32)) DNA 
sequence, 
complete 
sequence. 


0.25 


YHGE ECOL 
I 


HYPOTHETICAL 64.6 
KD PROTEIN IN 
MRCA-PCKA 
INTERGENIC 
REGION 

(F574)>PIR2:E65135 
hypothetical 64.6 kD 
protein in mrcA-pckA 
intergenic region - 
Escherichia coli (strain 
K- 

12)>GP:ECAE000415_ 
7 Escherichia coli , 
mrcA, yrfE, yrfF, yrfG, 
yrfH, yrfl 


0.00068 














200 


Z99571 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
388N15;HTGS 
phase 1 . 


0.24 


YA53 SCHP 
O 


HYPOTHETICAL 24.2 
KD PROTEIN 
C13A11.03IN 
CHROMOSOME 
I>GP:SPAC13A11J 
S;pombe chromosome I 
cosmid cl3All; 
Unknown; \ 
SPAC13A11;03, 
unknown, len: 210 


0.017 


201 


U00672 


Human 
interleukin-10 
receptor mRNA, 
complete cds. 


0.24 


TFDP00900 


- Polypeptides entry 
for factor Oct-2.5 


1.00E-05 
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Nearest 
Neighbor 
(BlastN vs. 
Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 

202 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


AC003061 


*** 

SEQUENCING 
IN PROGRESS 
*** Mouse 
Chromosome 6 
BAC clone 
b245cl2; HTGS 
phase 2, 8 
ordered pieces. 


0.23 


CGI HUMA 
N 


CGI 

PROTEIN>GP:HSU46 
023 J Human Xq28 
mRNA, complete cds; 
Orf 


0.00078 


203 


AF009420 


Homo sapiens 
microsatellite 
sequence in the 
HNF3agene. 


0.22 


PN0675 


collagen alpha 

1 (XVIII) chain - mouse 

(fragment)>GP:MUSC 

OLLAGJ Mouse 

mRNA for collagen, 

partial cds 


0.00072 


204 


B18861 


F20C18-Sp6 IGF 
Arabidopsis 
thaliana genomic 
clone F20C18. 


0.22 


TFDP00659 


- Polypeptides entry 
for factor PR 


0.0003 


205 


U00672 


Human 
interleukin-10 
receptor mRNA, 
complete cds. 


0.22 


TFDP00900 


- Polypeptides entry 
for factor Oct-2.5 


1.00E-05 


206 


X52105 


Dictyostelium 
discoideum SP60 
gene for spore 
coat protein. 


0.18 


<NONE> 


<NONE> 


<NONE> 


207 


L07628 


Saccharopolyspor 
a erythraea 
insertion 

sequence IS 1136, 
copy B, 3* end. 


0.17 


D88764J 


Rana catesbeiana 
mRNA for alpha 2 type 
I collagen, complete 
cds 


0.00021 


208 


Z49631 


S.cerevisiae 
chromosome X 
reading frame 
ORF YJR131w. 


0.16 


YSCDAL1A 
1 


Saccharomyces 
cerevisiae alantoinase 
(DAL1) gene, complete 
cds 


1 


209 


Z87893 


F.rubripes GSS 
sequence, clone 
043C17aB8. 


0.16 


CELC27A12 
8 


Caenorhabditis elegans 
cosmid C27A12; 
Partial CDS; this gene 
begins in the 
neighboring clone; 
coded for by C; elegans 
cDNAykl27fl;3; 
coded for by C; elegans 
cDNA ykl27fl;5 


1.30E-O7 



202 



WO 99/33982 



Table 2 



PCT/US98/27610 



III!! If 
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Neighbor 
(BlastN vs. 
Genbank) 


Nearest 
Neighbor 
(BlastXvs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


210 


U92852 


Rhoiptelea 
chiliantha 
maturase (matK) 
gene, chloroplast 
gene encoding 
chloroplast 
protein, complete 
cds. 


0.15 


SEU40259_5 


Staphyloccous 
epidermidis 
trimethoprim resistance 
plasmid pSK639; Orf53 


0.95 


211 


X62620 


B.mori Abd-A 
gene homeobox. 


0.15 


ATAP22J6 


Arabidopsis thaliana 
DNA chromosome 4, 
ESSA I AP2 contig 
fragment No; 2; 
Hypothetical protein; 
Similarity to NADH 
dehydrogenase, 
Chondrus crispus; 
MNOS:S59107 


0.75 


212 


J02079 


epstein-barr virus 
simple repeat 
array (ir3). 


; 0.15 


A38346 


ultra-high-sulfur 
keratin 1 - 

mouse>GP:MUSSERl 
_1 Mouse serine 1 ultra 
high sulfur protein 
gene, complete cds; 
Putative 


7.50E-05 


213 


M35027 


Vaccinia virus, 

complete 

genome. 


0.14 


MTF1 FUSN 
U 


MODIFICATION 
MET HYLASE FNUDI 
(EC 2.1.1.73) 
(CYTOSINE- 
SPECIFIC 

METHYLTRANSFER 
ASE FNUDI) 
(M.FNUDI) 


0.87 


214 


AC003058 


*** 

SEQUENCING 
IN PROGRESS 
*** Arabidopsis 
thaliana 'IGF 
BAC 'F27F23' 
genomic 
sequence near 
marker 
•CIC06E08'; 
HTGS phase 1,8 
unordered pieces. 


0.14 


HEXAJ31CDI 


BETA- 

HEXOSAMINIDASE 
ALPHA CHAIN 
PRECURSOR (EC 
3.2.1.52) (N-ACETYL- 
BETA- 

GLUCOSAMINIDASE 
) (BETA-N- 
ACETYLHEXOSAMI 
NIDASE)>PIR2:A307 
66 beta-N- 

acetylhexosaminidase 
(EC 3. 2. 1.52) A 
precursor - slime mold 
(Dictyostelium 


0.006 
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SEQ 
ID 


ACCESSION 
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P 

VALUE 


ACCESSION 


DESCRIPTION 


p 

VALUE 












discoideum)>GP:DDlN 
AGAJ D;d 




215 


AC001229 


Sequence ofBAC 

F5I14from 

Arabidopsis 

thaliana 

chromosome 1, 

complete 

sequence. 


0.13 


A49281 


pol protein - simian T- 
cell lymphotropic virus 
type 1, STLV-1 (isolate 
Bab34) 

(fragment)>GP:STVB 
ABPOLA_l Simian T- 
cell leukemia virus 
PCR derived (pol) 
gene, partial sequence 
BAB34POL; Bases 
4779-4918 EMBL 
ATK numbering 
system; BAB34POL 


0.77 


216 


U46067 


Capra hircus 
beta-mannosidase 
mRNA, complete 
cds. 


0.12 


S70663 


lectin heavy chain, N- 
acetylgalactosamine- 
specific - Entamoeba 
histolytica 

(fragment)>GP:EHU33 
443_1 Entamoeba 
histolytica GalNAc 
lectin heavy subunit 
(hgl4) gene, partial cds; 
N-acetylgalactosamine 
adherence lectin heavy 
subunit 


0.8 


217 


ACO00380 


*** 

SEQUENCING 
IN PROGRESS 
*** Human 
Chromosome 3 
pacpDJ70ill; 
HTGS phase 1,2 
unordered pieces. 


0.12 


ATFCA8J9 


Arabidopsis thaliana 
DNA chromosome 4, 
ESSA I contig 
fragment No; 8; 
Unnamed protein 
product 


0.64 
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(BlastN vs. 
Gen bank) 
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(BlastX vs. 
Non- 
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Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


218 


X61207 


A.brasilense hisB, 

H, A,FandE 

genes for 

imidazole 

glycerolphosphat 

e dehydratase, 

glutamine 

amidotransferase, 

phosphorybosilfo 

rmimino-5- 

amino- 

phosphorybosil- 
4- 

imidazolecarboxa 
mide isomerase, 
cyclase and 
phosphorybosil- 
AMP- 

cyclohydrolase. 


0.12 


OCCL02J 


Ojcircumcincta colost- 
2 gene; Cuticular 
collagen 


0.0074 


219 


AFO 14259 


HIV-1 Patient 
1088 from 
Edinburgh, MA- 
pl7 (gag) gene, 
partial cds. 


0.11 


DMU88570J 


Drosophila 
melanogaster CREB- 
binding protein 
homolog mRNA, 
complete cds; CBP 


1 


220 


AC000636 


Drosophila 
melanogaster 
(subclone 2 ell 
from PI DS07660 
(D44))DNA 
sequence, 
complete 
sequence. 


0.11 


A64829 


hypothetical protein in 
dmsC 3' region - 
Escherichia coli (strain 
K- 

12)>GP:ECAE000192_ 
1 Escherichia coli , 
ycaD, ycaK, pflA, pflB, 
focA genes from bases 
944908 to 955952 
(section 82 of 400) of 
the complete genome; 
Hypothetical protein in 
dmsC 


0.051 


221 


AC002428 


Human BAC . 
clone GS039E22 
from 5q31, 
complete 
sequence. 


0.11 


HSNMYC2J 


Human N-myc gene 
exon 2; Put; N-myc 
protein (aa 1-263) (953 
is 1st base incodon) 


0.00014 


222 


L40949 


Homo sapiens 
(clone AT7-5eu) 
opioid-receptor- 
like protein 
mRNA, 5' end. 


0.11 


CEUNC93_2 


C;elegans unc-93 gene; 
Protein 2 


1.20E-13 
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SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


223 


AL008636 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
722E9; HTGS 
phase 1. 


0.1 


XELCOL2A1 
AJ 


Xenopus laevis alpha- 1 
collagen type II' 
mRNA, complete cds; 
Alpha-1 type IV 
collagen 


2.60E-06 


224 


D86993 


Human (lambda) 
DNA for 
immunoglobulin 
light chain. 


0.1 


CELM02B7_2 


Caenorhabditis elegans 
cosmid M02B7 


1.80E-09 


225 


AC002539 


Homo sapiens 
chromosome 17, 
clone 195o20, 
complete 
sequence. 


0.098 


MTCY7D1 1 
17 


Mycobacterium 
tuberculosis cosmid 
Y7D11; Unknown; 
MTCY07Dll;17c; 
unknown, len: 1 86 aa, 
FASTA best: Ql 0390 
Y009_MYCTU 
hypothetical 3 1 ;0 KD 
protein MTCY190;09C 
(299 aa) opt: 355 z- 
score: 316;8 


0.026 


226 


M88165 


Human inter- 
alpha-trypsin 
inhibitor light 
chain (ITI) gene, 
exon 1 . 


0.096 


A54161 


ryanodine-binding 
protein alpha form - 
bullfrog>GP:D21070J 
Rana catesbeiana 
mRNA for bullfrog 
skeletal muscle calcium 
release channel 
(ryanodine receptor) 
alpha isoform(RyRl ), 
complete cds; 
Ryanodine receptor 
alpha isoform 


1 


227 


Z92851 


Caenorhabditis 
elegans DNA *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
Y39G8; HTGS 
phase 1 . 


0.082 


CYA7 BOVI 
N 


ADENYLATE 
CYCLASE, TYPE VII 
(EC 4.6.1.1) (ATP 
PYROPHOSPHATE- 
LYASE) (ADENYLYL 
CYCLASE) 


0.3 
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K 
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Nearest 
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(BlastN vs. 
Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 
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DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


228 


L00638 


Arabidopsis 
thaiiana ubiquitin 
conjugating 
enzyme exons 2- 
4. 


0.072 


NUCM TRY 
BB 


NADH-UBIQUINONE 
OXIDOREDUCTASE 
49 KD SUBUNIT 
HOMOLOG (EC 
1.6.5.3) (NADH 
DEHYDROGENASE 
SUBUNIT 7 
HOMOLOG)>PIR2:A3 
5693 NADH 
dehydrogenase (EC 
1.6.99.3) chain 7- 
Trypanosoma brucei 
mitochondrion (SGC6) 


0.24 


229 


U49169 


Dictyostelium 
discoideum V- 
ATPase A 
subunit (vatA) 
mRNA, complete 
cds. 


0.071 


MMU65594J 


Mus musculus Brca2 
mRNA, complete cds; 
Similar to human breast 
cancer susceptibility 
gene BRCA2; Allele: 
wild type; putative 
tumor suppressor 


1 


230 


AF001549 


Homo sapiens 
chromosome 16 
BAC clone 
CIT987SK- 
270G1 complete 
sequence. 


0.07 


PM22 HUMA 
N 


PERIPHERAL 
MYELIN PROTEIN 22 
(PMP- 

22)>PIR2:JN0503 
peripheral myelin 
protein 22 - 
human>GP:HUMGAS 
3X_1 Human 
peripheral myelin 
protein 22 (GAS3) 
mRNA, complete 
cds>GP:HUMPMP22_ 
1 Human peripheral 

mRNA, complete 
cds>GP:HUMPMP22 


0.0078 


231 


L36829 


Mus musculus 
alphaA-crystallin- 
binding protein I 
(AlphaA- 
CRYBPl)gene, 
complete cds. 


0.066 


<NONE> 


<NONE> 


<NONE> 


232 


AC000159 


*** 

SEQUENCING 
IN PROGRESS 
*** Human BAC 
Clone llq!3; 


0.058 


CEZK863J 


Caenorhabditis elegans 
cosmid ZK863, 
complete sequence; 
ZK863;2; Similar to 
collagen 


1 
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Nearest 
Neighbor 
(BlastX vs. 
Non- 
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Proteins) 


SEQ 
ID 
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DESCRIPTION 


P 

XT A ¥ TT1? 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VAT IIP 






HTGS phase 1, 
10 unordered 
pieces. 










233 


AC000159 


*** 

SEQUENCING 
IN PROGRESS 
*** Human BAC 
Clone llql3; 
HTGS phase 1, 
10 unordered 
pieces. 


0.058 


CAC2 HAEC 
O 


CUTICLE 
COLLAGEN 2C 
(FRAGMENT)>GP:H 
AECOL2CJ 
H;contortus collagen 
2C mRNA, 3'end 


1.20E-08 


234 


Z23908 


H. sapiens 
(D5S630)DNA 
segment 
containing (CA) 
repeat; clone 
AFM268zd9; 
single read. 


0.057 


VEU34999J 


Venezuelan equine 
encephalitis virus 
nonstructural and 
structural polyprotein 
genes, complete cds; 
Nonstructural 
polyprotein; Internal 
stop codon, 

readthrough occurs 5% 
of the time 


0.0002 


235 


B21875 


T3E8-Sp6TAMU 
Arabidopsis 
thaliana genomic 
clone T3E8. 


0.055 


YRR2 CAEE 
L 


HYPOTHETICAL 91.1 
KD PROTEIN R144.2 
IN CHROMOSOME 
III>GP:CELR144_7 
Caenorhabditis elegans 
cosmid R144; Coded 
for by C; elegans 
cDNA CEESP84R; 
coded for by C; elegans 
cDNA yk23c4;5; coded 
for by C; elegans 
cDNA yk44f9;5; coded 
for by C; eleg 


0.68 


236 


Z98303 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
140H19; HTGS 
phase 1. 


0.048 


AC002330J 


Arabidopsis thaliana 
BACT10P11, 
complete sequence; 
Putative zinc-finger 
protein; C2H2 Zn- 
finger signature from 
position 80 to 100 
[CEICNKGFQRDQNL 
QLHRRGH] 


0.99 
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Genbank) 
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SEQ 
ID 
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P 
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ACCESSION 


DESCRIPTION 


P 
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237 


D49911 


Thermus 
thermophilus 
UvrA gene, 
complete cds. 


0.044 


APP1 MOUS 
E 


AMYLOID-LIKE 
PROTEIN 1 
PRECURSOR 
(APLP)>PIR2:A46362 
amyloid precursor-like 
protein - 

mouse>GP:MUSAPLP 
_1 Mouse amyloid 
precursor-like protein 
mRNA, complete cds 


8.90E-06 


238 


D49911 


Thermus 
thermophilus 
UvrA gene, 
complete cds. 


0.044 


MMCOL18A1 
1_2 


Mus musculus alpha- 
l(XVIII) collagen 
(COL 18A1) gene, 
exons 40-43, complete 
cds 


1.60E-06 


239 


X78119 


P.amygdalus, 
Batsch (Texas) 
prul mRNA. 


0.042 


CA44 HUMA 
N 


COLLAGEN ALPHA 
4(IV) CHAIN 
PRECURSOR>PIRl:C 
GHU1B collagen alpha 
4(IV) chain precursor - 
human>GP:HSCOL4A 
4_1 H;sapiens mRNA 
for collagen type IV 
alpha 4 chain; Type IV 
collagen alpha 4 chain 


2.00E-06 


240 


U72877 


Rana catesbeiana 
L-epinephrine 
transporter 
mRNA, complete 
cds. 


0.041 


YRR6 MYCC 
A 


HYPOTHETICAL 33.0 
KD PROTEIN IN 
LICA 3'REGION(ORF 
R6)>PIR2:S42125 
hypothetical protein 3 - 
Mycoplasma 
capricolum 
(SGC3)>GP:MYCRP 
MH_6 M; capricolum 
rpmH, rnpA and licA 
gene; OrfR6 


0.0008 


241 


L39891 


Homo sapiens 
polycystic kidney 
disease- 
associated protein 
(PKDl)gene, 
complete cds. 


0.04 f 


MUC2 HUM 
AN 


MUCIN 2 
(INTESTINAL 
MUCIN 2) 
(FRAGMENTS) 


5.90E-05 


242 


L40390 


Candida glabrata 
ERG3 gene, 
complete cds. 


0.039 


G01763 


atrophin-1 - 
human>GP:HSU23851 
_1 Human atrophin-1 
mRNA, complete cds 


9.00E-07 
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Genbank) 
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Neighbor 
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Non- 
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ID 
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DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 




Dig 1 I J 


TAMU 
Arabidopsis 
thaliana genomic 
clone T2L16. 


0.038 


CELZK1248 
14 


Caenorhabditis eleeans 
cosmid ZK 1248 


1.60E-18 


244 


AC000030 


00175, complete 
sequence. 


0.033 


ATFCA8_40 


Arabidopsis thaliana 
DNA chromosome 4, 
ESSA I contie 
fragment No; 8; 
GlyceroI-3-phosphate 
permease homolog; 
Similarity to glycerol- 
3-phosphate permease - 
Haemophilus 
influenzae 


0.63 


245 


B10738 


F13G15-Sp6 IGF 
Arabidopsis 
thaliana genomic 
clone F13G15. 


0.032 


D87521_l 


Mus musculus DNA- 
PKcs mRNA, complete 
cds 


0.21 


246 


AF024503 


Caenorhabditis 
eleeans cosmid 
F31F4. 


0.03 


138344 


titin - human 


1 


247 


Z49888 


Caenorhabditis 
elegans cosmid 
F47A4, complete 
sequence. 


0.027 


KSU52064_1 


Kaposi's sarcoma- 
associated herpes-like 
virus ORF73 homolog 
gene, complete cds; 
Herpesvirus saimiri 
ORF73 

homolog>GP:KSU756 
98J78 Kaposi's 
sarcoma-associated 
herpesvirus long unique 
region, 80 putative 
ORFs and kaposin 
gene, complete cds; OR 


3.40E-10 


248 


Z83822 


Human DNA 
sequence from 
PAC 306D1 on 
chromosome X 
contains ESTs. 


0.025 


GRSB BACB 
R 


GRAMICIDIN S 
SYNTHETASE II 
(GRAMICIDIN S 
BIOSYNTHESIS 
GRSB PROTEIN) (EC 
6.-.-.-) 


1 


249 


Z94161 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
N102C10; HTGS 


0.025 


SI 6323 


hypothetical protein - 
Arabidopsis 
thaliana>GP:ATHBl_l 
A;thaliana homeobox 
gene Athb-1 mRNA; 
Open reading frame 


0.0079 
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Nearest 
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SEQ 
ID 
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P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 






phase 1. 










250 


AC002094 


Genomic 
sequence from 
Human 17 
complete 
sequence. 


0.021 


S57447 


HPBRII-7 protein - 
human>GP:HSHPBRII 
4 1 H;sapiens HPBRII- 
4 

mRNA>GP:HSHPBRII 
7_1 H;sapiens HPBRII- 
7 gene 


8.20E-08 


251 


D79994 


Human mRNA 
for KIAA0172 
eene oartial cds 


0.021 


CER10H10J 


Caenorhabditis elegans 
cosmidRlOHIO, 
complete sequence; 
R11A8;7; Protein 
predicted using 
Genefinder; Similarity 
to Mouse ankyrin (PIR 
Acc; No; S37771); 
cDNA EST CEESX25F 
comes from this gene; 


7.00E-16 


252 


Z97635 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
438L4; HTGS 
phase 1. 


0.017 


CELW05H7_4 


Caenorhabditis elegans 
cosmid W05H7 


0.24 


253 


X84996 


X.laevis mRNA 
for selenocysteine 
tRNA acting 
factor (Staf). 


0.017 


JN0786 


integrin beta-4 chain 
precursor - mouse 


0.088 


254 


AC002543 


Human BAC 
clone RG300C03 
from 7q3 1.2, 
complete 
sequence. 


0.013 


MZLMTCYT 
BTJ 


Mendozellus isis 
mitochondrial NADH 
dehydrogenase, and 
cytochrome b genes, 3' 
end, and transfer RNA- 
Ser gene; This codes 
for the last 43 amino 
acids of NADH 
dehydrogenase subunit 
1 followed 


0.044 
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DESCRIPTION 


P 
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255 


U10401 


Caenorhabditis 
elegans cosmid 
T20B12. 


0.012 


MMMHC29N 
12 


Mus musculus major 
histocompatibility 
locus class HI 
region:butyrophilin-like 
protein gene, partial 
cds; Notch4,PBX2, 
RAGE, lysophatidic 
acid acyl transferase- 
alpha, palmitoyl- 


0.069 j 


256 


LI 4593 


Saccharomyces 
cerevisiae protein 
phosphatase 
(PTC 1) gene, 
complete cds. 


0.011 


D86995J 


Human (gene 1) DNA 
for phosphatase 2C 
motif, partial cds 


2.20E-14 


257 


U62317 


Chromosome 
22ql3 BAC 
Clone 
CIT987SK- 
384D8 complete 
sequence. 


0.0093 


P2Y8 XENL 
A 


P2Y PUR1NOCEPTOR 
8 

(P2Y8)>GP:XLP2Y8_ 
1 X;laevis mRNA for 
P2Y8 nucleotide 
receptor 


0.89 


258 


D29655 


Pig mRNA for 
UMP-CMP 
kinase, complete 
cds. 


0.0075 


AF004858J 


Mus musculus platelet 
activating factor 
receptor mRNA, partial 
cds; PAF-receptor 


1 


259 


AF002992 


Homo sapiens 
cosmid from 
Xq28, complete 
sequence. 


0.0054 


FBN1 BO VI 
N 


FIBRILLIN 1 
PRECURSOR>PIR2:A 
55567 fibrillin I - 
bovine>GP:BOVXAA 
AA_1 Bos taurus 
mRNA, complete cds; 
Putative 


0.0004 


260 


B20752 


T19M2-T7 
TAMU 
Arabidopsis 
thaliana genomic 
clone T19M2. 


0.0043 


HSVT1IEPJ 


Feline herpesvirus type 
1 gene for immediate 
early protein, complete 
cds; Feline herpesvirus 
type 1 immediate early 
protein 


3.90E-05 
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261 


AB006699 


Arabidopsis 
thaliana genomic 
DNA, 

chromosome 5, 
PI clone: MDJ22. 


0.0037 


YHV5 YEAS 
T 


HYPOTHETICAL 
143.6 KD PROTEIN 
IN SPO16-REC104 
INTERGENIC 
REGION>PIR2:S4675 
4 hypothetical protein 
YHR155w- yeast 
(Saccharomyces 
cerevisiae)>GPN:YSC 
H9666_15 
Saccharomyces 
cerevisiae chromosome 
VIII cosmid 9666; 
Yhrl55wp; Similar to 
Sip3p (Snf 


0.077 


262 


Z99128 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
422H11;HTGS 
phase 1. 


0.0032 


ALU1 HUM 
AN 


!!!! ALU 
SUBFAMILY J 
WARNING ENTRY 
!!!! 


0.0087 


263 


B21848 


T2D2-Sp6 
TAMU 
Arabidopsis 
thaliana genomic 
clone T2D2. 


0.0031 


B31794 


mdm-1 protein (clone 
cl03)- mouse 


1.00E-05 


264 


L33853 


Human germline 
immunoglobulin 
kappa chain 
variable region 
(Vk-IV subgroup) 
for anti-B- 
amyloid 

autoantibodies in 

Alzheimer's 

disease. 


0.0027 


B45550 


cytochrome b homolog 
- Plasmodium yoelii 


0.99 


265 


B36863 


HS-1042-A1- 
F01-MR.abi CIT 
Human Genomic 
Sperm Library C 
Homo sapiens 
genomic clone 
Plate=CT 824 
Col=l Row=K. 


0.0027 


YQK4 CAEE 
L 


HYPOTHETICAL 64.3 
KD PROTEIN 
C56G2.4 IN 
CHROMOSOME 
1II>GP:CELC56G2_2 
Caenorhabditis elegans 
cosmid C56G2 


0.81 
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P 
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P 
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266 


AC003041 


*** 

SEQUENCING 
IN PROGRESS 
*** Homo 
sapiens 

chromosome 17, 
clone 

HCIT307A16; 
HTGS phase 1, 
10 unordered 
pieces. 


0.0024 


GLB4 LAMS 
P 


GIANT 

HEMOGLOBIN AIV 
CHAIN 

(FRAGMENT)>PIR2: 
SO 1810 hemoglobin 
AIV -tube worm 
(Lamellibrachia sp.) 
(fragment) 


0.94 


267 


AC002315 


Mouse BAC- 
146N21 

Chromosome X 

contains 

iduronate-2- 

sulfatase gene; 

complete 

sequence. 


0.0022 


MG42 TARM 
A 


SRY-RELATED 
PROTEIN MG42 
(FRAGMENT)>PIR3:I 
51369 Sry-related 
sequence - Tarentola 
mauritanica 
(fragment)>GP:TELM 
G42DNAJ Gecko 
MG42 gene, partial 
cds; Sry-related 
sequence 


0.99 


268 


AFO 16674 


Caenorhabditis 
elegans cosmid 
C03H5. 


0.0015 


SCYJL204C 
1 


S;cerevisiae 
chromosome X reading 
frame ORF YJL204c 


1 


269 


AF016674 


Caenorhabditis 
elegans cosmid 
C03H5. 


0.0015 


CEM199J 


Caenorhabditis elegans 
cosmid Ml 99, 
complete sequence; 
M 1 99 ;e; Protein 
predicted using 
Genefinder; 
preliminary prediction 


0.97 


270 


AFO 16674 


Caenorhabditis 
elegans cosmid 
C03H5. 


0.0015 


CEM199J 


Caenorhabditis elegans 
cosmid Ml 99, 
complete sequence; 
M199;e; Protein 
predicted using 
Genefinder; 
preliminary prediction 


0.97 


271 


Z54199 


L.esculentum 
DNA Ailsa craig 
encoding 1- 
aminocyclopropa 
ne-l-carboxylic 
acid oxidase. 


0.0015 


CELF20A1_5 


Caenorhabditis elegans 
cosmid F20A1; Coded 
for by C; elegans 
cDNA yk9gl;3; coded 
for by C; elegans 
cDNA yk9gl;5; coded 
for by C; elegans 
cDNA CEESU55F; 


0.11 
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weak similarity to 
putative 




272 


Z99943 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
313L4; HTGS 
phase 1, 


0.0014 


CEK08F8_5 


Caenorhabditis elegans 
cosmid K08F8, 
complete sequence; 
K08F8;5b 


0.93 


273 


S81083 


beta- 

ADD=adducin 
beta subunit 63 
kda 

isoform/membran 
e skeleton 
protein, beta - 
ADD=adducin 
beta subunit 63 
kda 

isoform/membran 
e skeleton protein 
{alternatively 
spliced, exon 10 
to 13 region} 
[human, 
Genomic, 1851 
nt, segment 3 of 
3]. 


0.0013 


MTCY277_7 


Mycobacterium 
tuberculosis cosmid 
Y277; Unknown; 
MTCY277;07c, 
unknown, len: 302 


0.0001 


274 


Z82174 


Human DNA 
sequence from 
cosmid B20F6 on 
chromosome 
22qll.2-qter. 


0.001 


FBLA HUM 
AN 


FIBULIN-1, 
ISOFORMA 
PRECURSOR>GP:HS 
FIBUAJ H;sapiens 
mRNAforfibulin-1 A 


0.00063 


275 


Z82215 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
6802; HTGS 
phase 1. 


0.00079 


BFR1 SCHP 
O 


BREFELDIN A 
RESISTANCE 
PROTEIN>PIR2:S522 
39 hba2 protein - 
fission yeast 
(Schizosaccharomyces 
pombe)>GP:SPHBA2 
GENJ S;pombe hba2 
gene 


0.15 
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276 


U28153 


Caenorhabditis 
elegans UNC-76 
(unc-76) gene, 
complete cds. 


0.00071 


CX2_HEMHA 


CYTOTOXIN 2 
(TOXIN 12A) 


0.32 


277 


Z82204 


Human DNA 
sequence from 
clone J362G171. 


0.00054 


DMU34925_2 


Drosophila 
melanogaster DNA 
repair protein (mei-41) 
gene, complete cds, and 
THl gene, partial cds 


0.045 


278 


AC002530 


Human BAC 
clone RG341D10 
from 7pl5-p21, 
complete 
sequence. 


0.00053 


CELT28F2_2 


Caenorhabditis elegans 
cosmid T28F2; Weak 
similarity to HSP90 


0.037 


279 


U91322 


Human 

chromosome 

16pl3 BAC clone 

CIT987SK-276F8 

complete 

sequence. 


0.00051 


CEW08D2_2 


Caenorhabditis elegans 
cosmid W08D2, 
complete sequence; 
W08D2;3; Protein 
predicted using 
Genefinder>GP:CEW0 
8D2_2 Caenorhabditis 
elegans cosmid 
W08D2; W08D2;3; 
Protein predicted using 
Genefinder 


0.26 


280 


D16986 


Human HepG2 
partial cDNA, 
clone 

hmd2b09m5. 


0.00037 


POLG PPVN 
A 


GENOME 
POLYPROTEIN 
(CONTAINS: N- 
TERMINAL 
PROTEIN; HELPER 
COMPONENT 
PROTEINASE (EC 
3.4.22.-) (HC-PRO); 
42-50 KD PROTEIN; 
CYTOPLASMIC 
INCLUSION 
PROTEIN (CI); 6 KD 
PROTEIN; NUCLEAR 
INCLUSION 
PROTEIN A (NI- A) 
(EC 3.4.22.-) (49K 
PROTEINASE) (49 


0.48 


281 


U91318 


Human 
chromosome 
16pl3 BAC clone 
CIT987SK- 
962B4 complete 


0.00031 


<NONE> 


<NONE> 


<NONE> 
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sequence. 










282 


M93406 


Human dispersed 
Alu repeats and 
dispersed LI 
repeat. 


0.0003 


VG8_SPV4 


GENE 8 

PROTEIN>PIRl:G8BP 
SV gene 8 protein - 
spiroplasma virus 4 
(SGC3) 


0.23 


283 


AC002398 


Human DNA 
from 

chromosome 19- 
specific cosmid 
F25965, genomic 
sequence, 
complete 
sequence. 


0 00021 


HMCA DRO 
ME 


HOMEOTIC 
CAUDAL 

PROTEIN>PIR2:A263 
57 homeotic protein 
Cad - fruit fly 
(Drosophila 
melanogaster)>GP:DR 
OCADA2J 
D;melanogaster caudal 
gene (cad) encoding a 
maternal and zygotic 
transcript, exon 2; 
Caudal 

protein>TFD:TFDP001 
59 - Polypeptides en 


0.021 


284 


AC002530 


Human BAC 
clone RG341D10 
from 7pl5-p21, 
complete 
sequence. 


0.0002 


PL0009 


complement 
C3d/Epstein-Barr virus 
receptor precursor - 
human 


0.7 


285 


X01871 


Yeast 

mitochondrial 
ori(o) repeat unit 
of petite mutant 5 
(petite strain s- 
10/7/2). 


0.00015 


RVZMTCYT 
BT_1 


Reventazonia sp; 
mitochondrial NADH 
dehydrogenase, and 
cytochrome b genes, 3' 
end, and transfer RNA- 
Ser gene; This codes 
for the last 43 amino 
acids of NADH 
dehydrogenase subunit 
1 followed 


0.73 


286 


U89984 


Acanthamoeba 
castellanii 
transformation- 
sensitive protein 
homolog mRNA, 
complete cds. 


0.00015 


ACU89984J 


Acanthamoeba 
castellanii 
transformation- 
sensitive protein 
homolog mRNA, 
complete cds; Similar 
to human 
transformation- 


4.20E-13 
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P 
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sensitive protein: 
SwissProt Accession 
Number P31948 




287 


AC002365 


Homo sapiens 
chromosome X 
clone U177G4, 
U152H5, 
U168D5, 174A6, 
U172D6,and 
U186B3from 
Xp22, complete 
sequence. 


0.00011 


S 10340 


DNA-directed RNA 
polymerase (EC 
2.7.7.6) - yeast 
(Kluyveromyces 
marxianus var. lactis) 


0.00062 


288 


AC002390 


Human DNA 
from overlapping 
chromosome 19- 
specific cosmids 
R30072 and 
R28588, genomic 
sequence, 
complete 
sequence. 


9.90E-05 


D86603J 


Mouse mRNA for Bach 
protein 1, complete cds; 
Bachl 


1 


289 


AC002980 


Homo sapiens; 
HTGS phase 1, 
34 unordered 
pieces. 


9.20E-05 


TRBKPCYB 
1 


Trypanosoma brucei 
kinetoplast 

apocytochrome b gene, 
complete cds 


0.52 


290 


M99412 


Human 
interleukin-8 
receptor (IL8RB) 
gene, complete 
cds. 


4.50E-05 


S28832 


microtubule-associated 
protein HI (clone 
KS3.1)-longfin squid 
(fragment) 


0.88 


291 


AC000120 


Human BAC 
clone RG161K23 
from 7q21, 
complete 
sequence. 


4.O0E-05 


SXSCRBAJ 


S;xylosus scrB and 
scrR genes; Sucrose 
repressor 


0.99 
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292 


AC003037 


Homo sapiens; 
HTGS phase 1, 
66 unordered 
pieces. 


3.40E-05 


S13569 


hypothetical protein 5 - 
Lactococcus lactis 
subsp. lactis insertion 
sequence 

1076>GP:LLTLEJ 
Lactococcus lactis 
DNAforthe 
transposon-like element 
on the lactose plasmid; 
ORF5 (AA 1 - 43) 


0.018 


293 


Z81512 


Caenorhabditis 
elegans cosmid 
F25C8, complete 
sequence. 


2.40E-05 


MUSDBPRC 
1 


Mus musculus DNA- 
binding protein Rc 
mRNA, complete cds; 
DNA binding protein 
Rc 


1 


294 


B16681 


343C3.TVB 
CIT978SKA1 
Homo sapiens 
genomic clone A- 
343C03. 


1.10E-05 


COPP_YEAS 
T 


CO A TOMER BETA* 
SUBUNIT (BETA- 
COAT PROTEIN) 
(BETA- 

COP)>PIR2:B55123 
coatomer complex beta' 
chain - yeast 
(Saccharomyces 
cerevisiae)>GPN:SCY 
GL137WJ 
S;cerevisiae 
chromosome VII 
reading frame ORF 
YGL137w>GP:SCUll 
237_I Saccharomyces 
cerevisiae 


0.081 


295 


Z 16523 


H. sapiens 
(D9S158)DNA 
segment 
containing (CA) 
repeat; clone 
AFM073ybll; 
single read. 


1.00E-05 


MMSEMFJ 


M;musculus mRNA for 
semaphorin F; 
Smaphorin F 


0.78 


296 


Z49704 


S.cerevisiae 
chromosome XIII 
cosmid 8021. 


5.60E-06 


<NONE> 


<NONE> 


<NONE> 


297 


AC003071 


Human BAC 
clone BK085E05 
from 22ql2.1- 
qter, complete 
sequence. 


3.00E-06 


HSRCAER_1 


H;sapiens mRNA for 
red cell anion 
exchanger (EPB3, AE1, 
Band 3) 3' non-coding 
region 


0.21 
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298 


U20428 


Human SNC 19 
mRNA sequence. 


1.40E-06 


HUMMUC2A 
J 


Human mucin-2 gene, 
partial cds 


4.40E-06 


299 


U51903 


Human RasGAP- 
related protein 
(IQGAP2) 
mRNA, complete 
cds. 


6.60E-07 


IQGA HUMA 
N 


RAS GTPASE- 
ACTIVA TING-LIKE 
PROTEIN IQGAP1 
(P195)>PIR2:A54854 
Ras GTPase activating- 
related protein - 
human>GP:HUMIQG 
A_l Homo sapiens ras 
GTPase-acti vating-1 ike 
protein (IQGAP1) 
mRNA, complete cds; 
Amino acid feature: IQ 
calmodulin-binding do 


1.60E-14 


300 


AL000805 


F.rubripes GSS 
sequence, clone 
021G08aAl. 


4.70E-07 


MT13 MYTE 
D 


METALLOTHIONEIN 
10-111 (MT-10- 
III)>PIR2:S39418 
metallothionein 10-111 - 
blue mussel 


2.20E-10 


301 


AC003016 


Human BAC 
clone RG134C19 
from 8q21, 
complete 
sequence. 


4.30E-07 


SPC57A10_5 


S;pombe chromosome I 
cosmid c57A10; 
Unknown; 
SPAC57A10;05;c, 
unknown, len:606aa, 
similar to A; nidulans 
Q00659, sulfur 
metabolite repression 
control, (678aa), fasta 
scores, opt: 1355, 


0.00041 


302 


AC003089 


Human BAC 
clone 

RG180F08A, 

complete 

sequence. 


3.80E-Q7 


HPBPRECK 
1 


Hepatitis B virus type 
1 1 precore protein (pre- 
C region, C) gene, 5' 
end 


0.41 


303 


AC002074 


Human BAC 
clone GS056H18 
from 7q31-q32, 
complete 
sequence. 


2.40E-07 


A47021J 


Sequence 23 from 
Patent W09527787; 
Unnamed protein 
product; Author-given 
protein sequence is in 
conflict with the 
conceptual 

translation>GP:A5 1 260 

1 Sequence 23 from 
Patent W09614416; 
Unnamed protein 
product; Author-given 


0.0016 
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protein sequence is i 




304 


U04980 


Rattus norvegicus 
fetal troponin T 3 
(fetal TnT3) 
mRNA, partial 
cds. 


2.20E-07 


HUMFSHDJ 


Human 

facioscapulohumeral 
muscular dystrophy 
(FSHD) gene region, 
D4Z4 tandem repeat 
unit; ORF 


3.30E-08 


305 


U68704 


Human 
chromosome 
21q22.3 PI -clone 
3804 subclone 4- 
52. 


2.00E-07 


HHV6AGNM 
-96 


Human herpesvirus-6 
(HHV-6)U1102, 
variant A, complete 
virion genome; U88; 
Cys repeats; this loci is 
open in all six reading 
frames, part of IE-A 


2.70E-05 


306 


U51583 


Rattus norvegicus 
zinc finger 
homeodomain 
enhancer-binding 
protein- 1 (Zfhep- 
l)mRNA, partial 
cds. 


8.70E-08 


AF005370_67 


Alcelaphine 
herpesvirus 1 L-DNA, 
complete sequence; 
Putative immediate 
early protein; ORF73; 
similar to H; saimiri 
and KSHV ORF73 


6.10E-07 


307 


M80206 


Mus domesticus 
poliovirus 
receptor homolog 
(MPH) mRNA, 
complete cds. 


8.10E-08 


153960 


PRR2 alpha -human 


1.70E-28 


308 


M60854 


Human ribosomal 
proteinS 16 
mRNA, complete 
cds. 


5.70E-08 


OLVPOLJ 


Caprine arthritis 
encephalitis virus 
(isolate OVLV-Nl)pol 
protein gene, 3' end of 
cds; Nt 2497-2695 
from CAEV Co 


0.27 


309 


U82828 


Homo sapiens 
ataxia 

telangiectasia 
(ATM) gene, 
complete cds. 


1.50E-08 


C40201 


artifact-warning 
sequence (translated 
ALU class C) - human 


0.00044 
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310 


Z83836 


Human DNA 
sequence from 
PAC lllJ24on 
chromosome 
22ql2-qter 
contains ESTs. 


1.40E-08 


HSU64473J 


Human rheumatoid 
arthritis synovium 
immunoglobulin heavy 
chain variable region 
mRNA, partial 
cds>GP:HSU64498_l 
Human rheumatoid 
arthritis synovium 
immunoglobulin heavy 
chain variable region 
mRNA, partial cds 


0.34 j 


311 


Z50029 


Caenorhabditis 
elegans cosmid 
ZC504, complete 
sequence. 


1 .40E-08 


MMU88984 1 


Mus musculus NIK 
mRNA, complete cds 


1.70E-50 


312 


AC002351 


Homo sapiens; 
HTGS phase 1, 
17 unordered 
pieces. 


1.20E-08 


D41132 


collagen-related protein 
4 - Hydra 
magnipapillata 
(fragment)>PIR2:S219 
32 mini-collagen - 
Hydra 

sp>GP:HSNCOL4 1 
HydraN-COL4 
mRNA for mini- 
collagen; No start 
codon 


0.02 


313 


B65763 


CIT-HSP- 
2023A12.TR 
CIT-HSP Homo 
sapiens genomic 
clone 2023A12. 


3.60E-09 


S18106 


type II site-specific 
deoxyribonuclease (EC 
3.1.21.4) Abrl- 
Azospirillum brasilense 


0.045 


314 


Z93021 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
516C23;HTGS 
phase 1. 


2.00E-09 


AB001684 13 
4 


Chlorella vulgaris C-27 
chloroplast DNA, 
complete sequence; 
RNA polymerase 
gamma subunit 


0.6 


315 


D88035 


Rat mRNA for 
glycoprotein 
specific UDP- 
glucuronyltransfe 
rase, complete 
cds. 


1.50E-09 


D88035J 


Rat mRNA for 
glycoprotein specific 
UDP- 

glucuronyltransferase, 
complete cds 


1.00E-33 



222 



WO 99/33982 



Table 2 



PCT/US98/27610 





Nearest 






Nearest 






I;- - .:, ' . 11": 


Neighbor 
(BlastN vs. 
Genbank) 






Neighbor 
(BlastXvs. 
Non- 
Redundant 
Proteins) 






SEQ 


ACCESSION 


DESCRIPTION 


P 


ACCESSION 


DESCRIPTION 


P 


ID 






VALUE 






VALUE 


316 


U85193 


Human nuclear 
factor I-B2 
(NFIB2) mRNA, 
complete cds. 


1.30E-10 


VGF1JBVB 


Fl 

PR0TE1N>PIR1:VF1H 
Bl Fl protein - avian 
infectious bronchitis 
virus (strain 
Beaudette)>GP:IBACG 
B_l Avian infectious 
bronchitis virus pol 
protein, spike protein, 
small virion-associated 
protein, membrane 
protein, and 
nucleocapsid protein 
gen 


1 


317 


B04719 


cSRL-42G12-u 
cSRL flow sorted 
Chromosome 1 1 
specific cosmid 
Homo sapiens 
genomic clone 
CSRL-42G12. 


7.90E-11 


JC5238 


galactosylceramide-like 
protein, GCP - human 


0.31 


318 


M73506 


Mouse Tcp-lOc (t 
allele) gene. 


2.80E-11 


A39487 


T-complex protein 10a 
(allele 129) -mouse 


4.10E-16 


319 


U71148 


Human Xq28 
cosmids U225B5 
and U236A12, 
complete 
sequence. 


1.20E-11 


A56547 


sex-peptide precursor - 
Drosophila suzukii 


0.4 


320 


Z95116 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
57G9; HTGS 
phase 1 . 


9.90E-13 


ALU2 HUM 
AN 


!!!! ALU 

SUBFAMILY SB 
WARNING ENTRY 
!!!! 


0.0017 


321 


M64795 


Rat MHC class I 
antigen gene 
(RTl-u 
haplotype), 
complete cds. 


1.70E-14 


STCDROME 


SHUTTLE CRAFT 
PROTEIN>GP:DMU0 
9306_1 Drosophila 
melanogaster shuttle 
craft protein (stc) 
mRNA, complete cds; 
C-terminal 222 amino 
acids encode a novel 
single- stranded DNA 
binding domain 


1.40E-13 
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322 


Y09036 


H.sapiens 
NTRK1 gene, 
exon 17. 


4.20E-15 


AF010403J 


Homo sapiens ALR 
mRNA, complete cds; 
Alternatively spliced; 
similarity to ALL-1 and 
Drosophila trithorax 


1 


323 


U12523 


Rattus norvegicus 
ultraviolet B 
radiation- 
activated UV98 
mRNA, partial 
sequence. 


2.90E-15 


SPBC30D10 
4 


S;pombe chromosome 
II cosmid C30D10; 
Hypothetical protein; 
SPBC30D10;04, 
unknown, len:148aa 


2.40E-09 


324 


Z98755 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
76C18;HTGS 
phase 1. 


2.20E-15 


RPON HAL 
MA 


DNA-DIRECTED 
RNA POLYMERASE 
SUBUNITN(EC 
2.7.7.6)>PIR2:D41715 
DNA-directed RNA 
polymerase II chain 
RPBlOhomolog- 
Haloarcula 

marismortui>GP:HAL 
HMAENOA_4 
H;marismortui tRNA- 
Leu, HL29, HmaL13, 
HmaS9, OrfMMV, 
OrfMNA, 2- 
phosphoglycerate 
dehydr 


0.019 


325 


M86917 


Human oxysterol- 
binding protein 
(OSBP) mRNA, 
complete cds. 


1.60E-15 


CEF14H8_2 


Caenorhabditis elegans 
cosmid F14H8, 
complete sequence; 
F14H8;1; Similarity to 
Human oxysterol- 
binding protein 
(SW:OXYB_HUMAN) 


2.10E-18 


326 


AC001231 


Genomic 
sequence from 
Human 17, 
complete 
sequence. 


1.30E-15 


AC002397_3 


Mouse BAC284H12 
Chromosome 6, 
complete sequence; 
DRPLA 


0.0016 


327 


AL008626 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
1114G22; HTGS 
phase 1. 


5.30E-16 


TAU48227J 


Triticum aestivum 
soluble starch synthase 
mRNA, partial cds 


5.90E-05 
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328 


L04483 


Human ribosomal 
protein S21 
(RPS21)mRNA, 
complete cds. 


7.60E-17 


RS21 HUM A 
N 


40S RIBOSOMAL 
PROTEIN 
S21>PIR2:S34108 
ribosomal protein S21 - 
human>GP:SSZ840 1 5_ 
1 S;scrofa mRNA; 
expressed sequence tag 
(3'; clone cllglO); 40S 
ribosomal protein S21; 
Similar to human 40S 
ribosomal protein 
S21>GP:HUMRPS21X 
1 Human ribosomal 


1.40E-09 


329 


AB001899 


Homo sapiens 
PACE4 gene, 
exon 2. 


6.70E-17 


LRP1 HUMA 
N 


LOW-DENSITY 
LIPOPROTEIN 
RECEPTOR- 
RELATED PROTEIN 

1 PRECURSOR (LRP) 
(ALPHA-2- 
MACROGLOBULIN 
RECEPTOR) (A2MR) 
(APOLIPOPROTEIN 
E RECEPTOR) 
(APOER)>PIR2:S0239 

2 LDL receptor-related 
protein precursor - 
human>GP:HSLDLRR 
L_l Human mRNA for 
LDL-recept 


1 


330 


Z98755 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
76C18;HTGS 
phase 1. 


4.40E-17 


U97553_59 


Murine herpesvirus 68 
strain WUMS, 
complete genome; 
Ribonucleotide 
reductase large 


0.06 


331 


AF017187 


Homo sapiens 
LTR HERV-K 
repetitive element 
fragment 
ltr_19_9a 
sequence. 


3.90E-18 


D84255J 


Ovophis okinavensis 
mitochondrial DNA for 
NADH dehydrogenase 
subunit 1, partial cds, 
Ile-tRNA, Pro-tRNA, 
Phe-tRNA,Gln- tRNA, 
Met-tRNA and control 
region (D-loop region); 
This cds 


0.007 



225 



wo 



99/33982 



Table 2 



PCT/US98/27610 







Nearest 
Neighbor 
(BJastN vs. 
Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


ID 


ACCESSION 


DESCRIPTION 


1 P 
VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


332 


B36252 


HS-1038-A2- 
GOI-MR.abi CIT 
Human Genomic 
Sperm Library C 
Homo sapiens 
genomic clone 
Plate=CT 820 
CoI=2 Row=M. 


3.10E-18 


PGBM MOU 
SE 


BASEMENT 
MEMBRANE- 
SPECIFIC HEPARAN 
SULFATE 
PROTEOGLYCAN 
CORE PROTEIN 
PRECURSOR (HSPG) 
(PERLECAN) 
(PLC)>PIR2:S18252 
heparan sulfate 
proteoglycan - 
mouse>GP:MUSPERP 
A_l Mouse perlecan 
mRNA, complete cds 


0.00015 


333 


D78255 


Mouse mRNA for 
PAP-1, complete 
cds. 


2.70E-18 


MUSPAP1J 


Mouse mRNA for 
PAP-1, complete cds 


3.50E-18 


334 


AC003046 


Human Xp22 
PACs RPC11- 
263P4 and 
RPC11-164K3 
complete 
sequence. 


1.40E-18 


CEC34F6J 


Caenorhabditis elegans 
cosmid C34F6; 
C34F6;1;CDNA EST 
yk46bl2;5 comes from 
this gene; cDNA EST 
yk44c4;5 comes from 
this gene; cDNA EST 
yk46bl2;3 comes from 
this gene 


0.0015 


335 


AC003002 


Human DNA 
from overlapping 
chromosome 19- 
specific cosmids 
R29515and 
R28253, genomic 
sequence, 
complete 
sequence. 


1.40E-18 


MUSZFPOJ 


Mouse mRNA for zinc 
finger protein, partial 
sequence 


1.30E-19 


336 


Y 15054 


Rattus norvegicus 
mRNA for 70 
kDa tumor 
specific antigen, 
partial. 


3.40E-19 


HS4U2IR2J 


Epstein-Barr virus 
(AG876 isolate) U2- 
IR2 domain encoding 
nuclear protein 
EBNA2, complete cds; 
Nuclear antigen 2 


2.00E-06 


337 


Z97876 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
295C6; HTGS 


1.30E-19 


AF003535J 


Homo sapiens LI 
element ORF2-like 
protein gene, partial cds 


7.00E-05 
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phase 1 . 










338 


M97159 


Mouse (clone 
pIL2)Bl 
dispersed repeat 
unit. 


1.10E-19 


A26882 


pIL2 hypothetical 
protein - rat 
(fragment)>GP:RATT 
DRJ Rat growth and 
transformation- 
dependent mRNA, 3' 
end; Growth and 
transformation 
dependent protein 


0.2 


339 


U30817 


Bos taurus very- 
long-chain acyl- 
CoA 

dehydrogenase 
mRNA, nuclear 
gene encoding 
mitochondrial 
protein, complete 
cds. 


4.70E-20 


ACDV_RAT 


ACYL-COA 

DEHYDROGENASE, 

VERY-LONG-CHAIN 

SPECIFIC 

PRECURSOR (EC 

1.3.99.-) 

(VLCAD)>PIR2:A548 
72 acyl-CoA 
dehydrogenase (EC 
1.3.99.-) very-long- 
chain-specific 
precursor - 

rat>GP:RATVLCAD_ 
1 Rat mRNA for very- 
long-chain Acyl-CoA 
dehydrogenase, compl 


8.10E-25 


340 


Y11535 


H.sapiens mRNA 
forSHOXb 
protein. [ 


2.80E-20 


ALU1 HUM 
AN 


!!!! ALU 
SUBFAMILY J 
WARNING ENTRY 

Mil 


0.00027 


341 


AL008730 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
487J7; HTGS 
phase 1 . 


7.10E-21 


C40201 


artifact-warning 
sequence (translated 
ALU class C) - human 


0.001 


342 


U96629 


Human 

chromosome 8 
BAC clone 
CIT987SK-2A8 
complete 
sequence. 


5.30E-23 


ALU1 HUM 
AN 


!!!! ALU 
SUBFAMILY J 
WARNING ENTRY 
!!!! 


3.80E-10 
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343 


U95743 


Homo sapiens 
chromosome 16 
BAC clone 
CIT987-SK65D3, 
complete 
sequence. 


2.10E-24 


UROM HUM 
AN 


UROMODULIN 
PRECURSOR 
(TAMM-HORSFALL 
URINARY 
GLYCOPROTEIN) 
(THP)>PIR2:A30452 
uromodulin precursor - 
human>GP:HUMUMO 
D_l Human 
uromodulin (Tamm- 
Horsfall glycoprotein) 
mRNA, complete cds; 
Uromodulin precursor 


1 ; 


344 


U 15972 


Mus musculus 
homeobox 
(Hoxa7) gene, 
complete cds. 


4.00E-25 


S20790 


extensin - 

almond>GP:PAEXTS_ 
1 P;amygdalus mRNA 
for extensin 


0.34 


345 


U I 5972 


Mus musculus 
homeobox 
(Hoxa7) gene, 
complete cds. 


4.00E-25 


CA24 CAEE 
L 


COLLAGEN ALPHA 
2(IV) CHAIN 
PRECURSOR>GP:CE 
COLA2IV_2 Qelegans 
a2(IV) collagen gene; 
Alternatively spliced 
transcript 


0.1 j 


346 


Z66242 


H.sapiens CpG 
island DNA 
genomic Msel 
fragment, clone 
84a4, reverse read 
cpg84a4.rtla. 


4.80E-26 


CEC35A5J 


Caenorhabditis elegans 
cosmid C35A5, 
complete sequence; 
C35A5;8; CDNA EST 
yk3 1 f6;5 comes from 
this gene; cDNA EST 
yk38hl ;3 comes from 
this gene; cDNA EST 
yk38hl;5 comes from 
this gene; 


7.70E-19 


347 


L25331 


Rattus norvegicus 
lysyl hydroxylase 
mRNA, complete 
cds. 


3.90E-26 


LYSH CHIC 
K 


PROCOLLAGEN- 
LYSINE,2- 
OXOGLUTARATE 5- 
DIOXYGENASE 
PRECURSOR (EC 
1.14.11.4) (LYSYL 
HYDROXYLASES 
R2:A23742 
procollagen-lysine 5- 
dioxygenase (EC 
1.14.1 1.4) precursor - 
chicken>GP:CHKLYH 
1 Chicken lysyl 


1.10E-43 
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III 

ID 
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DESCRIPTION 


P 
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ACCESSION 


DESCRIPTION 


P 

VALUE 












hydroxylase mRNA, 
complete cds 




348 


L81569 


Drosophila 
melanogaster 
(subclone 2 d7 
from PI DS04260 
(D68)) DNA 
sequence, 
complete 
sequence. 


3.30E-26 


CELC52B9_2 


Caenorhabditis elegans 
cosmid C52B9; Coded 
for by C; elegans 
cDNA cml ld6; weakly 
similar to S; cervisiae 
PTM1 precursor 
(SP:P32857) 


8.40E-29 


349 


U78082 


Human RNA 
polymerase 
transcriptional 
regulation 
mediator (h- 
MED6) mRNA, 
complete cds. 


2.30E-26 


HSU78082J 


Human RNA 
polymerase 
transcriptional 
regulation mediator (h- 
MED6) mRNA, 
complete cds; H- 
Med6p 


1.50E-16 


350 


U43381 


Human Down 
Syndrome region 
of chromosome 
21 DNA. 


2.10E-28 


HSMRNAEB 
1 


H;sapiens genomic 
DNA, integration site 
for Epstein-Barr virus; 
Hypothetical protein 


0.18 


351 


D50416 


Mouse mRNA for 
AREC3, ; 
complete cds. 


2.50E-29 


A29947 


prostaglandin- 
endoperoxide synthase 
(EC 1.14.99.1) 
precursor - 

sheep>GP:SHPCOXA_ 
1 Sheep prostaglandin 
endoperoxide 
synthetase 
(cyclooxygenase), 
complete cds; 
Cyclooxygenase 
precursor (EC 
1;14;99;1) 


0.81 
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" 118^193 


Human nuclear 

1 lUUlQll J 1U vJvUl 

factor I-B2 
(NFIB2) mRNA, 
complete cds. 


2 20E-29 


CFU30222 1 


Crithidia fasciculata 
fully edited ATPase 
subunit 6 (MURF4) 
mRNA, partial cds; 
Cryptogene 


0.53 


353 


Z92826 


Caenorhabditis 
elegans DNA *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
C18D11;HTGS 

jJllOoC 1 • 


1.10E-30 


SPAC1B3J 


S;pombe chromosome I 
cosmid clB3; 
Hypothetical protein; 
SPAC1B3;05, probable 
transcriptional 
regulator, len:630aa, 
similar ee* to YIL038C 
NOT3JYEAST, 
P06 102, general 
negative regulator, 


3.20E-35 


354 


L09604 


Homo sapiens 
differentiation- 
dependent A4 
protein mRNA, 
complete cds. 


3.70E-32 


PVU72769J 


Phaseolus vulgaris 
PvPRP-12(Pvprpl-I2) 
mRNA, partial cds; 
Similar to cell wall 
proline rich 

protein>GP:PVU72769 

1 Phaseolus vulgaris 
PvPRP-12(Pvprpl-12) 
mRNA, partial cds; 
Similar to cell wall 
proline rich protein 


0.00049 


355 


B42455 


HS-1055-B2- 
G03-MR.abi CIT 
Human Genomic 
Sperm Library C 
Homo sapiens 
genomic clone 
Plate-CT 777 
Col=6 Row=N. 


1.30E-32 


CELT05H4J 


Caenorhabditis elegans 
cosmid T05H4; Similar 
to the beta transducin 
family; coded for by C; 
elegans cDNA 
ykl56ell;3; coded for 
by C; elegans cDNA 
ykl4c8;3; coded for by 
C; elegans cDNA 


6.90E-14 


356 


AF001905 


Homo sapiens 
cosmids E079, 
B0920 and A8 
from Xq25 X- 
linked 

lymphoproliferati 
ve disease gene 
candidate region, 
complete 
sequence. 


1.80E-33 


138344 


titin - human 


1 
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~357~ 


E03743 


DNA sequence 
including male 
hormone 
dependent gene 
derived from 
hamster 
frankorgan. 


1.10E-34 


CELC03A7_2 


Caenorhabditis elegans 
cosmid C03A7; Weak 
similarity to serotonin 
receptors 


0.59 


358 


U31199 


Human laminin 
gamma2 chain 
gene (LAMC2), 
exon 22 and 
flanking 
sequences. 


1.20E-35 


B44018 


laminin B2t chain - 
human>GP:HSLAMB2 
TB_1 H;sapiens mRNA 
for laminin 


1.20E-14 


359 


D14678 


Human mRNA 
for kinesin- 
related protein, 
partial cds. 


2.00E-36 


D49544J 


Mouse mRNA for 
KIFC1, complete cds 


1.20E-23 


360 


AB000425 


Porcine DNA for 
endopeptidase 

and complete cds. 


8.20E-38 


POL4 DROM 
E 


RETROVIRUS- 

RELATEDPOL 

POLYPROTEIN 

(PROTEASE (EC 

3.4.23.-); REVERSE 

TRANSCRIPTASE 

(EC 2.7.7.49); 

ENDONUCLEASE) 

(TRANSPOSON 

412)>PIR1:GNFF42 

retrovirus-related pol 

polyprotein - fruit fly 

(Drosophila 

meianogaster) 

transposon 

412>GP:DMRT412G 
4 


0.65 


361 


U39875 


Rattus norvegicus 
EF-hand Ca2+- 
binding protein 
p22 mRNA, 
complete cds. 


8.80E-42 


156333 


apolipoprotein B - rat 
(fragment)>GP:RATA 
POLPBJ Rattus 
norvegicus (clone 
rb9E) apolipoprotein B 
apoB mRNA, 3' end 


0.23 
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L09647 


Rattus norvegicus 
hepatocyte 
nuclear factor 3a 
(HNF-3 beta) 
mRNA, complete 
cds. 


6.60E-42 


HN3B__RAT 


HEPATOCYTE 
NUCLEAR FACTOR 
3-BETA (HNF- 
3B)>GP:RATHNF3B_ 
1 Rattus norvegicus 
hepatocyte nuclear 
factor 3a (HNF-3 beta) 
mRNA, complete 
cds>TFD:TFDP01611 
- Polypeptides entry for 
factor HNF-3 (beta) 


8.10E-25 


363 


D25538 


Human mRNA 
for KIAA0037 
gene, complete 
cds. 


4.10E-43 


CELC34D4 1 
2 


Caenorhabditis elegans 
cosmid C34D4 


0.018 


364 


Z56764 


H.sapiens CpG 
island DNA 
genomic Msel 
fragment, clone 
13f7, reverse read 
cpgl3f7.rtla. 


1.40E-43 


S75263 


hypothetical protein - 
Synechocystis sp. (PCC 
6803)>GP:D90904_29 
Synechocystis sp; 
PCC6803 complete 
genome, 6/27, 630555- 
781448; Hypothetical 
protein; 

ORF ID:sll0983 


0.0028 


365 


AC002636 


*** 

SEQUENCING 
IN PROGRESS 
*** Drosophila 
melanogaster 
(subclone 2^g4 
from PI DS03323 
(D127))DNA 
sequence; HTGS 
phase 2. 


8.40E-44 


DMU95760J 


Drosophila 
melanogaster 
strawberry notch (sno) 
mRNA, complete cds; 
Notch pathway 
component; nuclear 
protein 


3.40E-51 


366 


J05499 


Rattus norvegicus 
L-glutamine 
amidohydrolase 
mRNA, complete 
cds. 


8.00E-44 


GLSL_RAT 


GLUTAM1NASE, 
LIVER ISOFORM 
PRECURSOR (EC 
3.5.1.2) 

(GLS)>GP:RATGAH_ 
1 Rattus norvegicus L- 
glutamine 
amidohydrolase 
mRNA, complete cds 


8.00E-29 



232 



WO 99/33982 



Table 2 



PCT/US98/27610 







] Nearest 
\ Neighbor 
\ (BlastN vs. 
; Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


367 


U95760 


Drosophila 
melanogaster 
strawberry notch 
(sno) mRNA, 
complete cds. 


5.00E-45 


DMU95760J 


Drosophila 
melanogaster 
strawberry notch (sno) 
mRNA, complete cds; 
Notch pathway 
component; nuclear 
protein 


4.80E-45 


368 


L10106 


Mus musculus 
protein tyrosine 
phosphate 
mRNA, complete 
cds. 


4.10E-45 


PTPK HUMA 
N 


PROTEIN-TYROSINE 
PHOSPHATASE 
KAPPA PRECURSOR 
(EC 3. 1.3.48) (R-PTP- 
KAPPA)>GP:HSPTPK 
APJ H;sapiens mRNA 
for phosphotyrosine 
phosphatase kappa; 
Human 

phosphotyrosine 
phosphatase kappa 


4.70E-16 


369 


D17218 


Human HepG2 3' 
region Mbol 
cDNA, clone 
hmd3g02m3. 


9.40E-47 


MMU53563J 


Mus musculus Brgl 
mRNA, partial cds; N- 
terminal region of the 
protein 


0.00012 


370 


U78310 


Homo sapiens 
pescadillo 
mRNA, complete 
cds. 


8.10E-48 


HSU78310J 


Homo sapiens 
pescadillo mRNA, 
complete cds 


1.10E-21 


371 


AC000399 


Genomic 
sequence from 
Mouse 9, 
complete 
sequence. 


7.40E-48 


KIP2_YEAST 


KINESIN-LIKE 
PROTEIN 
KIP2>PIR1:C42640 
kinesin-related protein 
KIP2 - yeast 
(Saccharomyces 
cerevisiae)>GP:SCKIP 
2XVI_2 S;cerevisiae 
PEP4 and KIP2 genes 
encoding PEP4 
proteinase (partial) and 
kinesin-related protein 
KIP2>GP:SCLACHX 
VlJ7S;cerev 


0.14 


372 


AC002327 


*** 

SEQUENCING 
IN PROGRESS 
*** Genomic 
sequence from 
Mouse 7; HTGS 
phase I, 3 


1.40E-48 


CHKC1A205 
1 


Chicken alpha-2 type-1 
collagen; amino acids - 
16 to 3; Precollagen 
alpha-2 


0.024 
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unordered pieces. 










373 


X67016 


H.sapiens mRNA 
for amphiglycan. 


9.00E-49 


CED2085_2 


Caenorhabditis elegans 
cosmid D2085, 
complete sequence; 
D2085;l; Similar to 
glutamine-dependent 
carbamoyl-phosphate 
synthase, aspartate 
carbamoyltransferase, 
dihydroorotase; cDNA 
EST 

cml6O>GP:CED2085 
_2 Caenorhabditis 
elegans cosmid D2085; 
D 


0.14 


374 


L10409 


Mouse fork head 
related protein 
(HNF-3beta) 
mRNA, complete 
cds. 


1.50E-49 


MMU04197J 


Mus musculus HNF3 
beta transcription factor 
(HNF3b) mRNA, 
partial cds; Sequence of 
this partial cDNA 
begins in the first third 
of the conserved 
HNF3/forkheadDNA 
binding domain 


1.20E-30 


375 


U01139 


Mus musculus 
B6D2F1 clone 
2C11B mRNA. 


1.20E-49 


SPBC3D5J4 


S;pombe chromosome 
11 cosmid c3D5; 
Unknown; 
SPBC3D5;14c, 
unknown; partial; 
serine rich, len:309aa, 
similar eg; to 
YNL283C, 
YN23_YEAST, 
P53832, hypothetical 
52;3 kd protein, 
(503aa), 


0.00091 


376 


Z82170 


Human DNA 
sequence from 
PAC 326L13 
containing brain- 
4 mRNA ESTs 
and polymorphic 
CA repeat. 


9.00E-50 


BSU55043_3 


Bacillus subtilis 
plasmid pPOD2000 
Rep, RapAB, RapA, 
ParA, ParB, and ParC 
genes, complete cds; 
ORF3 


0.025 
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~TFT 


Z99289 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
142L7; HTGS 
phase 1. 


7.70E-50 


A64431 


hypothetical protein 
MJ1050- 
Methanococcus 
jannaschii>GP:MJU67 
548_2 Methanococcus 
jannaschii from bases 
986219 to 996377 
( section 90 of 1 50} of 
the complete genome; 
M; jannaschii predicted 
coding region MJ1050; 
Identified by 
GeneMark; putativ 


5.60E-05 


378 


X98260 


H.sapiens mRNA 
for M-phase 
phosphoprotein, 
mppll. 


6.20E-50 


ZRF1 MOUS 
E 


ZUOTIN RELATED 
FACTOR>GP:MMU53 
208_1 Mus musculus 
zuotin related factor 
(ZRFl)mRNA, 
complete cds; Similar 
to DnaJ encoded by 
GenBank Accession 
Number LI 6953 


3.90E-30 


379 


M18981 


Human prolactin 
receptor- 
associated protein 
(PRA) gene, 
complete cds. 


9.00E-52 


SI 06 HUMA 
N 


CALCYCLIN 
(PROLACTIN 
RECEPTOR 
ASSOCIATED 
PROTEIN) (PRA) 
(GROWTH FACTOR- 
INDUCIBLE 
PROTEIN 2 A9) (SI 00 
CALCIUM-BINDING 
PROTEIN 
A6)>PIR1:BCHUY 
calcyclin - 

human>GP:HUMCAC 
Y_l Human calcyclin 
gene, complete 
cds>GP:HUMCACYA 

1 Human prolactin 
recept 


8.80E-24 


380 


AB006622 


Homo sapiens 
mRNA for 
KIAA0284 gene, 
partial cds. 


1.60E-53 


S33015 


lypothetical protein - 
human herpesvirus 4 


0.00088 
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381 


U53225 


Human sorting 
nexin 1 (SNX1) 
mRNA, complete 
cds. 


1.80E-55 


G02522 


sorting nexin 1 - 
human>GP:HSU53225 

1 Human sorting 
nexin 1 (SNX1) 
mRNA, complete cds 


9.20E-50 


382 


Z92844 


Human DNA 
sequence from 
PAC 435C23 on 
chromosome X. 
Contains ESTs. 


6.50E-56 


D14487J 


Lentinus edodes 
Le;MFBl mRNA, 
complete cds 


1 


383 


D87450 


Human mRNA 
for KIAA0261 
gene, partial cds. 


4.30E-56 


D87450J 


Human mRNA for 
KIAA0261 gene, 
partial cds; Similar to 
D;melanogaster parallel 
sister chromatids 
protein 


4.30E-30 


384 


AC002301 


*** 

SEQUENCING 
IN PROGRESS 
*** Human 
chromosome 
+16pll.2 BAC 
clone CIT987SK- 
A-328A3; HTGS 
phase 2, 1 
ordered pieces. 


9.80E-57 


S62328 


kinesin-like DNA 
binding protein KID - 
human>GP:HUMKID_ 
1 Human mRNA for 
Kid (kinesin-like DNA 
binding protein), 
complete cds 


2.60E-27 


385 


L29766 


Homo sapiens 
epoxide hydrolase 
(EPHX)gene, 
complete cds. 


7.30E-57 


HSBCTCF4J 


Homo sapiens mRNA 
forhTCF-4 


2.30E-05 


386 


U58884 


Mus musculus 
SH3-containing 
protein SH3P7 
mRNA, complete 
cds. similar to 
Human Drebrin. 


3.30E-58 


MMU58884J 


Mus musculus SH3- 
containing protein 
SH3P7 mRNA, 
complete cds; similar to 
Human Drebrin; SH3- 
containing protein; 
similar to human 
drebrin 


6.00E-43 


387 


Y15054 


Rattus norvegicus 
mRNA for 70 
kDa tumor 
specific antigen, 
partial. 


9.50E-59 


RNY15054J 


Rattus norvegicus 
mRNA for 70 kDa 
tumor specific antigen, 
partial; 70 kD tumor- 
specific antigen 


4.70E-45 
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(BlastN vs. 
Genbank) 


Nearest 
Neighbor 
(BlastXvs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


388 


AC000406 


*** 

SEQUENCING 
IN PROGRESS 
*** Human 
Chromosome 1 1 
overlapping pacs 
pDJ235kl0and 
pDJ239b22; 
HTGS phase 1, 
17 unordered 
pieces. 


7.40E-59 


<NONE> 


<NONE> 


<NONE> 


389 


L42612 


Homo sapiens 
keratin 6 isoform 
K6f (KRT6F) 
mRNA, complete 
cds. 


3.60E-59 


KRHUEA 


keratin, type II 
cytoskeletal - human 
(fragment)>GP:HSKE 
RA l Human 
messenger fragment 
encoding cytoskeletal 
keratin (type II); 
mRNA from cultured 
epidermal cells from 
human 

foreskin>GP:HUMKE 
R56KJ Human 56k 
cytoskeletal type II 
keratin mRNA 


7.60E-30 


390 


L29766 


Homo sapiens 
epoxide hydrolase 
(EPHX)gene, 
complete cds. 


2.70E-60 


EGR2 HUMA 
N 


EARLY GROWTH 
RESPONSE PROTEIN 
2 (EGR-2) (KROX-20 
PROTEIN) 

(AT591)>GP:HUMEG 
R2A_1 Human early 
growth response 2 
protein (EGR2) 
mRNA, complete 
cds>TFD:TFDP00485 
- Polypeptides entry for 
factor Egr-2 


7.80E-06 


391 


L08758 


Mus musculus 
homeobox protein 
(Hox A 10) gene, 
5' end of cds. 


1.40E-60 


PAALGYGE 

NJ ' 


P;aeruginosa algY 
gene; Alginate lyase 


0.00031 


392 


129058 


Sequence 3 from 
patent US 
5576423. 


4.20E-61 


JC5106 


stromal cell-derived 
factor 2- 

human>GP:D50645J 
Human mRNA for 
SDF2, complete cds; 
Stroma cell-derived j 


1.50E-32 
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Nearest 
Neighbor 
(BlastN vs. 
Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
i ID 


ACCLoMON 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 












factor-2 




393 


129058 


Sequence 3 from 
patent US 
5576423. 


4.20E-61 


JC5106 


stromal cell-derived 
factor 2 - 

human>GP:D50645J 
Human mRNA for 
SDF2, complete cds; 
Stroma cell-derived 
factor-2 


1.50E-32 


394 


U46067 


Capra hircus 
beta-mannosidase 
mRNA, complete 
cds. 


1.90E-62 


CHU46067_1 


Capra hircus beta- 
mannosidase mRNA, 
complete cds 


2.70E-39 


395 


U40747 


Mus musculus 
formin binding 
protein 1 1 
mRNA, partial 
cds. 


6.90E-63 


S64713 


formin binding protein 
1 1 - mouse 

(fragment)>GP:MMU4 
0747_J Mus musculus 
formin binding protein 
1 1 mRNA, partial cds; 
FBP 11; Formin 
binding protein 11; 
tandem WWP/WW 
domains separated by 
15 amino acid linker 


3.00E-46 


396 


M36164 


Human 

glyceraldehyde-3- 
phosphate 
dehydrogenase 
mRNA, y flank. 


1.10E-63 


BHT1UL_12 


Bovine herpesvirus 
type 1 UL22-35 genes; 
UL26;5>GP:BHU3180 
9_2 Bovine herpesvirus 
1 maturational 
proteinase (UL26) 
gene, complete cds, and 
scaffold protein 
(UL26;5)gene, 
complete cds 


0.003 


397 


Y09036 


H.sapiens 
NTRK1 gene, 
exon 17. 


7.30E-65 


MMU39060_1 


Mus musculus 
glucocorticoid receptor 
i nteracting protein I 
(GRIP 1) mRNA, 
complete cds; 
Hormone-dependent 
interaction with 
hormone binding 
domains of steroid 
receptors; 


0.0054 
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Nearest 
Neighbor 
(BlastN vs. 
Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
Tn 


ACCESSION 


DESCRIPTION 


P 

VAT ITF 


ACCESSION 


DESCRIPTION 


P 

VAT TT1T 












transactivation 




398 


U17901 


Rattus norvegicus 
phospholipase A- 
2-activating 
protein (plap) 
mRNA, complete 
cds. 


2.70E-70 


JC4239 


phospholipase A2- 
activating protein - rat 


8.40E-17 


399 


; D12646 


Mouse kif4 
mRNA for 
microtubule- 
based motor 
protein KIF4, 
complete cds. 


1.70E-74 


KIF4 MOUS 
E 


KINESIN-LIKE 
PROTEIN 
K1F4>PIR2:A54803 
microtubule-associated 
motor KIF4 - 
mouse>GP:MUSKIF4_ 
1 Mouse kif4 mRNA 
for microtubule-based 
motor protein KIF4, 
complete cds; ATP- 
binding site: base980- 
1037, motor domain: 
base732-1781, alpha- 
helical CO 


1.10E-44 


400 


AF007860 


Xenopus laevis 
xl-Mago mRNA, 
complete cds. 


4.60E-75 


AF007862J 


Mus musculus mm- 
Mago mRNA, 
complete cds; Similar 
to Drosophila 
melanogaster Mago 
protein 


6.50E-68 


401 ; 


145565 


Sequence 15 from 
patent US 
5637463. 


2.30E-82 


RNU57391J 


Rattus norvegicus 
FceRI gamma-chain 
interacting protein 
SH2-B(SH2-B) 
mRNA, complete cds; 
Putative FceRI gamma 
ITAM interacting 
protein; SH2 domain- 
containing protein B; 
Method: conceptual 


9.90E-42 
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Nearest 
Neighbor 
(BlastN vs. 
Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


402 


U29156 


Mus musculus 
epslSRmRNA, 
complete cds. 


1.00E-85 


MMU29156J 


Mus musculus epsl5R 
mRNA, complete cds; 
Involved in signaling 
by the epidermal 
growth factor receptor; 
Method: conceptual 
translation supplied by 
author 


4.90E-62 


403 


U70139 


Mus musculus 
putative CCR4 
protein mRNA, 
partial cds. 


1.00E-85 


MMU70139J 


Mus musculus putative 
CCR4 protein mRNA, 
partial cds; Similar to 
yeast transcription 
factor CCR4; 
transcriptional 
readthrough occurs 
with transcription being 
initiated at the IAP and 
continues 


7.20E-66 


404 


U82626 ! 


Rattus norvegicus 
basement 
membrane- 
associated 
chondroitin 
proteoglycan 
Bamacan mRNA, 
complete cds. 


7.60E-96 


RNU82626J 


Rattus norvegicus 
basement membrane- 
associated chondroitin 
proteoglycan Bamacan 
mRNA, complete cds; 
Chondroitin sulfate 
proteoglycan; CSPG 


8.20E-58 
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Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


405 


L09604 


Homo sapiens 
differentiation- 
dependent A4 
protein mRNA, 
complete cds. 


2.00E-35 


<NONE> 


<NONE> 


<NONE> 


406 


AB000516 


Homo sapiens 
mRNA for DSIF 
pi 60, complete 
cds 


0.41 


POLGJTUMVQ 


GENOME 
POLYPROTEIN 
(CONTAINS: N- 
TERMINAL 
PROTEIN; HELPER 
COMPONENT 
PROTFINA^F (PC 
3.4.22.-) (HC-PRO); 
42-50 KD 
PROTEIN; 
CYTOPLASMIC 
INCLUSION 
PROTEIN (CI); 6 
KD PROTEIN; VPG 
PROTEIN; 
NUCLEAR 
INCLUSION 
PROTEIN A fNI-A") 


2.9 


407 


Z94753 


Human DNA 
sequence from 
PAC 465G10on 
chromosome X 
contains Menkes 
Disease (ATP7A) 
putative Cu++- 
transporting P- 
type ATPase 
exons 22, 23 and 
STS 


0.004 


<NONE> 


<NONE> 


<NONE> 


408 


AB011123 


Homo sapiens 
mRNA for 
KIAA0551 
protein, partial 
cds 


0 


MI15_CAEEL 


Q23356 

caenorhabditis 

elegans. 

serine/threonine- 
Drotein kinase mig- 
15 (ec 2.7.1.-). 11/98 


2.00E-51 


409 


D17218 


Human HepG2 3 1 
region Mbpl 
cDNA, clone 
hmd3g02m3 


e-123 


NARG BACSU 


NITRATE 
REDUCTASE 
ALPHA CHAIN (EC 
1.7.99.4) 


9.9 


410 


M95098 ] 


Bos taurus 
ysozyme gene 
[cow 2), complete 
:ds 


1.1 


HAIR MOUSE 1 
1 


HAIRLESS 
PROTEIN 


8.00E-10 
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Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


\ DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


411 


Z60048 


H.sapiens CpG 
DNA, clone 
187a9, reverse 
read 

cpgl87a9.rtla. 


4.00E-54 


HN3B_MOUSE 


HEPATOCYTE 
NUCLEAR 
FACTOR 3-BETA 
(HNF-3B) 


4.00E-21 


412 


Z48975 


P.magnus gene 
for protein urPAB 


0.014 


YPT2_CAEEL 


HYPOTHETICAL 
21.6 KD PROTEIN 
F37A4.2 IN 
CHROMOSOME III 


2.00E-12 


413 


AJ001296 


Notophthalmus 
viridescens 
mRNA for 
cytokeratin 8 


0.37 


YA53_SCHPO 


HYPOTHETICAL 
24.2 KD PROTEIN 
C13A11.03 IN 
CHROMOSOME I 


5.00E-21 


414 


J03831 


Xenopus laevis 
(clonepXEC1.3) 
C protein mRNA, 
complete cds. 


0.37 


PDR5JYEAST 


SUPPRESSOR OF 
TOXICITY OF 
SPORIDESMIN 


3.3 


415 


AB007157 


Homo sapiens 
gene for 

ribosomal protein 
S21, partial cds 


e-142 


RS2 INHUMAN 


40S RIBOSOMAL 
PROTEIN S21 


0.002 


416 


X86340 


H.sapiens C7 
gene, exon 13 


3.3 


STCJDROME 


SHUTTLE CRAFT 
PROTEIN 


4.3 


417 


U12404 


Human Csa-19 
mRNA, complete 
cds. 


0 


R10A_PIG 


60S RIBOSOMAL 
PROTEIN LI OA 
(CSA-19) 
(FRAGMENT) 


9.00E-57 


418 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


8.00E-08 


<NONE> 


<NONE> 


<NONE> 


419 


M80198 


Human FKBP-12 
pseudogene, clone 
lambda-512, 5' 
flank and 
complete cds. 


5.00E-14 


RC01_NEUCR 


TRANSCRIPTIONA 
L REPRESSOR 
RCO-1 


0.008 


420 


AF052573 

J 
< 


Homo sapiens 
DNA polymerase 
eta(POLH) 
mRNA, complete 
:ds 


0 


<NONE> 


<NONE> 


<NONE> 


421 


AF035940 I 

1 


Homo sapiens 
VIAGOH mRNA, 
complete cds 


e-131 


MGN_DROME ] 
1 


vlAGONASHI 
PROTEIN 


4.00E-39 


422 


AF054994 ] 
< 
i 


4omo sapiens 
;Ione 23832 
nRNA sequence 


0.12 


<NONE> 


<NONE> 


<NONE> 
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Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


423 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


6.00E-05 


<NONE> 


<NONE> 


<NONE> 


424 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


7.00E-07 


<NONE> 


<NONE> 


<NONE> 


425 


D43952 


Mouse gene for 
reticulocalbin, 
exonl and 
promoter region 


0.36 


<NONE> 


<NONE> 


<NONE> 


426 


X68553 


C.elegans 
repetitive DNA 
sequence 


0.4 


TCB1 RABIT 


T-CEl I 

RECEPTOR BETA 
CHAIN 

PRECURSOR (ANA 
ID 


1 nil 

U.I 1 


427 


M83314 


Tomato 
phenylalanine 
ammonia lyase 
(pal) gene, 
complete cds and 
promoter region. 


3.3 


SMB2_HUMAN 


DNA-BINDING 
PROTEIN SMUBP-2 
(GLIAL FACTOR- 11 
(GF-1) 


0.65 


428 


AF070636 


Homo sapiens 
clone 24686 
mRNA sequence 


5.00E-23 


<NONE> 


<NONE> 


<NONE> 


429 


<NONE> 


<NONE> 


<NONE> 


IQGA_HUMAN 


RAS GTPASE- 
ACTIVATING- 
LIKE PROTEIN 
IQGAP1 (PI 95) 


2.00E-06 


430 


AF068627 


Mus musculus 
DNA cytosine-5 
methyltransferase 
3B2 (Dnmt3b) 
mRNA, 
alternatively 
spliced, complete 
cds 


5.00E-04 


LOXl_LENCU 


LIPOXYGENASE 
(EC 1.13.11.12) 


9.9 


431 


AF020043 


Homo sapiens 
chromosome- 
associated 
polypeptide 


0 


YJH4_YEAST 


HYPOTHETICAL 
141.3 KD PROTEIN 
IN SCP160-MRPL8 
INTERGENIC 
REGION 


4.00E-16 


432 


K00046 

] 
i 


ross river virus 
26s subgenomic 
rna and junction 
region. 


0.12 


CUL2 HUMAN 

] 


CULLIN 

HOMOLOG2 

[CUL-2) 


7.4 
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SEQ 
in 
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DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


433 


AF005664 


Homo sapiens 
properdin (PFC) 
gene, complete 
cds 


0.005 


UL88_HCMVA 


PROTEIN UL88 


5.8 


434 


Z70705 


H.sapiens mRNA 
(fetal brain cDNA 
com5) 


2.00E-05 


PH87_YEAST 


INORGANIC 
PHOSPHATE 
TRANSPORTER 
PH087 


1.5 


435 


U29156 


Mus musculus 
epslSRmRNA, 
complete cds. 


e-125 


EP15_HUMAN 


EPIDERMAL 
GROWTH FACTOR 
RECEPTOR 
SUBSTRATE 
SUBSTRATE 15 
(PROTEIN EPS1 5) 
(AF- IP PROTEIN) 


1.00E-13 


436 


AE000750 


Aquifex aeolicus 
section 82 of 109 
of the complete 
genome 


0.37 


<NONE> 


<NONE> 


<NONE> 


437 


U49169 


Dictyostelium 
discoideum V- 
ATPase A subunit 
(vatA) mRNA, 
complete cds 


0.12 


VCAP_HSV6U 


MAJOR CAPSID 
PROTEIN (MCP) 


5.6 




AF032871 


Homo sapiens 
uncoupling 
protein 3 (UCP3) 
gene, exon 1 and 
partial exon 2 


0.13 


WEEl_SCHPO 


MITOSIS 
INHIBITOR 
PROTEIN KINASE 
WEE1 (EC 2.7.1.-) 


3.7 


439 


AB000425 


Porcine DNA for 
endopeptidase 
24.16, exon 16 
and complete cds 


4.00E-32 


<NONE> 


<NONE> 


<NONE> 


440 


U51037 


Mus musculus 11- 
zinc-finger 
transcription 
factor 


0.04 


<NONE> 


<NONE> 


<NONE> 


441 


AF032456 


Homo sapiens 
ubiquitin 
conjugating 
enzyme G2 


e-110 


<NONE> 


<NONE> 


<NONE> 


442 


AF009288 


Homo sapiens 
clone HEB8 Cri- 
du-chat region 
mRNA 


2.00E-14 


LMG INHUMAN 


LAMININ 
GAMMA- 1 CHAIN 
PRECURSOR 
(LAMININ B2 
CHAIN) 


8.1 



244 



WO 99/33982 



Table 2 



PCT/US98/27610 





Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


443 


AF024578 


Homo sapiens 
type-1 protein 
phosphatase 
skeletal muscle 
glycogen 
targeting subunit 
(PPPIR^ eene 
exon 4, and 
complete cds 


1.1 


<NONE> 


<NONE> 


<NONE> 


444 


M24486 


Human prolyl 4- 
hydroxylase alpha 
subunit mRNA, 
complete cds, 
clone PA-11. 


0 


DACHA 


<NONE> 


4.00E-58 


445 


X96400 


P.tetraurelia 
alpha-51Dgene 


0.37 


<NONE> 


<NONE> 


<NONE> 


446 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


447 


X84996 


X.laevis mRNA 
for selenocysteine 
tRNA acting 
factor (Staf) 


0.12 


POL_MLVRD 


POL 

POLYPROTEIN 
(PROTEASE (EC 
3.4.23.-); REVERSE 
TRANSCRIPTASE 
(EC 2.7.7.49); 
RIBONUCLEASEH 
(EC 3.1.26.4)) 


2.00E-08 


A AO 

448 


AF019980 


Dictyosteiium 
discoideum ZipA 
(zipA) gene, 
partial cds 


3.4 


HMDL_BRAFL 


HOMEOBOX 
PROTEIN DLL 
HOMOLOG 


0.23 




X78424 


D.carota (Queen 
Anne's Lace) 
Inv*Dc2 gene, 


0.38 


<NONE> 


<NONE> 


<NONE> 


450 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


451 


X89886 


P.patens mRNA 
for 5- 

aminoievulinate 
dehydratase 


1.1 


CKR6_HUMAN 


C-C CHEMOKINE 
RECEPTOR TYPE 6 
(C-C CKR-6) 
(CCR6) 


9.9 


452 


U67471 


Methanococcus 
jannascnu section 
13 of 150 of the 
complete genome 


0.12 


YR72_ECOLI 


HYPOTHETICAL 

(ORF2) (RETRON 
EC67) 


5.8 


453 


AF060246 


Mus musculus 
strain C57BL/6 
zinc finger protein 
106(Zfpl06) 
mRNA, H3a-a 
allele, complete 
cds 


1.00E-62 


YOJ8_CAEEL 


HYPOTHETICAL 
51.6 KD PROTEIN 
ZK353.8 IN 
CHROMOSOME III 


1.7 
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454 


U70667 


Human Fas-ligand 
associated factor 
1 mRNA, partial 
cds 


0 


YKB2JVEAST 


HYPOTHETICAL 
69.1 KD PROTEIN 
IN PUT3-CCE1 
INTERGENIC 
REGION 


3.00E-09 


455 


MVjojo 


Bos taurus 
recoverin mRNA, 
complete cds. 


A 1 C 

0.35 


GIDA_MYCGE 


GLUCOSE 
INHIBITED 
DIVISION 
PROTEIN A 


1.4 


456 




Methanococcus 
jannaschii section 
136 of 150 of the 
complete genome 


0.36 


<NONE> 


<NONE> 


<NONE> 


4j / 


X06747 


Human hnRNP 
core protein A 1 


3.00E-31 


<NONE> 


<NONE> 


<NONE> 


458 


Z65575 


H.sapiens CpG 
DNA, clone 47c5, 
reverse read 
cpg47c5.rtla . 


1.3 


<NONE> 


<NONE> 


<NONE> 


459 


X88893 


C.jacchus intron 4 
of visual pigment 
gene 


5.00E-15 


<NONE> 


<NONE> 


<NONE> 


460 


M57426 


Maize stripe virus 
RNA3 
nonstructural 
protein 


0.33 


DSC2_MOUSE 


DESMOCOLLIN 
2A/2B 

PRECURSOR 
(EPITHELIAL 
TYPE 2 

DESMOCOLLIN) 


6.5 


461 


aU163o 


Yeast TEF1 gene 
for elongation 
factor EF-1 alpha 


1.1 


PPOL_DROME 


POLY (ADP- 
RIBOSE) 

POLYMERASE (EC 
2.4.2.30) (PARP) 


3.5 


462 


M60064 


S.typhimurium 
glutamate 1- 
semialdehyde 
aminotransferase 
(hemL) gene, 
complete cds. 


1.1 


EPB4JMOUSE 


EPHRIN TYPE-B 
RECEPTOR 4 
PRECURSOR (EC 
2.7.1.1 12) KINASE 
2) (TYROSINE 
KINASE MYK-1) 


2.5 


463 


X51508 


Rabbit mRNA for 
aminopeptidase N 
(partial) 


0.36 


ACHG_XENLA 

] 


ACETYLCHOLINE 
RECEPTOR 
PROTEIN, GAMMA 
CHAIN 
PRECURSOR 


1.5 


464 


L10106 


Mus musculus 
protein tyrosine 
phosphate 
mRNA, complete 
;ds. 


2.00E-58 


VG13_BPML5 


GENE 13 PROTEIN 
:GP13) 


2.5 



246 



WO 99/33982 PCT/US98/27610 



Table 2 





Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 

ID 
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DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


465 


M77235 


Human cardiac 
tetrodotoxin- 
insensitive 
voltage-dependent 
sodium channel 
alpha subunit 
(HHl)mRNA, 
complete cds. 


3.8 


ZPBOC1 


<NONE> 


6.9 


466 


M58330 


C.maltosa 
autonomously 
replicating 
sequence. 


0.004 


EPB4_MOUSE 


EPHRIN TYPE-B 
RECEPTOR 4 
PRECURSOR (EC 
2.7.1.112) KINASE 
2) (TYROSINE 
KINASE MYK-1) 


2.4 


467 


X51508 


Rabbit mRNA for 
aminopeptidase N 
(partial) 


0.35 


ACHG_XENLA 


ACETYLCHOLINE 
RECEPTOR 
PROTEIN, GAMMA 
CHAIN 
PRECURSOR 


2.4 


468 


L10106 


Mus musculus 
protein tyrosine 
phosphate 
mRNA, complete 
cds. 


7.00E-59 


VGLI_PRVRI 


GLYCOPROTEIN 
GP63 PRECURSOR 


4.3 


469 


U65939 


Azotobacter 
vinelandii GTPase 
(ftsA) gene, 
partial cds, and 
ATP binding 
protein (ftsZ) 
gene, complete 
cds 


1.1 


TRUA_BACSP 


Q45557 bacillus sp. 
strain ksm-64). trna 
pseudouridine 
synthase a (ec 
4.2.1.70) 
(pseudouridylate 
synthase i) 
(pseudouridine 
synthase i) (uracil 
hydrolyase). 1 1/98 


0.001 


470 


U51037 


Mus musculus 11- 
zinc-finger 
transcription 
factor 


0.041 


<NONE> 


<NONE> 


<NONE> 


471 


M32685 


Human platelet 
glycoprotein Ilia, 
exon 14. 


3.6 


<NONE> 


<NONE> 


<NONE> 
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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


472 


U82691 


Phrynocephalus 
raddei CAS 
179770 NADH 
dehydrogenase 
subunit 1 (ND1), 
partial cds, tRNA- 
Gln, tRNA-Ile 
and tRNA-Met, 
NADH 

dehydrogenase 
subunit 2 tRNA- 
Cys and tRNA- 
Tyr and c... 


1.1 


<NONE> 


<NONE> 


<NONE> 


473 


D85430 


Mouse Murrl 
mRNA, exon 


0.12 


EPA5_CHICK 


EPHRIN TYPE-A 
RECEPTOR 5 
PRECURSOR (EC 
2.7.1.112) 


2.5 


474 


U20661 


Dictyostelium 
discoideum 
unknown internal 
repeat protein 
gene, complete 
cds, and unknown 
orfl,orf2and 
orO genes, partial 
cds 


0.36 


YHL1_EBV 


HYPOTHETICAL 
BHLF1 PROTEIN 


4.00E-04 


475 


X56537 


Human novel 
homeobox mRNA 
for a DNA 
binding protein 


0.04 


FA5_HUMAN 


COAGULATION 
FACTOR V 
PRECURSOR 
(ACTIVATED 
PROTEIN C 
COFACTOR) 


9.5 


4/0 


U32843 


Haemophilus 
influenzae Rd 
section 158 of 163 
of the complete 
genome 


5 


<NONE> 


<NONE> 


<NONE> 


477 


U67554 


Methanococcus 
iannaschii section 
96 of 150 of the 
complete genome 


0.36 


<NONE> 


<NONE> 


<NONE> 


478 


AB004244 


Narke japonica 
mRNA for Nj- 
synaphin lb, 
complete cds 


1.1 


NIAl_ORYSA 


NITRATE 
REDUCTASE 1 (EC 
1.6.6.1) (NR1) | 


1.00E-07 


479 


AF075079 


Homo sapiens full 
length insert 
cDNA YQ80A08 


1.00E-12 


<NONE> 


<NONE> 


<NONE> 
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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


480 


AE000723 


Aquifex aeolicus 
section 55 of 109 
of the complete 
genome 


i 


YKK0_YEAST 


HYPOTHETICAL 
67.5 KD PROTEIN 
IN APE1/LAP4- 
CWP1 

INTERGENIC 
REGION 


9.1 


481 


X73902 


H.sapiens mRNA 
for nicein B2 
chain 


0 


LMG2_HUMAN 


LAMIN1N 
GAMMA-2 CHAIN 
PRECURSOR 


3.00E-93 


482 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


3.00E-10 


P53_CRIGR 


CELLULAR 
TUMOR ANTIGEN 
P53 


5.7 


483 


ALO 10240 


Plasmodium 
falciparum DNA 
*** 

SEQUENCING 
IN PROGRESS 
*** from contig 
4-64, complete 
sequence 


1.2 


<NONE> 


<NONE> 


<NONE> 


484 


U49919 


Arabidopsis 
thaliana lupeol 
synthase mRNA, 
complete cds 


0.54 


YA53_SCHPO 


HYPOTHETICAL 
24.2 KD PROTEIN 
C13A11.03IN 
CHROMOSOME I 


6.00E-10 


485 


AF077618 


Homo sapiens 
p73 gene, exon 3 


0.39 


MYOD_MOUSE 


MYOBLAST 
DETERMINATION 
PROTEIN 1 


2.1 


486 


AF054994 


Homo sapiens 
clone 23832 
mRNA sequence 


0.13 


<NONE> 


<NONE> 


<NONE> 


487 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


3.00E-10 


<NONE> 


<NONE> 


<NONE> 


488 


AF068627 


Mus musculus 
DNA cytosine-5 
methyltransferase 
3B2 (Dnmt3b) 
mRNA, 
alternatively 
spliced, complete 
cds 


5.00E-04 


ACE2_YEAST 


METALLOTHIONE 
IN EXPRESSION 
ACTIVATOR 


1.5 


489 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


3.00E-07 


RINI_P1G 


RIBONUCLEASE 
INHIBITOR 


0.19 
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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


490 


L77886 


Human protein 
tyrosine 
phosphatase 
mRNA, complete 
cds 


1.00E-21 


VS48_TBRVS 


SATELLITE RNA 
48 KD PROTEIN 


1.6 


491 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


5.00E-04 


CRP3_LIMPO 


C-REACTIVE 
PROTEIN 3.3 
PRECURSOR 


3.5 


492 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


8.00E-08 


EPA5_CHICK 


EPHRIN TYPE-A 
RECEPTOR 5 
PRECURSOR (EC 
2.7.1.112) 


2.7 


493 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


3.O0E-O9 


<NONE> 


<NONE> 


<NONE> 


494 


U28153 


Caenorhabditis 
elegans UNC-76 
(unc-76) gene, 
complete cds. 


0.37 


<NONE> 


<NONE> 


<NONE> 


495 


U95094 


Xenopus laevis 
XL-INCENP . 
(XL-INCENP) 
mRNA, complete 
cds 


0.37 


NCPR_YEAST 


NADPH- 
CYTOCHROME 
P450 REDUCTASE 
(EC 1.6.2.4) (CPR) 


7.00E-05 


496 


U95102 j 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


0.013 


YMB3_CAEEL 


PROBABLE 
INTEGRIN ALPHA 
CHAIN F54G8.3 
PRECURSOR 


3.3 


497 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


7.00E-07 


<NONE> 


<NONE> 


<NONE> 


498 


U95094 


Xenopus laevis 
XI -INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


1.00E-10 


<NONE> 


<NONE> 


<NONE> 


499 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


2.00E-07 


VGLY_LYCVW 


GLYCOPROTEIN 

POLYPROTEIN 

PRECURSOR 

(CONTAINS: 

GLYCOPROTEINS 

Gl ANDG2) 


3.2 
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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


500 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


8.00E-06 


HR78JDROME 


NUCLEAR 

HORMONE 

RECEPTOR HR78 

(DHR78) 

(NUCLEAR 

RECEPTOR 

XR78E/F) 


2.5 


501 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


9.00E-10 


MYSH_BOVIN 


MYOSIN 1 HEAVY 
CHAIN-LIKE 
PROTEIN (MIHC) 
(BRUSH BORDER 
MYOSIN I) (BBMI) 


4.00E-04 


502 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


2.00E-04 


BAL_HUMAN 


BILE-SALT- 
ACTIVATED 
LIPASE 

PRECURSOR (EC 
3.1.1.3) (EC 
3.1.1.13)(BAL) 
(BILE-SALT- 
STIMULATED 
LIPASE) (BSSL) 
ESTERASE) 
(PANCREATIC 
LYSOPHOSPHOLIP 
ASE) 


2.6 


503 


AF080399 


Drosophila 
melanogaster 
mitotic 
checkpoint 
control protein 
kinase BUB1 
(Bubl) mRNA, 
complete cds 


1.1 


NAT1 JYEAST 


N-TERMINAL 
ACETYLTRANSFE 
RASE 1 (EC 
2.3.1.88) 


2.00E-23 


504 


U59706 


Gal I us gallus 
alternatively 
spliced AMPA 
glutamate 
receptor, isoform 
GluR2 flop, 
(GluR2) mRNA, 
partial cds. 


0.014 


<NONE> 


<NONE> 


<NONE> 


505 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


2.00E-05 


<NONE> 


<NONE> 


<NONE> 


506 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


2.00E-04 


<NONE> 


<NONE> 


<NONE> 



251 



WO 99/33982 



PCT/US98/27610 



Table 2 
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Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCbSSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


507 


AF100661 


Caenorhabditis 
elegans cosmid 
H20E11 


0.38 


<NONE> 


<NONE> 


<NONE> 


508 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


3.00E-11 


CA1A_HUMAN 


COLLAGEN 
ALPHA 1(X) 
CHAIN 
PRECURSOR 


0.024 


509 


U47322 


Cloning vector 
DNA, complete 
sequence. 


2.O0E-38 


COA1_SV40 


COAT PROTEIN 
VP I 


6.2 




AF031924 


Homo sapiens 
homeobox 
transcription 
factor barx2 


e-156 


CCMA_HAEIN 


HEME EXPORTER 
PROTEIN A 
(CYTOCHROME C- 
TYPE BIOGENESIS 
ATP-BINDING 
PROTEIN CCMA) 


3.5 


511 


AFO 10484 


Homo sapiens ICI 
YAC 9IA12, right 
end sequence 


3.00E-10 


<NONE> 


<NONE> 


<NONE> 


512 


Z63829 


H.sapiens CpG 
DNA, clone 90h2, 
forward read 
cpg90h2.ftla. 


5.00E-22 


NFIR_MESAU 


NUCLEAR 
FACTOR 1 CLONE 
PNF1/RED1 (NF-I) 
(CCAAT-BOX 
BINDING 
TRANSCRIPTION 
FACTOR) (CTF) 
(TGGCA-BINDING 
PROTEIN) 


2-4 


513 


235094 


H.sapiens mRNA 
for SURF-2 


5.00E-97 


SUR2_HUMAN 


SURFEIT LOCUS 
PROTEIN 2 


1.00E-46 


514 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


7.00E-06 


<NONE> 


<NONE> 


<NONE> 


515 


D38417 


Mouse mRNA for 
arylhydrocarbon 
receptor, 
complete cds 


e-154 


TEGUEBV 


LARGE 

TEGUMENT 

PROTEIN 


3.4 


516 


L10911 

( 


Homo sapiens 
splicing factor 
[CC1.4) mRNA, 
:omplete cds. 


e-117 


<NONE> 


<NONE> 


<NONE> 
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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


517 


XI 7093 


Human HLA-F 
gene for human 
leukocyte antigen 
F 


0.009 


YENl_SCHPO 


013695 

schizosaccharomyces 
pombe (fission 
yeast), hypothetical 
52.9 kd serine-rich 
protein cllg7.01 in 
chromosome i. 1 1/98 


5.4 


518 


AB017026 


Mus musculus 
mRNA for 
oxysterol-binding 
protein, complete 
cds 


0 


OXYB_HUMAN 


OXYSTEROL- 
BINDING PROTEIN 


1.00E-40 


519 


X55038 


Mouse mCENP-B 
gene for 
centromere 
autoantigen B 


0.001 


YNW7JYEAST 


HYPOTHETICAL 
68.8 KD PROTEIN 
IN URE2-SSU72 
INTERGENIC 
REGION 


3.00E-04 


520 


AB018323 


Homo sapiens 
mRNA for 
KIAA0780 
protein, partial 
cds 


3.00E-41 


LBR_CHICK 


LAMIN B 
RECEPTOR 


2.3 


521 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


1.00E-10 


CA25_HUMAN 


PROCOLLAGEN 
ALPHA 2(V) 
CHAIN 
PRECURSOR 


0.002 


522 


X03558 


Human mRNA 
br elongation 
factor 1 alpha 
subunit 


0 I 


EF I INHUMAN 


ELONGATION 
FACTOR 1 -ALPHA 
1 (EF-l-ALPHA-1) 


e-110 


523 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


3.00E-11 


YMT8_YEAST 


HYPOTHETICAL 
36.4 KD PROTEIN 
IN NUP116-FAR3 
INTERGENIC 
REGION 


8.00E-07 


524 


ABO 14591 


Homo sapiens 
mRNA for 
KIAA0691 
protein, complete 
cds 


0 


NOT2_YEAST 


GENERAL 
NEGATIVE 
REGULATOR OF 
TRANSCRIPTION 
SUBUNIT 2 


8.00E-05 


525 


AB019488 


Homo sapiens 
DNA for TRKA, 
exon 17 and 
complete cds 


0 


TRKA_HUMAN 


HIGH AFFINITY 

NERVE GROWTH 

FACTOR 

RECEPTOR 

PRECURSOR 

PROTEIN) (P140- 

rRKA) 


2.00E-27 
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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


1 ACCESSION 


DESCRIPTION 


P VALUE 


526 


U95102 
• 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


5.00E-15 


CNG4_BOVIN 


240K PROTEIN OF 
ROD 

PHOTORECEPTOR 
CNG-CHANNEL 
CYCLIC- 
NUCLEOTIDE- 
GATED CATION 
CHANNEL 4 (CNG 
CHANNEL 4) 
MODULATORY 
SUBUNIT)) 


0.018 


527 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


2.00E-06 


HMZIJDROME 


ZERKNUELLT 
PROTEIN 1 (ZEN-1) 


0.88 


528 


J03750 


Mouse single 
stranded DNA 
binding protein p9 
mRNA, complete 
cds. 


e-135 


P15_HUMAN 


ACTIVATED RNA 
POLYMERASE 11 
TRANSCRIPTIONA 
LCOACTIVATOR 
P15(PC4)(P14) 


3.00E-21 


529 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


1.00E-12 


RS5_DROME 


40S RIBOSOMAL 
PROTEIN S5 


0.42 


530 


Z57610 


H.sapiens CpG 
DNA, clone 
187a 10, reverse 
read 

cpgl87al0.rtla. 


8.00E-61 


HN3B_MOUSE 


HEPATOCYTE 
NUCLEAR 
FACTOR 3-BETA 
(HNF-3B) 


4.00E-15 


531 


U95760 


Drosophila 
melanogaster 
strawberry notch 
(sno) mRNA, 
complete cds 


3.00E-60 


<NONE> 


<NONE> 


<NONE> 


532 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


4.00E-11 


<NONE> 


<NONE> 


<NONE> 


533 


U50535 


Human BRCA2 
region, mRNA 
sequence CG006 


4.00E-12 


ALU INHUMAN 


!!!! ALU 
SUBFAMILY J 
WARNING ENTRY 
!!!! 


1.1 


534 


X92841 


H.sapiens MICA 
gene 


1.00E-55 


LIN INHUMAN 


LINE-1 REVERSE 
TRANSCRIPTASE 
HOMOLOG 


6.00E-09 
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1 Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


535 


U60337 


Homo sapiens 
beta-mannosidase 
mRNA, complete 
cds 


0 


NODC.BRAEL 


N- 

ACETYLGLUCOSA 
MINYLTRANSFER 
ASE (EC 2.4.1.-) 


1.4 


536 


M21731 


Human lipocortin- 
V mRNA, 
complete cds. 


e-169 


ANX5_HUMAN 


ANNEXIN V 
(LIPOCORTIN V) 
(ENDONEXIN II) 
(CALPHOBINDIN 
I) (CBP-I) 
(PLACENTAL 
ANTICOAGULANT 
PROTEIN 1) (PAP-I) 
ANTICOAGULANT 
-ALPHA) (VAC- 
ALPHA) 

(ANCHORINCII) 


1.00E-05 


537 


Y08013 


S.salar DNA 
segment 
containing GT 
repeat 


0.006 


<NONE> 


<NONE> 


<NONE> 


538 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


539 


M98502 


Mus musculus 
protein encoding 
twelve zinc finger 
proteins (pMLZ- 
4) mRNA, 
complete cds. 


2.00E-17 


DYNA_CHICK 


DYNACTIN, 117 
KD ISOFORM 


7.4 


540 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


6.00E-05 


HXA3_HAEIN 


HEMErHEMOPEXI 
N-BINDING 
PROTEIN 
PRECURSOR 


2.6 


541 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


1.00E-13 


AMO_KLEAE 


AMINE OXIDASE 

PRECURSOR (EC 

1.4.3.6) 

(MONAMINE 

OXIDASE) 

(TYRAMINE 

OXIDASE) 


1.5 


542 


AF083322 


Homo sapiens 
centriole 

associated protein 
CEPllOmRNA, 
complete cds 


e-133 


CA34_HUMAN 


PROCOLLAGEN 
ALPHA 3(IV) 
CHAIN 
PRECURSOR 


1.5 


543 


J03746 


Human 
glutathione S- 
transferase 
mRNA, complete 
cds. 


e-170 


GTMI_HUMAN 


GLUTATHIONES- 
TRANSFERASE, 
MICROSOMAL (EC 
2.5.1.18) 


5.00E-39 
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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 




T TATTOO 


ivieinanococcus 
iannaschii section 
64 of 150 of the 
complete genome 


ft 77 
U.J / 


A 1 A A TJIIMAXI 
A. 1 /\/\_Ji U 1V1 A IN 


A T D1J A 1 A 

ADRENERGIC 
RECEPTOR 




545 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


2.00E-07 


<NONE> 


<NONE> 


<NONE> 


546 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


547 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


548 


D87001 


Human (lambda) 
DNA for 
immunoglobulin 
light chain 


0.35 


VAL3_TYLCU 


AL3 PROTEIN (C3 
PROTEIN) 


3.2 


549 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


3.00E-08 


TEGU_HSV1 1 


LARGE 
TEGUMENT 
PROTEIN (VIRION 
PROTEIN UL36) 


0.004 


550 


D16991 


Human HepG2 
partial cDNA, 
clone 

hmd2d01m5 


8.00E-09 


PTM1_YEAST 


PROTEIN PTM1 
PRECURSOR 


0.033 


551 


M34025 


Human fetal Ig 
heavy chain 
variable region 


3.2 


<NONE> 


<NONE> 


<NONE> 


552 


M98502 


Mus musculus 
protein encoding 
twelve zinc finger 
proteins (pMLZ- 
4) mRNA, 
complete cds. 


5.00E-14 


<NONE> 


<NONE> 


<NONE> 


553 


U95098 j 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


0.002 


<NONE> 


<NONE> 


<NONE> 


554 


Z78730 


H.sapiens flow- 
sorted 

chromosome 6 
Hindlll fragment, 
SC6pA15C3 


3.00E-20 


ALU1_HUMAN 


!!!!ALU 
SUBFAMILY J 
WARNING ENTRY 
!!!! 


5.00E-06 


555 


U74496 


Human 

chromosome 4q35 

subtelomeric 

sequence 


8.00E-08 


ICP4_VZVD 


TRANS-ACTING 
TRANSCRIPTIONA 
L PROTEIN ICP4 


0.39 



256 



WO 99/33982 



Table 2 



PCT/US98/27610 





Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


556 


U39875 


Rattus norvegicus 
EF-hand Ca2+- 
binding protein 
p22 mRNA, 
complete cds. 


2.00E-56 


YHFK_ECOLI 


HYPOTHETICAL 
79.5 KD PROTEIN 
IN CRP-ARGD 
INTERGENIC 
REGION (0696) 


9.8 


557 


U65416 


Human MHC 
class I molecule 
(MICB) gene, 
complete cds 


0.12 


<NONE> 


<NONE> 


<NONE> 


558 


AG000037 


Homo sapiens 
genomic DNA, 
21 q region, clone: 
9H11A22 


5.00E-25 


<NONE> 


<NONE> 


<NONE> 


559 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


5.00E-05 


<NONE> 


<NONE> 


<NONE> 


560 


AB007918 


Homo sapiens 
mRNA for 
KIAA0449 
protein, partial 
cds 


0.015 


VGLE HSV11 


GLYCOPROTEIN E 
PRECURSOR 


2.2 


561 


U58884 


Mus musculus 
SH3-containing 
protein SH3P7 
mRNA, complete 
cds. similar to 
Human Drebrin 


1.00E-73 


YCV2_YEAST 


HYPOTHETICAL 
13.8 KD PROTEIN 
IN PWP2-SUP61 
INTERGENIC 
REGION 


2.6 


562 


AB007878 


Homo sapiens 
KIAA0418 
mRNA, complete 
cds 


e-110 


GLU2_MAIZE 


GLUTELIN 2 
PRECURSOR 
(ZEIN-GAMMA) 
(27 KD ZEIN) 


0.72 


563 


AF065482 


Homo sapiens 
sorting nexin 2 
(SNX2) mRNA, 
complete cds 


0 


YJD6_YEAST 


HYPOTHETICAL 
49.0 KD PROTEIN 
IN NSP1-KAR2 
INTERGENIC 
REGION 


1.4 


564 


U27873 


Stealth virus 1 
clone 3B11T7 


0.002 


SYN1 HUMAN 


SYNAPSINS IA 
AND IB (BRAIN 
PROTEIN 4.1) 


1.6 


565 


L38951 


Homo sapiens 
importin beta 
subunit mRNA, 
complete cds 


2.00E-68 


VP2_BRD 


STRUCTURAL 
CORE PROTEIN 
VP2 


1.1 


566 


AF007155 


Homo sapiens 
clone 23763 
unknown mRNA, 
partial cds 


e-165 


YOHI_AZOVI 


HYPOTHETICAL 
33.2 KD PROTEIN 
IN 1BPB 5'REGION 


7.5 
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567 


Z56295 


H. sapiens CpG 
DNA, clone 10c2, 
forward read 
cpgl0c2.ftla . 


0.12 


A1AB_CANFA 


ALPHA- IB 
ADRENERGIC 
RECEPTOR 
(FRAGMENT) 


0.85 


568 


Z83792 


G.gallus 
microsatellite 
DNA (LEI0222 


0.12 


<NONE> 


<NONE> 


<NONE> 


569 


U11820 


Feline 

immunodeficienc 
y virus 

USIL2489J7B 
gag polyprotein 

complete cds, 
polymerase 
polyprotein (pol) 
gene, partial cds, 
vif protein (vif), 
complete cds, and 
envelope 
glycoprotein 
(env), complete 
cds, complete g... 


LI 


<NONE> 


<NONE> 


<NONE> 


570 


Ml 8065 


Mouse 18S and 
28 S ribosomal 
DNA, 5' 

hypervariable ! 
(Vr) region, clone 
ML 


6.00E-04 


CC40_YEAST 


CELL DIVISION 
CONTROL 
PROTEIN 40 


3.7 


571 


AF053645 


Homo sapiens 
cellular apoptosis 
susceptibility 
protein (CSE1) 
gene, exons 3 
through 10 


2.00E-07 


YMQ4_CAEEL 


HYPOTHETICAL 
25.8 KD PROTEIN 
K02D10.4 IN 
CHROMOSOME III 


4.3 


•J /Z 


X04588 


Human 2.5 kb 
mRNA for 
cytoskeletal 
tropomyosin 
TM30(nm) 


0 


<NONE> 


<NONE> 


<NONE> 


573 


AC001 159 


(subclone 1 h9 
Tom PAC H92) 
DNA sequence 




YVNn PPI PI 


pxino i a npTA 

CfNLHJ- 1 ,4-DC 1 A- 

XYLANASED 
PRECURSOR (EC 
3.2.1.8) 




574 


Z60625 


H.sapiens CpG 
DNA, clone 2c 10, 
forward read 
cpg2c!0.ftlaa. 


4.00E-13 


<NONE> 


<NONE> 


<NONE> 


575 


AF070640 


iomo sapiens 
clone 24781 


e-164 


<NONE> 


<NONE> 


<NONE> 
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P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






mRNA sequence 










576 


Y11306 


Homo sapiens 

r> XT A iTr<r 

mRNA tor nTCF- 
4 


2.00E-48 


TCF1_HUMAN 


T-CELL-SPECIFIC 
TRANSCRIPTION 
FACTOR 1 (TCF-1) 


2.00E-15 


577 


X65279 


pWE15 cosmid 
vector DNA 


7.00E-69 


OCLN_POTTR 


Q28793 potorous 
tridactylus (potoroo). 
occludin. 1 1/98 


0.71 


J /o 


Ml 0296 


Mouse DNA with 
homology to EBV 
IR3 repeat, 
segment 1, clone 
Mu2. 


0.001 


LMB1_HYDAT 


LAM1NIN BETA-1 
CHAIN 
PRECURSOR 
(FRAGMENTS) 


1.9 


579 


X53744 


Canine mRNA for 
68kDA subunit of 
signal recognition 
particle (SRP68) 


e-162 


SR68_CANFA 


SIGNAL 
RECOGNITION 
PARTICLE 68 KD 
PROTEIN (SRP68) 


5.00E-16 


580 


AF086438 


Homo sapiens full 
length insert 
cDNA clone 
ZD80G1 1 


2.00E-04 


<NONE> 


<NONE> 


<NONE> 


581 


U15140 


Mycobacterium 
bovis ribosomal 
proteins IF-1 
complete cds, and 
S4 (rpsD) gene, 
partial cds 


1.3 


<NONE> 


<NONE> 


<NONE> 


582 


D 13292 


Human mRNA 
for ryudocan core 
protein 


e-166 


RSP4_ARATH 


40S RIBOSOMAL 
PROTEIN SA (P40) 
(LAMININ 
RECEPTOR 
HOMOLOG) 


1.4 


583 


S71022 


neoplasm-related 
CI 40 product 
human, thyroid 
carcinoma cells, 
mRNA, 670 nt] 


9.00E-30 


RL6_HUMAN 


60S RIBOSOMAL 
PROTEIN L6 (TAX- 
RESPONSIVE 
ENHANCER 
ELEMENT 
BINDING PROTEIN 
107) (TAXREB107) 


5.6 


584 


L20934 


Anopheles 
gambiae complete 
mitochondrial 
genome 


0.014 


<NONE> 


<NONE> 


<NONE> 


585 


Z49269 


H.sapiens gene 
for chemokine 
HCC-1. 


1.1 


AMY1_DICTH 

] 


ALPHA-AMYLASE 
1 (EC 3.2.1.1) (1,4- 
ALPHA-D- 
GLUCAN 

3LUCANOHYDRO 
LASE) 


2.5 
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DESCRIPTION 


P VALUE 


586 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


2.00E-04 


<NONE> 


<NONE> 


<NONE> 


587 


AF029893 


Homo sapiens i- 

beta-l,3-N- 

acetylglucosamin 

yltransferase 

mRNA, complete 

cds 


0.13 


HEMO_PIG 


HEMOPEXIN 
PRECURSOR 
(H Y ALURONI DA S 
E)(EC 3.2.1.35) 


3.5 


588 


J05109 


T.thermophila 
calcium-binding 
25 kDa (TCBP 
25) protein gene, 
complete cds. 


0.014 


<NONE> 


<NONE> 


<NONE> 


589 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


6.00E-04 


<NONE> 


<NONE> 


<NONE> 


590 


AF060246 


Mus musculus 
strain C57BL/6 
zinc finger protein 
106(Zfpl06) 
mRNA, H3a-a . 
allele, complete 
cds 


l.OOE-83 


SCRB_PEDPE 


SUCROSE-6- 
PHOSPHATE 
HYDROLASE (EC 
3.2.1.26) 
(SUCRASE) 


10 


591 


Y11966 


B.aphidicola (host 
T.suberi) plasmid 
pBTsl genes 
leuA, hspA, 
repA2, repAl, 
leuB, leuC, leuD, 
leuA 


0.37 


<NONE> 


<NONE> 


<NONE> 


592 


U20428 


Human SNC19 
mRNA sequence 


1.00E-64 


YY22JVIYCTU 


HYPOTHETICAL 
30.8 KD PROTEIN 
CY49.22 


0.29 


593 


AF043084 


Lycopersicon 
esculentum 
ethylene receptor 
homolog (ETR1) 
mRNA, complete 
cds 


0.37 


KNIR_DROME 


ZYGOTIC GAP 
PROTEIN KNIRPS 


9.9 


594 


X65279 


pWE15 cosmid 
vector DNA 


5.00E-66 


COA1_SV40 


COAT PROTEIN 
VP1 


0.001 


595 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 


0.041 


UL88_HSV7J 


PROTEIN U59 


5.8 
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SEQ 
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nF^PRIPTTOM 
JL/JDov^r\ir I 1 WIN 


P VAI I IF 
r VnLUL 




DfcbCKJPJ ION 


P VALUE 






cds 










596 


M91452 


Sus scrofa 
ryanodine 
receptor (RYR1) 
gene, complete 
cds. 


3.2 


<NONE> 


<NONE> 


<NONE> 


597 


U77327 


Human Ki-1/57 
intracellular 
antigen mRNA, 
partial cds 


e-158 


GAT1_CHICK 


ERYTHROID 
TRANSCRIPTION 
FACTOR (GATA-1) 
(ERYF1) 


1.2 


598 


U77327 


Human Ki-1/57 
intracellular 
antigen mRNA, 
partial cds 


0 


RPB7_ARATH 


DNA-DIRECTED 
RNA 

POLYMERASE II 
19 KD 

POLYPEPTIDE (EC 
2.7.7.6) (RNA 
POLYMERASE II 
SUBUNIT 5) 


6.2 


599 


Y16964 


Saccharomyces 
sp. mitochondrial 
DNA for OLI1 
gene, strain CID1 


0.37 


NMD5_YEAST 


NONSENSE- 
MEDIATED MRNA 
DECAY PROTEIN 5 


1.9 


600 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


6.00E-06 


<NONE> 


<NONE> 


<NONE> 


601 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


8.00E-08 


<NONE> 


<NONE> 


<NONE> 


602 


AF091046 


Brugia pahangi 
nuclear hormone 
receptor (bhr-1) 
gene, partial cds 


1.1 


INVO_PONPY 


NVOLUCRIN 


0.23 


603 


M87339 


Human 

replication factor 

J /-KU2L SUDUnil 

mRNA, complete 
cds 


0 


AC12 HUMAN 

i 
( 


ACTIVATOR 1 37 
KD SUBUNIT 
REPLICATION 
FACTOR C 37 KD 
SUBUNIT) (A 1 37 
KD SUBUNIT) (RF- 
2 37 KD SUBUNIT) 
;RFC37) 


1.00E-38 


604 


D28116 

( 
< 
( 


Human genes for 
sollagen type IV 
ilpha 5 and 6, 
jxon 1 and exon 


0.39 


<NONE> 


<NONE> 


<NONE> 
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r 










605 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


2.00E-06 


<NONE> 


<NONE> 


<NONE> 


606 


AE001149 


Borrelia 
burgdorferi 
(section 35 of 70) 
of the complete 
genome 


0.13 


<NONE> 


<NONE> 


<NONE> 


607 


X14168 


Human pLC46 
with DNA 
replication origin 


6.00E-16 


Z136_HUMAN 


ZINC FINGER 
PROTEIN 136 


0.31 


608 


Z57610 


H. sapiens CpG 
DNA, clone 
187al0, reverse 
read 

cpgl87al0.rtla. 


7.00E-90 


HN3BJRAT 


HEPATOCYTE 
NUCLEAR 
FACTOR 3-BETA 
(HNF-3B) 


1.00E-19 


609 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


0.043 


PGCV_MOUSE 


VERSICAN CORE 

PROTEIN 

PRECURSOR 

(LARGE 

FIBROBLAST 

PROTEOGLYCAN) 

(CHONDROITIN 

SULFATE 

PROTEOGLYCAN 

CORE PROTEIN 2) 

(PG-M) 


3.5 


610 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


7.00E-07 


CA1 1_CHICK 


PROCOLLAGEN 
ALPHA 1(1) CHAIN 
PRECURSOR 


0.4 


611 


AB007956 


Homo sapiens 
mRNA, 
chromosome 1 
specific transcript 
KIAA0487 


e-106 


RRPB_CVMA5 


RNA-D1RECTED 
RNA 

POLYMERASE (EC 
2.7.7.48) (ORF1B) 


9.7 


612 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


0.005 


<NONE> 


<NONE> 


<NONE> 
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613 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


6.00E-05 


UL52_EBV 


HELICASE/PRIMA 
SE COMPLEX 
PROTEIN 
(PROBABLE DNA 
REPLICATION 
PROTEIN BSLF1) 


5.9 


614 


U95760 


Drosophila 
melanogaster 
strawberry notch 
(sno) mRNA, 
complete cds 


3.00E-71 


POLG_PVYHU 


GENOME 
POLYPROTEIN 
(CONTAINS: N- 
TERMINAL 
PROTEIN; HELPER 
COMPONENT 
PROTEINASE (EC 
3.4.22.-) (HC-PRO); 
42-50 KD 
PROTEIN; 
CYTOPLASMIC 
INCLUSION 
PROTEIN (CI); 6 
KD PROTEIN; 
NUCLEAR 
INCLUSION 
PROTEIN A (NI- A) 
(EC 3.4.22.-) (49K 
PROTEINASE) (49 


4.3 


615 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


9.00E-09 


VP3_ROTPC 


INNER CORE 
PROTEIN VP3 


7.7 


616 


J05499 


Rattus norvegicus 
L-glutamine 
amidohydrolase 
mRNA, complete 
cds 


e-143 


GLSL_RAT 


GLUTAMINASE, 
LIVER ISOFORM 
PRECURSOR (EC 
3.5.1.2) (GLS) 


7.00E-67 


617 


Ml 9262 


Rat clathrin light 
chain (LCB3) 
mRNA, complete 
cds. 


0.37 


Y642_METJA 


HYPOTHETICAL 
PROTEIN MJ0642 


5.8 


618 


M2U91 


Human aldolase 
pseudogene 
mRNA, complete 
cds. 


1.00E-32 


LINl_NYCCO 


LINE-1 REVERSE 
TRANSCRIPTASE 
HOMOLOG 


6.00E-17 


619 


. U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


1.00E-11 


NUCM_BOVIN 


NADH- 
UBIQUINONE 
OXIDOREDUCTAS 
E49 KD SUBUNIT 
(EC 1.6.5.3) (EC 
1.6.99.3) 
(COMPLEX 1- 
49KD) (CI-49KD) 


0.044 
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P VALUE 
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DESCRIPTION 


P VALUE 


620 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


0.005 


HEMZ_RHOCA 


FERROCHELATAS 
E(EC 4.99.1.1) 
(PROTOHEME 
FERRO-LYASE) 


4.4 


621 


AF041428 


Homo sapiens 
ribosomal protein 
s4 X isoform 
gene, complete 
cds 


0.002 


<NONE> 


<NONE> 


<NONE> 


622 


X07158 


Chironomus 
thummi DNA for 
Cla repetitive 
element 


0.13 


<NONE> 


<NONE> 


<NONE> 


623 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


8.00E-04 


<NONE> 


<NONE> 


<NONE> 


624 


AF100470 


Rattus norvegicus 
ribosome attached 
membrane protein 
4 (RAMP4) 
mRNA, complete 
cds 


1.00E-53 


<NONE> 


<NONE> 


<NONE> 


625 


U85193 


Human nuclear 
factor I-B2 
(NFIB2) mRNA, 
complete cds 


2.00E-38 


<NONE> 


<NONE> 


<NONE> 


626 


Ml 3452 


Human lamin A 
mRNA, 3'end. 


6.00E-16 


<NONE> 


<NONE> 


<NONE> 


627 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


0.014 


ACDV_RAT 


ACYL-COA 
DEHYDROGENAS 
E, VERY-LONG- 
CHAIN SPECIFIC 
PRECURSOR (EC 
1.3.99.-) (VLCAD) 


4.00E-20 


628 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


3.00E-10 


<NONE> 


<NONE> 


<NONE> 


629 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


630 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


2.00E-05 


<NONE> 


<NONE> 


<NONE> 
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631 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


6.00E-05 


| <NONE> 


<NONE> 


<NONE> 


632 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


6.00E-05 


YS83_CAEEL 


HYPOTHETICAL 
86.9 KD PROTEIN 
ZK945.3 IN 
CHROMOSOME II 


0.65 


633 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


3.00E-09 


NRP_MOUSE 


NEUROPIL1N 
PRECURSOR (A5 
PROTEIN) 


2.7 


634 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


2.00E-05 


Y4JN_RHISN 


HYPOTHETICAL 
16.3 KD PROTEIN 
Y4JN 


5.9 


635 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


6.00E-05 


<NONE> 


<NONE> 


<NONE> 


636 


X64707 


H.sapiens BBC1 
mRNA 


e-179 


RL13_HUMAN 


60S RIBOSOMAL 
PROTEIN L13 
(BREAST BASIC 
CONSERVED 
PROTEIN 1) 


5.00E-40 


637 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


3.00E-08 


<NONE> 


<NONE> 


<NONE> 


638 


X14168 


Human pLC46 
with DNA 
replication origin 


5.00E-14 


SP3_HUMAN 


TRANSCRIPTION 
FACTOR SP3(SPR- 
2) (FRAGMENT) 


0.19 


639 


X90999 


H.sapiens mRNA 
for Glyoxalase II 


9.00E-20 


GL02_HUMAN 


HYDROXY ACYLG 
LUTATHIONE 
HYDROLASE (EC 
3.1.2.6) 


0.007 


640 


AF083322 


Homo sapiens 
centriole 

associated protein 
CEPllOmRNA, 
complete cds 


9.00E-51 


KIF4_MOUSE 


KINESIN-LIKE 
PROTEIN KIF4 


0.005 
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Z12002 


M.musculusPvt-1 
mRNA. 


0.36 


CP5F_CANTR 


CYTOCHROME 
P450 LIIA6 
(ALKANE- 
INDUCIBLE)(EC 
1.14.14.1) (P450- 
ALK3) 


5.6 


642 


Ml 0206 


R.sphaeroides 
reaction center L 
subunit (complete 
cds) and M 
subunit (5' end) 
genes. 


1.1 


YGR1_YEAST 


HYPOTHETICAL 
34.8 KD PROTEIN 
1NSUT1-RCK1 
INTERGENIC 
REGION 


0.006 


643 


K02668 


E. coli ddl gene 
encoding D- 
alanine:D-alanine 
ligase and ftsQ 
and ftsA genes, 
complete cds, and 
ftsZ gene, 5' end. 


3.3 


ANKB_HUMAN 


ANKYR1N, BRAIN 
VARIANT 1 
(ANKYRINB) 
(ANKYRIN, 
NONERYTHROID) 


7.00E-07 


644 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


<NONE> 


645 


X53616 


C.domesticus 
calnexin (pp90) 
mRNA 


1.1 


<NONE> 


<NONE> 


<NONE> 


646 


X57010 


Human COL2A1 
gene for collagen 
II alpha 1 chain, 
exonsE2-E15 


3.3 


PRIO_PIG 


MAJOR PRION 
PROTEIN 

PRECURSOR (PRP) 


1.9 


647 


U95097 


Xenopus laevis 
mitotic 

phosphoprotein 
43 mRNA, partial 
cds 


1.1 


UL07_HSV2H 


PROTEIN UL7 


7.3 


648 


X52956 


Human CAMII- 
psi3 calmodulin 
retropseudogene 


0.37 


PRTP_EBV 


PROBABLE 
PROCESSING AND 
TRANSPORT 
PROTEIN 


7.5 


649 


M93425 


Human protein 

tyrosine 

phosphatase 

(PTP-PEST) 

mRNA, complete 

cds. 


0 


PTNC_HUMAN 


PROTEIN- 
TYROSINE 
PHOSPHATASE Gl 
(EC 3.1.3.48) 
(PTPG1) 


e-107 


650 


L47615 


Mus musculus 
DNA-binding 
protein (Fli-1) 
gene, 5' end of 
cds. 


0.13 


YA53_SCHPO 


HYPOTHETICAL 
24.2 KD PROTEIN 
C13A11.03IN 
CHROMOSOME I 


2.00E-07 


651 


U60337 


Homo sapiens 
beta-mannosidase 
mRNA, complete 


0 


GIL1_ENTHI 


GALACTOSE- 
INHIBITABLE 
LECTIN 170KD 


0.22 
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SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






cds 






SUBUNIT 




652 


U08813 


Oryctolagus 

cuniculus 

Na+/glucose 

cotransporter- 

related protein 

mRNA, complete 

cds. 


t.OOE-22 


NAG1_HUMAN 


SODIUM/GLUCOS 
E 

COTRANSPORTER 
1 (NA(+)/GLUCOSE 
COTRANSPORTER 
1)(HIGH 
AFFINITY 
SODIUM- 
GLUCOSE 
COTRANSPORTER 
) 


0.1 


653 


Y00282 


Human mRNA 
for ribophorin II 


2.00E-78 


RIB2_HUMAN 


DOLICHYL- 
DIPHOSPHOOLIG 
OSACCHA RIDE- 
PROTEIN 

GLYCOSYLTRANS 
FERASE 63 KD 
SUBUNIT 
PRECURSOR (EC 
2.4.1.119) 
(RIBOPHORIN II) 


5.00E-19 


654 


DI0051 


Human gene for 
92-kDa type IV 
collagenase, 5 - 
flanking region 


0.014 


TAGBJDICDI 


PRESTALK- 
SPECIFIC 
PROTEIN TAGB 
PRECURSOR (EC 
3.4.21.-) 


7.6 


655 


M29930 


Human insulin 
receptor (allele 2) 
gene, exons 14, 
15, 16 and 17. 


8.00E-08 


<NONE> 


<NONE> 


<NONE> 


656 


U78310 


Homo sapiens 
pescadillo 
mRNA, complete 
cds 


0 


YG2S_YEAST 


HYPOTHETICAL 
69.9 KD PROTEIN 
IN MIC1-SRB5 
INTERGENIC 
REGION 


0.002 


657 


X68792 


S.coelicolor 
A3(2) promoter 
sequence pth270 


3.2 


YBS0_YEAST 


HYPOTHETICAL 
27.0 KD PROTEIN 
IN VAL1-HSP26 
INTERGENIC 
REGION 


0.073 


658 


U50535 


Human BRCA2 
region, mRNA 
sequence CG006 


4.00E-12 


ALU INHUMAN 


!!!! ALU 
SUBFAMILY J 
WARNING ENTRY 
!!!! 


1.2 
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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


659 


U15522 


Sus scrofa clone 
pvgla Ig heavy 
chain variable 
VDJ region 
mRNA, partial 
cds. 


3.2 


Z165_HUMAN 


ZINC FINGER 
PROTEIN 165 


3.2 


660 


M20918 


C.thummi piger 
haemoglobin (Hb) 
gene DNA, 
complete cds. 


0.12 


YT25_CAEEL 


HYPOTHETICAL 
59.9 KD PROTEIN 
B0304.5 IN 
CHROMOSOME II 


0.033 


661 


U60337 


Homo sapiens 
beta- m annosi dase 
mRNA, complete 
cds 


0 


<NONE> 


<NONE> 


<NONE> 


662 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


0.001 


ENVJvILVFP 


ENV 

POLYPROTEIN 
PRECURSOR 
(CONTAINS: 
KNOB PROTEIN 
GP70; SPIKE 
PROTEIN P15E;R 
PROTEIN) 


3.3 


663 


M97287 


Human 

MAR/SARDNA 
binding protein 
(SATBl)mRNA, 
complete cds. > :: 
gb|I58691|I58691 
Sequence 1 from 
patent US 
5652340 


0 


SAT INHUMAN 


DNA-BINDING 
PROTEIN SATB1 
(SPECIAL AT-RICH 
SEQUENCE 
BINDING PROTEIN 
1) 


2.00E-20 


664 


L42612 


Homo sapiens 
keratin 6 isoform 
K6f (KRT6F) 
mRNA, complete 
cds 


e-168 


K2C4_BOVIN 


KERATIN, TYPE II 
CYTOSKELETAL 
59 KD, 

COMPONENT IV 


4.00E-10 


665 


U17901 


Rattus norvegicus 
phospholipase A- 
2-activating 
protein (plap) 
mRNA, complete 
cds. 


e-152 


PLAPJUOUSE 


PHOSPHOLIPASE 
A-2-ACTIVATING 
PROTEIN (PLAP) 


4.00E-13 


666 


M73047 


Homo sapiens 
tripeptidyl 
peptidase II 
mRNA, complete 
cds. 


0 


MERT_STRLI 


MERCURIC 

TRANSPORT 

PROTEIN 

(MERCURY ION 

TRANSPORT 

PROTEIN) 


4.4 
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Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


667 


U09954 


Human ribosomal 
protein L9 gene, 
5 ! region and 
complete cds. 


0 


RL9_HUMAN 


60S RIBOSOMAL 
PROTEIN L9 


2.00E-1 1 


668 


X98330 


H.sapiens mRNA 
for ryanodine 
receptor 2 


1.1 


HS74_MOUSE 


HEAT SHOCK 70 
KD PROTEIN AGP- 
2 


0.034 


669 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


0.002 


RPC2J>ROME 


DNA-DIRECTED 
RNA 

POLYMERASE III 
128 KD 

POLYPEPTIDE 


1.1 


670 


AF069250 


Homo sapiens 

okadaic acid- 

inducible 

phosphoprotein 

(OA48-18) 

mRNA, complete 

cds 


7.00E-80 


LEGB_PEA 


LEGUMIN B 
(FRAGMENT) 


0.011 


671 


Z71419 


S cerevisiae 
chromosome XIV 
reading frame 
ORF YNL143c 


1.1 


FOCDJECOLI 


OUTER 
MEMBRANE 
USHER PROTEIN 
FOCD 

PRECURSOR 


9.7 


672 


AF044965 


Homo sapiens 
polio virus related 
protein 2 gene, 
alpha isoform, 
exon 6 and partial 
cds 


e-167 


PVR_MOUSE 


POLIOVIRUS 
RECEPTOR 
HOMOLOG 
PRECURSOR 


1.00E-12 


673 


X65319 


Cloning vector 
pCAT-Enhancer 


2.00E-80 


S106_HUMAN 


CALCYCLIN 
(PROLACTIN 
RECEPTOR 
ASSOCIATED 
PROTEIN) 
CALCIUM- 
BINDING PROTEIN 
A6) 


3.00E-15 


674 


D29655 


Pig mRNA for 
UMP-CMP 
kinase, complete 
cds 


e-103 


V319_ASFB7 


J3 19 PROTEIN 


4.3 


675 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


8.00E-08 


VEGR_RAT 


VASCULAR 
ENDOTHELIAL 
GROWTH FACTOR 
RECEPTOR 1 
PRECURSOR 
RECEPTOR FLT) 
(FLT-1) 


3.3 
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SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


676 


D90217 


S. cerevisiae gene 

forYmL33, 

mitochondrial 

ribosomal 

proteins of large 

subunit 


2.00E-07 


MALY_ECOLI 


MALY PROTEIN 
(EC 2.6.1.-) 


5.6 


677 


AF038952 


Homo sapiens 
cofactor A protein 
mRNA, complete 
cds 


e-160 


TlCA_MOUSE 


TCP1- 

CHAPERONIN 
COFACTOR A 


4.00E-19 


678 


Z96950 


Gorilla eorilla 

\JV4 lilt* tvl HIU 

DNA sequence 
orthologous to the 
human Xp:Yp 
telomere-junction 
region 


5.00E-14 


YHBZ_ECOLI 


HYPOTHETICAL 
43.3 KDGTP- 
BINDING PROTEIN 
IN DACB-RPMA 
INTERGENIC 
REGION (F390) 


3.3 


679 


D50418 


Mouse mRNA for 
AREC3, partial 
cds 


2.00E-79 


CYGX_RAT 


OLFACTORY 
GUANYLYL 
CYCLASE GC-D 
PRECURSOR (EC 
4.6.1.2) 


1.1 


680 


U95098 


Xenopus laevis 
mitotic 

nhosnhoorotein 

LSI IVtf wllVlrJ V Ivil 1 

44 mRNA, partial 
cds 


8.00E-08 


P2C2_SCHPO 


PROTEIN 

PHOSPHATASE 2C 
HOMOLOG 2 (EC 
3.1.3.16) 


1.00E-04 


681 


AL010280 


Plasmodium 
falciparum DNA 
*** 

SEQUENCING 
IN PROGRESS 
*** from contig 
4-106, complete 
sequence 


0.12 


<NONE> 


<NONE> 


<NONE> 


682 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


5.00E-04 


VSM2_TRYBB 


VARIANT 

SURFACE 

GLYCOPROTEIN 

MITAT1.2 

PRECURSOR (VSG 

221) 


4.3 


683 


U00238 


Homo sapiens 
glutamine PRPP 
amidotransferase 
(GPAT) mRNA, 
complete cds 


0 


<NONE> 


<NONE> 


<NONE> 


684 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


0.005 


PRPR_SALTY 


PROPIONATE 

CATABOLISM 

OPERON 

REGULATORY 

PROTEIN 


1.5 
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Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


685 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


7.00E-07 


YAND_SCHPO 


HYPOTHETICAL 
30.4 KD PROTEIN 
C3H1.13IN 
CHROMOSOME I 


0.38 


686 


D25538 


Human mRNA 
for KIAA0037 
gene, complete 
cds 


0 


<NONE> 


<NONE> 


<NONE> 


687 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
eomnlete cds 


2.00E-07 


A1AA_RAT 


ALPHA-1A 
ADRENERGIC 
RECEPTOR (RA42) 


4.4 


688 


L26956 


Mesocricetus 
auratus stearyl- 
CoA desaturase 
sequence 

including male 

hormone 
dependent gene 
derived from 
hamster 
frankorgan 


4.00E-33 


<NONE> 


<NONE> 


<NONE> 


689 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


3.00E-10 


<NONE> 


<NONE> 


<NONE> 


690 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


3.00E-09 


Y093_CAEEL 


HYPOTHETICAL 
58.5 KD PROTEIN 
T20B12.3 IN 
CHROMOSOME III 


2.00E-08 


691 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


8.00E-09 


<NONE> 


<NONE> 


<NONE> 


692 


ABO 17026 


Mus musculus 
mRNA for 
oxysterol-binding 
protein, complete 
cds 


0 


OXYB_RABIT 


OXYSTEROL- 
BINDING PROTEIN 


1.00E-34 


693 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


6.00E-04 


UF02_MAIZE 


FLAVONOL 3-0- 
GLUCOSYLTRANS 
FERASE (EC 
2.4.1.91) 


3.1 
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SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


694 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


5.00E-04 


<NONE> 


<NONE> 


<NONE> 


695 


U34954 


Caenorhabditis 
elegans 
cyclophilin 
isoform 10 


5.00E-24 


CYPA_CAEEL 


PEPTIDYL- 
PROLYL CIS- 
TRANS 

ISOMERASE 10 
(EC 5.2.1.8) 


2.00E-29 


696 


AB011167 


Homo sapiens 
mRNA for 
K1AA0595 
protein, partial 
cds 


0 


RFX5_HUMAN 


BINDING 

REGULATORY 

FACTOR 


2.1 


697 


U03886 


Human GS2 
mRNA, complete 
cds. 


2.00E-28 


SKDl_MOUSE 


SKD1 PROTEIN 


4.00E-17 


698 


AF086275 


Homo sapiens full 
length insert 
cDNA clone 
ZD45C02 


3.00E-41 


SPT7_YEAST 


TRANSCRIPTIONA 
L ACTIVATOR 
SPT7 


0.82 


699 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


3.00E-10 


CA1EHUMAN 


COLLAGEN 
ALPHA 1(XV) 
CHAIN 
PRECURSOR 


1.1 


700 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


4.00E-1 1 


E434_ADECC 


Q65962 canine 
adenovirus type 1 
(strain ell), early e4 
31 kd protein. 11/98 


4.4 


701 


LI 7340 


Drosophiia 
melanogaster 
germline 
transcription 
factor gene, 
complete cds. 


3.3 


CISYJTETTH 


CITRATE 
SYNTHASE, 
MITOCHONDRIAL 
PRECURSOR (EC 
4.1.3.7) (14 NM 
FILAMENT- 
FORMING 
PROTEIN) 


9.7 


702 


X58170 


Mmusculus 
mRNA fort- 
Complex Tcp- 10a 
gene 


2.00E-45 


PME2_LYCES 


PECTINESTERASE 
2 PRECURSOR (EC 
3.1.1.11)(PECTIN 
METHYLESTERAS 
E) (PE 2) 


7.4 


703 


Z96207 


H. sapiens ! 
telomeric DNA 1 
sequence, clone 
12PTEL049,read 
12PTELOO049.se 


8.00E-08 


<NONE> 


<NONE> 


<NONE> 
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SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


a rpccc TAXI 




r VALUL 






q 










704 


X58430 


Human Hoxl.8 
gene 


e-146 


HXAA_HUMAN 


HOMEOBOX 
PROTEIN HOX-AI0 
(HOX-1H) (HOX- 
1.8) (PL) 


4.00E-05 


705 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


6.00E-06 


YN39_SYNP7 


HYPOTHETICAL 
9.2 KD PROTEIN 
IN CYST-CYSR 
INTERGENIC 
REGION (ORF 81) 


0.89 


706 


\Jyj\Jy £ t 


.AcnopUb IdCVlD 

XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


1 .WL" 1 J 


1V1 I Oil Dv V UN 


MYOSIN I HFAVY 
CHAIN-LIKE 
PROTEIN (MIHC) 
(BRUSH BORDER 
MYOSIN I) (BBMI) 


0.001 


707 


Nyf 1QQ£1 


il Ul 11 all 

cytochrome c 
oxidase subunit 
Vb(coxVb) 
mRNA, complete 




OTHIJ5B 


<NONE> 


3 OOE-30 


708 


X68380 


M.musculus gene 
for cathepsin D, 
exon3 


5.00E-04 


42_MOUSE 


ERYTHROCYTE 
MEMBRANE 
PROTEIN BAND 
4.2 (P4.2) 
(PALLIDIN) 


9.9 


709 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


1.00E-11 


TCPA_DROME 


T-COMPLEX 
PROTEIN 1, 
ALPHA SUBUNIT 
(TCP- 1 -ALPHA) 


4.3 


710 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


3.00E-10 


<NONE> 


<NONE> 


<NONE> 


711 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


4.00E-12 


<NONE> 


<NONE> 


<NONE> 


712 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


0.002 


<NONE> 


<NONE> 


<NONE> 
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SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


713 


AB018323 


Homo sapiens 
mRNA for 
KIAA0780 
protein, partial 
cds 


3.00E-41 


LBR_CHICK 


LAMIN B 
RECEPTOR 


3.4 


714 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


6.00E-06 


YM8L_YEAST 


HYPOTHETICAL 
71.1 KD PROTEIN 
IN DSK2-CAT8 
INTERGENIC 
REGION 


3.O0E-08 


715 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


4.00E-13 


PSCDROME 


POSTERIOR SEX 
COMBS PROTEIN 

^ 


0.6 


716 


L28101 


Homo sapiens 
kallistatin (PI4) 
gene, exons 1-4, 
complete cds 


7.00E-07 


IRKXRAT 


INWARD 
RECTIFIER 
POTASSIUM 
CHANNEL BIR9 
(KIR5.1) 


5.4 


717 


AC001038 


Homo sapiens 
(subclone 2 h2 
from PI H49) 
DNA sequence 


8.00E-09 


MGMT_YEAST 


METHYLATED- 
DNA— PROTEIN- 
CYSTEINE 
METHYLTRANSFE 
RASE 


0.48 


718 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


1.00E-11 


YWDE_BACSU 


HYPOTHETICAL 
19.9 KD PROTEIN 
IN SACA-UNG 
INTERGENIC 
REGION 
PRECURSOR 


1.8 


719 


U01139 


Mus musculus 
B6D2F1 clone 
2C1 IB mRNA. 


e-110 


GSC_DROME 


HOMEOBOX 

PROTEIN 

GOOSECOID 


7.2 


720 


AB017430 


Homo sapiens 
mRNA for 
kinesin-like DNA 
binding protein, 
complete cds 


0 


YBAV_ECOLI 


HYPOTHETICAL 
12.7 KD PROTEIN 
IN HUPB-COF 
INTERGENIC 
REGION 


0.17 


721 


U95094 | 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


0.001 


CPCF_SYNP2 


PHYCOCYANOBIL 
IN LYASE BETA 
SUBUNIT(EC4.-.-.- 
) 


2.4 


722 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


9.00E-10 


<NONE> 


<NONE> 


<NONE> 
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SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


723 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


0.04 


YKK7_CAEEL 


HYPOTHETICAL 
54.9 KD PROTEIN 
C02F5.7 IN 
CHROMOSOME III 


0.057 


724 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


8.00E-08 


H5_CAIMO 


HISTONE H5 


0.39 


725 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


3.00E-09 


DED1_YEAST 


PUTATIVE ATP- 
DEPENDENTRNA 
HELICASE DED1 


0.5 


726 


J04617 


Human elongation 
factor EF-1 -alpha 
gene, complete 
cds. > :: 

dbj|E02629|E0262 
9DNAofhuman 
polypeptide chain 
elongation factor- 
1 alpha 


5.00E-36 


ALU7_HUMAN 


!!!! ALU 

SUBFAMILY SQ 
WARNING ENTRY 
!!!! 


0.84 


727 


X54859 


Porcine TNF- 
alphaandTNF- 
beta genes for 
tumour necrosis 
factors alpha and 
beta, respectively. 


3.3 


Z165_HUMAN 


ZINC FINGER 
PROTEIN 165 


5.6 


728 


D49911 


Thermus j 
thermophilus 
UvrA gene, 
complete cds 


0.014 


CC48_CAPAN 


CELL DIVISION j 
CYCLE PROTEIN 
48 HOMOLOG 


9.9 


729 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


2.00E-06 


CA25_HUMAN 


PROCOLLAGEN 
ALPHA 2(V) j 
CHAIN 
PRECURSOR 


0.011 


730 


D15057 


Human mRNA 
for DAD- 1, 
complete cds 


0 


DAD1_HUMAN 


DEFENDER 
AGAINST CELL 
DEATH 1 (DAD- 1) 


8.00E-16 


731 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


6.00E-06 


ANFD_RHOCA 


NITROGENASE | 
IRON-IRON 
PROTEIN ALPHA 
CHAIN (EC 
1.18.6.1) 

(NITROGENASE 
COMPONENT I) 
(DINITROGENASE 


9.6 
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Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 












) 




732 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


7.00E-07 


EFTUCHLVI 


ELONGATION 
FACTOR TU (EF- 
TU) 


2.5 


733 


ABO 18335 


Homo sapiens 
mRNA for 
KIAA0792 
protein, complete 
cds 


0 


TRYM_RAT 


MAST CELL 
TRYPTASE 
PRECURSOR (EC 
3.4.21.59) 


5.6 


734 


X98743 


H.sapiens mRNA 
for RNA helicase 
(Myc-regulated 
dead box protein) 


0.04 


<NONE> 


<NONE> 


<NONE> 


735 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


2.00E-07 


<NONE> 


<NONE> 


<NONE> 


736 


Z49314 


S.cerevisiae 
chromosome X 
reading frame 
ORF YJL039c 


3.2 i 


<NONE> 


<NONE> 


<NONE> 


737 


D12646 


Mouse kif4 
mRNA for 
microtubule- 
based motor 
protein KIF4, 
complete cds 


0 


KIF4_MOUSE 


KINESIN-LIKE 
PROTEIN KIF4 


2.00E-76 


738 


J04038 


Human 

glyceraldehyde-3- 

phosphate 

dehydrogenase 


2.00E-47 


SDC1_HUMAN 


SYNDECAN-1 
PRECURSOR 
(SYND1)(CD138) 


3.5 


739 


AFO 10238 


Homo sapiens 
von Hippel- 
Lindau tumor 
suppressor 


1.00E-09 


LIN1_HUMAN 


LINE-1 REVERSE 
TRANSCRIPTASE 
HOMOLOG 


0.001 


740 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


2.00E-06 


YQJX_BACSU 


HYPOTHETICAL 
13.2 KD PROTEIN 
IN GLNQ-ANSR 
INTERGENIC 
REGION 


9.9 
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Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


741 


L21186 


Human lysyl 
oxidase-like 
protein mRNA, 
complete cds. 


e-145 


OXRTL 


<NONE> 


1.00E-34 


742 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


2.00E-Q5 


CC48_SOYBN 


CELL DIVISION 
CYCLE PROTEIN 
48 HOMOLOG 
(VALOSIN 
CONTAINING 
PROTEIN 
HOMOLOG) (VCP) 


7.6 


743 


AF009203 


Homo sapiens 
YAC clone 
377A1 unknown 
mRNA, 
3'untranslated 
region 


3.3 


<NONE> 


<NONE> 


<NONE> 


744 


Z74894 


S.cerevisiae 
chromosome XV 
reading frame 
ORF YOL152W 


0.12 


CD14_RABIT 


Q28680 oryctolagus 
cuniculus (rabbit), 
monocyte 
differentiation 
antigen cdl4 
precursor. 11/98 


1.9 


745 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


9.00E-10 


KIN3_YEAST 


SERINE/THREONI 
NE-PROTEIN 
KINASE KIN3 (EC 
2.7.1.-) 


2.5 


746 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


2.00E-05 


YA53_SCHPO 


HYPOTHETICAL 
24.2 KD PROTEIN 
C13A11.03IN 
CHROMOSOME I 


7.00E-17 


747 


S61044 


ALDH3=aldehyd 
e dehydrogenase 
isozyme 3 
[human, stomach, 
mRNA Partial, 
1362 nt] 


0 


DHAP_HUMAN 


ALDEHYDE 
DEHYDROGENAS 
E, DIMERIC NADP- 
PREFERRING (EC 
1.2.1.5) (CLASS 3) 


2.00E-71 


748 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


2.00E-08 


CA1E_CHICK 


COLLAGEN 
ALPHA l(XIV) 
CHAIN 
PRECURSOR 
(UNDULIN) ! 


0.36 


749 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


7.00E-06 


<NONE> 


<NONE> 


<NONE> 
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ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


750 


L14815 


Entamoeba 
histolytica HM- 
1:IMSS galactose- 
specific adhesin 
170kD subunit 
(hgl3) gene, 
complete cds. 


0.12 


<NONE> 


<NONE> 


<NONE> 


751 


X63785 


T.thermophila 
gene for snRNA 
U2-2 


1.1 


<NONE> 


<NONE> 


<NONE> 


752 


M83756 


Mytilus edulis 
mitochondrial 
NADH 

dehydrogenase 
subunit 5 (ND5) 
gene, 3' end; 
NADH 

dehydrogenase 
subunit 6 (ND6) 
gene, complete 
cds; and 

cytochrome b (cyt 
b), 5' end. 


0.042 


DSC1JHUMAN 


DESMOCOLLIN 
1A/1B 

PRECURSOR 
(DESMOSOMAL 
GLYCOPROTEIN 
2/3) (DG2/DG3) 


2.6 


753 


AB001066 


Brown trout 
microsatellite 
DNA sequence 


0.38 


IMB3_HUMAN 


IMPORTIN BETA-3 
SUBUNIT 
(KARYOPHERIN 
BETA-3 SUBUNIT) 


1.2 


754 


AF064787 


Lotus japonicus 
rac GTPase 
activating protein 
1 mRNA, 
complete cds 


0.51 i 


<NONE> 


<NONE> 


<NONE> 


755 


U20608 


Dictyostelium 
discoideum 
unknown spore 
germination- 
specific protein- 
like protein, orfl, 
orf2 and orfi 
genes, complete 
cds 


0.043 


<NONE> 


<NONE> 


<NONE> 


756 


M77812 


Rabbit myosin 
heavy chain 
mRNA, complete 
cds. 


1.2 j 


RBL INHUMAN 


RETINOBLASTOM 
A-LIKE PROTEIN 1 
(107 KD 

RETINOBLASTOM 
A-ASSOCIATED 
PROTEIN) (PRB1) 
(P107) 


4.9 
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Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 

1W 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION j 


DESCRIPTION 


P VALUE 


X63789 


T.thermophila 
genes for snRN A 
U5-l,snRNA U5- 
2 


0.058 j 


<NONE> 


<NONE> 


<NONE> 


758 


D50646 


Mouse mRNA for 
SDF2, complete 
cds 


2.00E-27 


PMT3_YEAST 


DOLICHYL- 

PHOSPHATE- 

MANNOSE-- 

PROTEIN 

MANNOSYLTRAN 

SFERASE 3 (EC 

2.4.1.109) 


0.002 


759 


L81583 


Homo sapiens 
(subclone 3 _g2 
from PI Hll) 
DNA sequence 


3.00E-19 


ALU5_HUMAN 


!!!! ALU 

SUBFAMILY SC 
WARNING ENTRY 
!!!! 


0.86 


760 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


2.00E-06 


SYFA_YEAST 


PHENYLALANYL- 
TRNA 

SYNTHETASE 
ALPHA CHAIN 
CYTOPLASMIC 


5.7 


761 


AF000370 


Homo sapiens 
polymorphic CA 
dinucleotide 
repeat flanking 
region 


6.00E-89 


APPl_MOUSE 


AMYLOID-LIKE 
PROTEIN 1 
PRECURSOR 
(APLP) 


5.7 


762 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


0.002 


<NONE> 


<NONE> 


<NONE> 


763 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


7.00E-06 


PSF_HUMAN 


PTB-ASSOCIATED 
SPLICING FACTOR 
(PSF) 


0.72 


764 


AB018288 


Homo sapiens 
mRNA for 
KIAA0745 
protein, partial 
cds 


0 


TC2A_CAEBR 


TRANSPOSABLE 
ELEMENT TCB2 
TRANSPOSASE 


1.5 


765 


AF020282 


Dictyostelium 
discoideum 
DG2033 gene, 
partial cds 


| 0.38 


PMT2JYEAST 


DOLICHYL- 

PHOSPHATE- 

MANNOSE- 

PROTEIN 

MANNOSYLTRAN 

SFERASE 2 (EC 

2.4.1.109) 


0.18 
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Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


766 


AF017357 


Oryza sativa low 
molecular early 
iight-inducible 
protein mRNA, 
complete cds 


0.38 


RGS3_HUMAN 


REGULATOR OF 
G-PROTEIN 
SIGNALLING 3 
(RGS3)(RGP3) 


0.23 


767 


U67599 


Methanococcus 
jannaschii section 
141 of 150 of the 
complete genome 


0.13 


<NONE> 


<NONE> 


<NONE> 


768 


X74178 


B.taurus 
microsatellite 
DNA INRA153 


0.13 


FAG1JSYNY3 


P73574 

synechocystis sp. 
(strain pec 6803). 3- 
oxoacyl-[acyl-carrier 
protein] reductase 1 
(eel. 1.1. 100) (3- 
ketoacyl- acyl carrier 
protein reductase 1). 
11/98 


5.00E-16 


769 


AF041858 


Mus musculus 
synaptojanin 2 
isoform delta 
mRNA, partial 
cds 


0.043 


CA44JHUMAN 


COLLAGEN 
ALPHA 4(IV) 
CHAIN 
PRECURSOR 


0.24 


770 


JO 1404 


Drosophila 
melanogaster 
m itochondri al 

1111 LUv 1 lv/1 1 \*k 1 Ml 

cytochrome c 
oxidase subunits, 
ATPase6, 7 
tRNAs (Trp, Cys, 
Tyr, Leu(UUR), 
Lys, Asp, Gly) 
genes, and 
unidentified 
reading frames 
A61, 2 and 3. 


0.021 


NU1M_CITLA 


NADH- 
UBIQUINONE 
OXIDOREDUCTAS 
E CHAIN 1 (EC 
1.6.5.3) 


7.2 


771 


AL022317 


Human DNA 
sequence from 
clone 140L1 on 
chromosome 
22ql3.1-13.31, 
complete 
sequence [Homo 
sapiens] 


3.00E-41 


ALU7_HUMAN 


!!!! ALU 

SUBFAMILY SQ 
WARNING ENTRY 
!!!! 


4.00E-08 


772 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


1.00E-09 


<NONE> 


<NONE> 


<NONE> 
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Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


773 


AF095927 


Rattus norvegicus 
protein 

phosphatase 2C 
mRNA, complete 
cds 


0 


| P2C_PARTE 


PROTEIN 

PHOSPHATASE 2C 
(EC 3.1.3.16) (PP2C) 


1.00E-16 


774 


X87212 


H.sapiens mRNA 
for cathepsin C 


0 


CATCJJUMAN 


DIPEPTIDYL- 
PEPTIDASEI 
PRECURSOR (EC 
3.4.14.1) 


2.00E-46 


775 


X05283 


Drosophila 
melanogaster 
PKCG7 gene 
exons 7-14 for 
protein kinase C 


4.5 


<NONE> 


<NONE> 


<NONE> 


776 


X03558 


Human mRNA 
for elongation 
factor 1 alpha 
subunit 


0 


EF1 INHUMAN 


ELONGATION 
FACTOR 1 -ALPHA 
1 (EF-1 -ALPHA- 1) 


1.00E-83 


777 


X06960 


Aspergillus 

nidulans 

mitochondrial 

DNA for 

cytochrome 

oxidase subunit 3, 

tRNA-Tyr 


0.23 


<NONE> 


<NONE> 


<NONE> 


778 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


3.00E-09 


YMT8_YEAST 


HYPOTHETICAL 
36.4 KD PROTEIN 
IN NUP116-FAR3 
INTERGENIC 
REGION 


5.00E-07 


779 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


2.00E-07 


NAT1_YEAST 


N-TERMINAL 
ACETYLTRANSFE 
RASE 1 (EC 
2.3.1.88) 


5.00E-23 


780 


U59706 


Galius gailus 
alternatively 
spliced AMPA 
glutamate 
receptor, isoform 
GluR2 flop, 
(GluR2) mRNA, 
partial cds. 


0.014 


PPOL_SARPE 


POLY (ADP- 
RIBOSE) 

POLYMERASE (EC 
2.4.2.30) (PARP) 


0.021 | 


781 


U57391 


Rattus norvegicus 
FceRI gamma- 
chain interacting 
protein SH2-B 
(SH2-B) mRNA, 
complete cds 


1.00E-84 


<NONE> 


<NONE> 


<NONE> 
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SEQ 
ID 






P VAI I IF 




DFSPRIPTION 


P VALUE 


782 


AB014591 


Homo sapiens 
mRNA for 
KIAA0691 
protein, complete 
cds 


7.00E-57 


SSGP_VOLCA 


SULFATED 
SURFACE 
GLYCOPROTEIN 
185 (SSG185) 


5.3 


783 


AJ008065 


Chrysolina bankii 
16S rRNA gene, 
mitotype B2 


0.043 


<NONE> 


<NONE> 


<NONE> 


784 


AF067212 


Caenorhabditis 
eiegans cosmid 
F37F2 


0.005 


MEK1_RAT 


MAPK/ERK 
KINASE KINASE 1 
(EC2.7.1.-)(MEK 
KINASE 1) 


4.5 


785 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


0.042 


<NONE> 


<NONE> 


<NONE> 


786 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


9.00E-09 


<NONE> 


<NONE> 


<NONE> 


787 


Y13401 


Homo sapiens 
CD3 delta gene, 
enhancer 
sequence 


8.00E-08 


<NONE> 


<NONE> 


<NONE> 


788 


AE001O38 


Archaeoglobus 
fulgidus section 
69 of 172 of the 
complete genome 


0.13 


<NONE> 


<NONE> 


<NONE> 


789 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


2.00E-06 


<NONE> 


<NONE> 


<NONE> 


790 


AF041463 


Manihot esculenta 
elongation factor 
1 -alpha 


1.4 


<NONE> 


<NONE> 


<NONE> 


791 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


0.002 


HXA3_HAEIN 


HEME:HEMOPEXI 
N-BINDING 
PROTEIN 
PRECURSOR 


2.7 


792 


Z12112 


pWE15A cosmid 
vector DNA 


3.00E-29 


PKWAJTHECU 


PUTATIVE 
SERINE/THREONI 
NE-PROTEIN 
KINASE PKWA 
(EC 2.7.1.-) 


2.00E-04 
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SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


793 


U85193 


Human nuclear 
factor I-B2 
(NFIB2) mRNA, 
complete cds 


4.00E-44 


<NONE> 


<NONE> 


<NONb> 


794 


U89331 


Human 

pseudoautosomal 
homeodomain- 
containing protein 
(PHOG)mRNA, 
complete cds 


7.00E-06 


NRL_HUMAN 


NEURAL RETINA- 
SPECIFIC 
LEUCINE ZIPPER 
PROTEIN (NRL) 


6.3 


795 


AF055666 


Mus musculus 
kinesin light chain 
2 (KIc2) mRNA, 
complete cds 


0.52 


PSPD_BOVIN 


PULMONARY 
SURFACTANT- 
ASSOCIATED 
PROTEIN D 
PRECURSOR 


0.33 


796 


L13321 


Homo sapiens 
iduronate-2- 
sulfatase (IDS) 
gene, exon 1, 
incomplete 5' end. 


0.14 


YRP2_YEAST 


HYPOTHETICAL 
84.4 KD PROTEIN 
IN RPC2/RET1 
3'REGION 


0.27 


797 


AL010270 


Plasmodium 

falciparum DNA 
*** 

SEQUENCING 
IN PROGRESS 
*** from contig 
4-96, complete 
sequence 


0.37 


YTH3_CAEEL 


HYPOTHETICAL 
75.5 KD PROTEIN 
C14A4.3 IN 
CHROMOSOME II 


2 


798 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


0.015 


1MB3_HUMAN 


IMPORTIN BETA-3 
SUBUNIT 
(KARYOPHERIN 
BETA-3 SUBUNIT) 


0.063 


799 


U70139 


Mus musculus 
putative CCR4 
protein mRNA, 
partial cds 


0 


CCR4_YEAST 


GLUCOSE- 

REPRESSIBLE 

ALCOHOL 

DEHYDROGENAS 

E 

TRANSCRIPTIONA 
L EFFECTOR 
(CARBON 
CATABOLITE 
REPRESSOR 
PROTEIN 4) 


5.00E-1 1 


800 


L26507 


Mouse myocyte 
nuclear factor 
(MNF) mRNA, 
complete cds. 


3.00E-41 


MNF_MOUSE 


MYOCYTE 
NUCLEAR 
FACTOR (MNF) 


4.00E-18 
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SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


801 


U20527 


Mus musculus 
chemokine KC 
gene, 5* region. 


0 


GRO_MOUSE 


GROWTH REGULATED 
PROTEIN PRECURSOR 
(PLATELET-DERIVED 
GROWTH FACTOR- 
INDUCIBLE PROTEIN 
KC) (SECRETORY 
PROTEIN N51) 


1.00E-28 


802 


AF065482 


Homo sapiens 
sorting nexin 2 
(SNX2) mRNA, 
complete cds 


0 


MYSAJDROME 


MYOSIN HEAVY 
CHAIN, MUSCLE 


0.089 


803 


UUjoZJ 


K/fiic miicpiilnc 
IViUo IlJUbwUlUb 

pericentrin 
mRNA, 
complete cds. 






MAI F SPECIFIC 
SPERM PROTEIN 
MST84DD 


0.099 


804 


U67468 


Methanococcus 
jannaschii 
section 10 of 150 
of the complete 
genome 


0.4 


<NONE> 


<NONE> 


<NONE> 


805 


U14178 


Human tvne II 
IL-1 receptor 
gene, exon IB 


1.00E-19 


AMPH HUMAN 


AMPHIPHYSIN 


2.9 


806 


L40411 


Homo sapiens 
thyroid receptor 
interactor 


0 


TRI8HUMAN 


THYROID RECEPTOR 
INTERACTING 
PROTEIN 8 (TRIP8) 


4.00E-86 


807 


D17218 


Human HenG2 
3' region Mbol 
cDNA, clone 
hmd3g02m3 


e-136 


CA1A HUMAN 


COLLAGEN ALPHA 
1(X) CHAIN 
PRECURSOR 


3.00E-04 


808 


Z57610 


H.sapiens CpG 
DNA, clone 
187al0, reverse 
read 

cpgl87al0.rtla. 


e-102 


HN3B_MOUSE 


HEPATOCYTE 
NUCLEAR FACTOR 3- 
BETA (HNF-3B) 


1.00E-24 


809 


D14678 


Human mRNA 
for kinesin- 
related protein, 
partial cds 


0 


NCD_DROME 


CLARET 

SEGREGATIONAL 
PROTEIN 


1.00E-70 



284 



WO 99/33982 



Table 2 



PCT/US98/27610 
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Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


810 


X56317 


Xiohoohorus 
maculatus 
Xmrk(proto- 
oncogene) gene 
for receptor 
tyrosine kinase. 


0.49 


WNlB_MOUSE 


WNT-10B PROTEIN 
PRECURSOR (WNT- 12) 


7.2 


811 


M36200 


Human 

synaptobrevin 1 
(SYBl)gene, 
exon 5. 


0.2 


VE2_HPV14 


REGULATORY 
PROTEIN E2 


3.1 


812 


M18157 


Human glandular 
kallikrein gene, 
complete cds. 


1.5 


EKLF_MOUSE 


ERYTHROID 
KRUEPPEL-LIKE 
TRANSCRIPTION 
FACTOR (EKLF) 


1.1 


813 


D25215 


Human mRNA 
forKIAA0032 
gene, complete 
cds 


1.9 | 


YXIS_SACER 


HYPOTHETICAL 28.9 
KD PROTEIN IN XIS 
5'REGION (ORF1) 


1.3 


814 


M96628 


Human gene 
sequence, 5' end. 


2.00E-06 


AGRI_DISOM 


AGRIN (FRAGMENT) 


9.5 


815 


Z57610 


H.sapiens CpG 
DNA, clone 
1 87al0, reverse 
read 

cpgl87al0.rtla. 


e-102 


HN3BJ40USE 


HEPATOCYTE 
NUCLEAR FACTOR 3- 
BETA (HNF-3B) 


1.00E-19 


816 


X14168 


Human pLC46 
with DNA 
replication origin 


5.00E-16 


ZN44_HUMAN 


ZINC FINGER PROTEIN 
44 (ZINC FINGER 
PROTEIN KOX7) 


1.6 


817 


M19262 


Rat clathrin light 
chain (LCB3) 
mRNA, 
complete cds. 


0.28 


LMAJDROME 


LAMIN1N ALPHA 
CHAIN PRECURSOR 


4.7 


818 


AF058055 


Mus musculus 
monocarboxylate 
transporter 1 


0.2 


<NONE> 


<NONE> 


<NONE> 


819 


AB014570 


Homo sapiens 
mRNA for 
KIAA0670 
protein, partial 
cds 


0.16 


YGR1_YEAST 


HYPOTHETICAL 34.8 
KD PROTEIN IN SUT1- 
RCK1 INTERGENIC 
REGION 


4.00E-06 


820 


M19262 


Rat clathrin light 
chain (LCB3) 
mRNA, 
complete cds. 


0.27 


LMA_DROME 


LAMININ ALPHA 
CHAIN PRECURSOR 


4.5 
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Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


1 P VALUE 


821 


Z54367 


H.sapiens gene 
for plectin 


0.29 


Y093_CAEEL 


HYPOTHETICAL 58.5 
KD PROTEIN T20B 12.3 
IN CHROMOSOME III 


1.00E-14 


822 


ABO 17026 


Mus musculus 
mRNA for 
oxysterol- 

VtinHlncT nrotpin 

Dinuiiig pruiciu, 

complete cds 


0 


OXYBHUMAN 


OX YSTEROL-B INDING 
PROTEIN 


2.00E-49 


823 


X58170 


M.musculus 
mRNA fort- 
Complex Tcp- 
10a gene 


1.00E-20 


UL52_HSV11 


DNA 

HELICASE/PRIMASE 
COMPLEX PROTEIN 
(DNA REPLICATION 
PROTEIN UL52) 


5.3 


824 


X58430 


Human Hoxl 8 

gene 


o 


HXAA HUMAN 


HOMEOBOX PROTEIN 

HOX-AIO(HOX-IH) 

(HOX-1.8)(PL) 


1 OOE-44 


825 


X53754 


Porcine 

sarcoplasmic/end 
oplasmic- 
reticulum Ca(2+) 
pump gene 2 3'- 
end region 


1.3 


<NONE> 


<NONE> 


<NONE> 


826 


AB005786 


Arabidopsis 
thalianatRNA- 
GIu gene 


0.46 


<NONE> 


<NONE> 


<NONE> 


827 


AB012130 


Homo sapiens 
SBC2 mRNA for 
sodium 
bicarbonate 
cotransporter2, 
complete cds 


1.9 


<NONE> 


<NONE> 


<NONE> 


828 


AB017430 


Homo sapiens 
mRNA for 
kinesin-like 
DNA binding 
protein, complete 
cds 


0 


YBAV_ECOLI 


HYPOTHETICAL 12.7 
KD PROTEIN IN HUPB- 
COF INTERGENIC 
REGION 


0.063 


829 


AB007886 


Homo sapiens 
KIAA0426 
mRNA, 
complete cds 


0.042 1 


YDF3_SCHPO 


PROBABLE 
EUKARYOTIC 
INITIATION FACTOR 
C17C9.03 


0.52 


830 


AB018335 


Homo sapiens 
mRNA for 
KIAA0792 
protein, complete 
cds 


e-172 


UROT_BOVIN 


TISSUE PLASMINOGEN 
ACTIVATOR 
PRECURSOR (EC 
3.4.21.68) 


0.86 
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Table 2 





Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


831 


D 12646 


Mouse kif4 
mRNA for 
microtubule- 
based motor 
protein KIF4, 
complete cds 


0 


KIF4_MOUSE 


KINESIN-LIKE 
PROTEIN KIF4 


9.00E-96 


832 


U38376 


Rattus 
norvegicus 
cytosolic 
phospholipase 
A2 mRNA, 
complete cds 


0.048 I 


<NONE> 


<NONE> 


<NONE> 


833 


L40411 


Homo sapiens 
thyroid receptor 
interactor 


0 


TRI8_HUMAN 


THYROID RECEPTOR 

TXTTT7D A r'TTXIO 

PROTEIN 8 (TRIP8) 


4.00E-86 


834 


U08110 i 


Mus musculus 
RNA1 homolog 
(Fug 1) mRNA, 
complete cds. 


8.00E-04 


YNW7_YEAST 


HYPOTHETICAL 68.8 
KD PROTEIN IN URE2- 
SSU72 INTERGENIC 
REGION 


0.02 


835 


D50646 


Mouse mRNA 
for SDF2, 
complete cds 


1.00E-40 


YB64_YEAST 


HYPOTHETICAL 57.2 
KD PROTEIN IN MET8- 
HPC2 INTERGENIC 
REGION 


4.9 


5 JO 


D50646 


Mouse mRNA 
for SDF2, 
complete cds 


1.00E-40 


YB64_YEAST 


HYPOTHETICAL 57.2 
KD PROTEIN IN MET8- 
HPC2 INTERGENIC 
REGION 


4.9 


837 


U67459 


Methanococcus 
jannaschii 
section 1 of 150 
of the complete 
genome 


5.00E-05 


GCS1_HUMAN 


MANNOSYL- 
OLIGOSACCHARIDE 
GLUCOSIDASE(EC 
3.2.1.106) 


9.2 


838 


U18657 


Haemophilus 
influenzae LeuA 
(leuA) gene, 
partial cds, DprA 
(dprA+), orf272 
and orfl93 
genes, complete 
cds, and PfkA 
(pfkA) gene, 
partial cds. 


0.01 


STE6_YEAST 


MATING FACTOR A 
SECRETION PROTEIN 
STE6 (MULTIPLE 
DRUG RESISTANCE 
PROTEIN HOMOLOG) 
(P-GLYCOPROTEIN) 


7 
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Table 2 





Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


839 


U 12523 


Rattus 
norvegicus 
ultraviolet B 
radiation* 
activated UV98 
mRNA, partial 
sequence. 


1.00E-10 


YMT8JYEAST 


HYPOTHETICAL 36.4 
KD PROTEIN IN 
NUP116-FAR3 
INTERGENIC REGION 


2.00E-06 


840 


D78255 


Mouse mRNA 
for PAP-1, 
complete cds 


e-175 


<NONE> 


<NONE> 


<NONE> 


841 


D17263 


Human HepG2 
3* region Mbol 
cDNA, clone 
hmd5fl)7m3 


1.00E-58 


<NONE> 


<NONE> 


<NONE> 


842 


AF006751 


Homo sapiens 
ES/130 mRNA, 
complete cds 


0.061 


YRP2_YEAST 


HYPOTHETICAL 84.4 
KD PROTEIN IN 
RPC2/RET1 3'REGION 


2.00E-07 


843 


U67459 


Methanococcus 
jannaschii 
section 1 of 150 
of the complete 
genome 


6.00E-05 


YC14_METJA 


HYPOTHETICAL 
PROTEIN MJ1214 


8.1 


844 


D88689 


Mus musculus 
mRNA for flt-1, 
complete cds 


0.084 


ICP0_HSV2H 


TRANS-ACTING 
TRANSCRIPTIONAL 
PROTEIN ICP0 
(VMW1 18 PROTEIN) 


0.014 
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Clone Name 


Cluster 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 


Clones 




ID 


Libl 


Lib2 


Lib3 


Lib4 


Lib8 


Lib9 


M00001340B:A06 


17062 


3 


0 


0 


0 


0 


0 


M00001340D:F10 


11589 


2 


2 


1 


3 


3 


8 


M00001341A:E12 


4443 


10 


6 


2 


6 


3 


11 


M00001342B:E06 


39805 


2 


0 


0 


0 


1 


0 


M00001343C:F10 


2790 


7 


15 


13 


14 


6 


0 


M00001343D:H07 


23255 


3 


0 


1 


1 


0 


0 


M00001345A:E01 


6420 


8 


0 


2 


0 


1 


0 


M00001346A:F09 


5007 


4 


8 


3 


6 


2 


6 


M00001346D:E03 


6806 


5 


2 


1 


2 


0 


3 


M00001346D:G06 


5779 


5 


4 


3 


4 


0 


0 


M00001346D:G06 


5779 


5 


4 


3 


4 


0 


0 


M00001347A:B10 


13576 


5 


0 


0 


0 


12 


11 


M00001348B:B04 


16927 


4 


0 


0 


2 


0 


0 


M00001348B:G06 


16985 


4 


0 


0 


0 


0 


0 


M00001349B:B08 


3584 


5 


11 


5 


0 


0 


2 


M00001350A:H01 


7187 


5 


3 


1 


0 


1 


0 


M00001351B:A08 


3162 


10 


14 


1 


6 


6 


5 


M00001351B:A08 


3162 


10 


14 


1 


6 


6 


5 


M00001352A:E02 


16245 


4 


0 


0 


0 


0 


0 


M00001353A:G12 


8078 


4 


3 


1 


0 


1 


0 


M00001353D:D10 


14929 


4 


0 


0 


1 


23 


16 


M00001355B:G10 


14391 


3 


1 


0 


0 


0 


0 


M00001357D:D11 


4059 


8 


6 


8 


16 


0 


1 


M00001361A:A05 


4141 


5 


2 


10 


16 


4 


27 


M00001361D:F08 


2379 


26 


13 


4 


2 


2 


3 


M00001362B:D10 


5622 


7 


4 


2 


13 


1 


2 


M00001362C:H11 


945 


9 


21 


2 


1 


0 


0 


MO0O01365C:C10 


40132 


2 


0 


0 


0 


3 


0 


M00001370A:C09 


6867 


7 


3 


0 


0 


0 


0 


M00001371C:E09 


7172 


3 


5 


1 


2 


0 


1 


M00001376B:G06 


17732 


1 


3 


5 


0 


1 


4 


M00001378B:B02 


39833 


2 


0 


0 


0 


0 


0 


M00001379A:A05 


1334 


27 


38 


35 


28 


3 


0 


M00001380D:B09 


39886 


2 


0 


0 


0 


0 
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M00001382C:A02 


22979 


2 


1 


0 


0 


0 


0 


M00001383A:C03 


39648 


2 


0 


0 


0 


0 


0 


M00001383A:C03 


39648 


2 


0 


0 


0 


0 


0 


M00001386C:B12 


5178 


5 


5 


4 


2 


5 


2 


M00001387A:C05 


2464 


5 


19 


25 


16 


1 


0 


MO0O01387B:G03 


7587 


6 


2 


1 


0 


0 


0 


MO0O01388D:G05 


5832 


10 


3 


0 


1 


5 


0 


M00O01389A:C08 


16269 


3 


0 


0 


0 


1 


1 


M00001394A:F01 


6583 


2 


7 


3 


2 


0 


0 


M00001395A:C03 


4016 


5 


14 


0 


6 


0 


0 


M00001396A:C03 


4009 


6 


4 


13 


5 


4 


10 


M00001402A:E08 


39563 


2 


0 


0 


0 


0 


0 
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Table 5 All Differential Data for Libs 1-4 and 8-9 
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Table 5 All Differential Data for Libs 1-4 and 8-9 
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u 


0 

u 


n 

u 


0 

u 


moooo i s 1 9 a • a no 

MUUUU ID l ZA.AU7 


701 R^ 

J7IOO 


9 
z 


n 

u 


n 

u 


0 

u 


n 

u 


n 

u 


Mnnnn 1 s 1 9rvnoo 

MUUUU I D iZDXjyjy 


J7JO 


0 


0 


s 


9 
z 


n 

u 


n 


MOOOn 1 S 1 7 A *Rn6 
MUUUU 1 J I J A. DUO 


*tJUO 


10 

1U 


4 


0 

u 


0 


9 


0 


Mnnnn 1 s 1 7r , «P0R 




1 
1 


n 

u 


n 

u 


0 

u 


0 


0 


Mnnooi si Ap-m 1 

IVIUUUU 1 J \ t^.JL/ 1 1 


40044 


9 


n 

u 


n 

u 


0 

u 


0 


n 


Moonn 1 s 1 7 a -Rn7 

IVIUUUU Ul /A.DU/ 


4717 


17 


U 


1 

1 


0 

u 


1 

J 


n 

u 


Mnnnn 1 si rp-ri 1 

IVIUUUU IJI oV-.O 1 1 


R0S9 


7 


4 


n 

u 


4 


9 


0 


KyfAAAAl SOft A -PA/I 
MUUUU 1 jzoA.CU4 


7777 


^ 




J 


1 6 

10 


19 
1Z 


91 
Z I 


M0O001528A:FO9 


18957 


3 


0 


0 


0 


0 


0 


M00001528B:H04 


8358 


3 


3 


2 


0 


0 


0 


M00001531A:D01 


38085 


2 


0 


0 


0 


0 


0 


M00001532B:A06 


3990 


6 


12 


4 


1 


3 


1 


M00001533A:C11 


2428 


14 


14 


13 


9 


2 


19 


M00001534A:C04 


16921 


4 


0 


0 


1 


2 


1 


M00001534A:D09 


5097 


6 


5 


1 


1 


3 


2 


M00001534A:F09 


5321 


11 


7 


1 


5 


10 


26 
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Table 5 All Differential Data for Libs 1-4 and 8-9 



Clone iName 


ciusier 


Clones in 


cjones in 


atones in 


ciuucs in 


i^iones in 






in 




Lib2 


Lib3 


Lib4 


Lib8 


Lib9 


M00001534OA01 


4119 


9 


4 


2 


2 


5 


3 


M00001535A-B01 


7665 


3 


1 


5 


0 


0 


0 


Mnn001S35A*C06 


20212 


2 


o 


1 


1 


0 


0 


M00001535A*F10 


39423 


2 


0 


0 


0 


0 


0 


M00001536A*B07 

IV1UUUU 1 JJVii.iJU / 


2696 


23 


11 


9 


18 


10 


21 


M00001 536A-C08 


39392 


2 


o 


0 


0 


0 


0 


M00001 537A-F12 


39420 


2 


o 


o 


0 


0 


0 


M00001537B*G07 


3389 


4 


11 


13 


2 


0 


0 


mooooi 540A*no6 


8286 


6 


1 


o 


3 


4 


0 


MOOOOI 541 A 'DO? 


3765 


19 


6 


o 


o 


o 


0 


moooo 1 54 1 a -F07 


92085 


3 


o 


o 


0 


o 


1 


M00001541 A-H03 


30174 


2 


o 


0 


o 


0 


o 


moooo 1 549 a • a oq 


99 11 3 




o 


o 


o 


o 


o 


MOOOO 1 549 A «F06 


30453 


9 


o 


o 


o 


0 


o 


mooooi 544 a «F03 


19170 

1 Z. 1 /v 


9 


1 

I 


2 


o 


o 


o 


MOOOOI 544A-G09 


10820 


9 


o 


1 


o 


o 


o 


MOOOO 1 544R*R07 




7 


J 


0 


1 


o 


o 


MOOOO 1 545 A 'P03 


10955 


9 


o 


o 


o 


o 


o 


MOOO0 1 545A*n08 


13864 




o 


2 


1 


2 


4 


MOOOOI 546A*fi1 1 


1967 


43 


55 


5 


o 


0 


0 


Mnnno i 548 a »f i n 


5892 

JO/A 


5 


] 


4 


4 


1 


3 


mooooi 548a-hoo 


1058 


40 

"TV 


44 


37 


47 


39 


59 


mooooi 54Q a -R09 


4015 


10 
I u 


5 


& 

o 


15 


2 


0 


mooooi 540A*noK 


10044 


7 
j 


n 

w 


j 


1 


o 


7 


mooooi 540R-F06 


4103 


19 


7 




2 


o 


1 


mooooi 54or»F06 


16347 


4 


o 


0 


o 


o 


o 


MOOO0 1 550A-A03 


7930 


5 


2 


1 


0 


2 


0 


mooooi 550 a -go i 




o 


1 


3 


2 


o 


0 


MOOOOI 551 A-B10 

IVL\J\J\J\J I J J lf\»0 1 V 


6268 


6 


4 


3 


18 


5 


0 


MOOOOI 551 A -F05 


39180 


2 


o 


o 


0 


0 


0 


M00001551A-G06 


22390 


2 


1 


o 


0 


0 


1 


MOOOOI 55 1C-G09 


3266 


12 


14 


o 


1 


0 


6 


MOOOOI 552A-B12 


307 


73 


60 


196 


75 


79 


27 


M00001552A 4 D11 


39458 


2 


o 


0 


0 


0 


0 


MOOOOI 552B-D04 


5708 


5 


4 


4 


3 


1 


4 


MOOO0 1553 A -H06 


8298 


4 


3 


1 


3 


o 


0 


M00001553B-F12 


4573 


5 


7 


2 


5 


0 


1 


M00001553D-D10 


22814 


3 


o 


o 


0 


0 


0 


MOOOOI 555 A-B02 


39539 


2 


o 


o 


0 


1 


0 


mooooi 55 5 a -no i 


30105 
j7 1 y j 


9 


o 


o 


o 


o 


o 


M00001555D:G10 


4561 


8 


4 


4 


8 


0 


0 


M00001556A:C09 


9244 


2 


0 


3 


2 


10 


17 


M00001556A:F11 


1577 


12 


40 


25 


3 


4 


0 


MOOOOI 556A:H01 


15855 


2 


1 


1 


2 


12 


213 


M00001556B:C08 


4386 


7 


8 


3 


1 


3 


21 


M00001556B:G02 


11294 


4 


0 


2 


0 


0 


1 


M00001557A:D02 


7065 


5 


3 


2 


1 


0 


0 


M00001557A:D02 


7065 


5 


3 


2 


1 


0 


0 
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Table 5 All Differential Data for Libs 1-4 and 8-9 



Clone Name 


Cluster 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 


cioiics in 




IT* 

IU 


I Shi 




J-|IU<J 




Lib8 


Lib9 


MO0O0H57A-F01 
iviuuuu i «j j / r\*r \j i 


9635 


1 


o 


2 


1 


o 


0 


MOOOO 1 557 A *F01 


39490 


2 


o 


o 


o 


1 


0 


MOOO0 1 557R-H10 


5192 


g 


5 


o 


5 


o 


0 


mooooi 557TYnoo 


8761 


1 


4 


o 


1 


o 


1 


MOOO0 1 55RR-H1 1 
ivjuuuu i jjoD.n 1 1 


7514 




1 


0 


0 


o 


o 


moooo i 5fiorvF i o 

IVIUUUU I JOUU.r 1 v 


6558 

UJ JO 


4 


1 


4 


o 


o 


5 


moooo l 5A i a -po5 

IVIUUUU I JO 1 M.V^UJ 


1Q486 


9 


o 

u 


0 


o 


o 


o 


moooo 1 56iR»FOfi 
iviuuuu i jojo.r uo 


109 


9R0 

Z07 


911 


278 


116 


123 


184 


MOOOO 1 5/>4 A «R 1 9 

IVIUUUU 1 JOHn.D 1 Z 


5051 


1 1 


4 


2 


2 


1 


1 


moooo 1 57 1 p-HOfi 

iviuuuu u / 1 v.nyu 


5749 


4 




9 


o 


o 


0 


moooo 1 57rr-fo4 

IVIUUUU 1 J /OOXUH 


91001 


9 


1 


0 


2 


o 


0 


mooooi 57on-POi 
iviuuuu i j iyij.\~xtj 


6510 


R 
0 


1 


ft 


o 


o 


1 


MOOOOI 5RirYA in 
IVIUUUU I JO J La A I U 


6901 

OZ7J 


-l 
J 


j 


9 




ft 


o 


mooooi 5rap-P05 

1Y1UUUU 1 JoOL*XA/J 


4691 


J 


4 


12 

I 4& 


9 


1 

1 


1 


moooo i 5R7 a -r i l 

IVIUUUU 1 JO / r\.D 1 J 


ioi ro 

jyj ou 


9 
z 


o 

u 


ft 


o 


o 


o 


mooooi 5qar«i-ioa 
muuuu i jy^D.riu^ 


960 
zou 


1 RO 


1 RR 

1 00 


97 
z / 


9 


1 s 

1 J 


ft 


MUUUU i jy /CHUz 


AR17 
**OJ / 


O 


9 
z 


10 
1 u 


ft 

U 


-2 
J 


1 

1 


MUUUU I J7 /L/.I^Uj 


10A70 
iU*f /U 


c 

J 


1 


ft 
u 


9 
z 


ft 
u 


1 

1 


mooooi ^ORA-frOi 

IVIUUUU 1 J70/\.vJUJ 


16QQQ 

1 0777 


/i 
*r 


u 


ft 
u 


ft 


ft 


o 


moooo i ao i a «noR 

MUUUU 1 OU 1 A .UUo 


09 70 A 
ZZ /yf 


z 


u 


ft 
u 


o 

u 


ft 


o 


moooo i £oa a *r i o 

IVIUUUU 1 OUHA.D I U 


1100 
ijyy 


AO 

t7 


97 
z / 


10 


7 


10 


23 


MO0001 60A A »F05 
IVIUUUU I OUHn.rUJ 


oyjy i 


9 


o 


0 


o 


o 


o 


iv>f nnnn i &m a «f i i 

IVIUUUU I OU / A.C1 I 


1 1 465 
I I40J 


J 


ft 
u 


ft 

U 


o 


o 


0 


moooo 1 ^or a *roi 

IVIUUUU I OUOA.-DUJ 


7R09 
/ouz 


J 


A 


ft 

u 


1 
I 


o 


o 


moooo i aorr«foi 

IVIUUUU IOUODXUj 


991 55 

ZZ I J J 


1 
J 


o 

u 


ft 

u 


o 


o 


o 


mooooi aiap-fio 

IVIUUUU IO I f^.r 1 U 


111 57 
I j I j / 


4 


1 


ft 

u 


1 


1 


o 


MOOOO 1 (\ 1 7P*F09 
IVIUUUU IO I /COJZ 


17004 

I /UU^T 


4 

*T 


o 

u 


1 

i 


o 


1 


o 


moooo i f> i op • f 1 9 
iviuuuu io 1 r iz 


A011 A 

tUJ I** 


9 
z 


o 

u 


ft 


o 


1 


o 


MOOOO 1 f0 1 P'POR 
IVIUUUU I UZ I v.^UO 


40044 


9 


o 


ft 


o 


o 


o 


mooooi /v/itypio 

IVIUUUU 1 DZJU.r I U 


1 1011 

I J7 1 J 


9 


1 
1 


9 


o 


o 


1 


MOOOO 1 69 A A *ROfi 

IVIUUUU 1 OZtA.DUU 


1977 


10 




3 


3 


5 


1 


moooo 1 694P-F0 1 

ivi\j\j\j\j i uxt^^ru i 


4109 


4 


13 


1 


10 


o 


o 


MOOOO 1630B-H09 

IV1VWV JL UJUU.X 1V7 


5214 


10 


2 


2 


2 


4 


3 


MOOOO 1644C-B07 


39171 


2 


o 


o 


0 


0 


0 


MOOOOI 645 A «C 12 


19267 


2 


o 


o 


o 


o 


1 


MOOOO 1648C-A01 


4665 


5 


9 


o 


o 


o 


o 


MOOOOI 657DP03 


23201 


3 


o 


o 


o 


3 


o 


MOOOO 1657DF08 


76760 


1 


o 


2 


2 


o 


5 


MOOOOI 662P-A09 


21218 


i 

■j 


o 


o 


o 


o 


0 


mooooi aai a *poa 

IVIUUUU 1 00 j A.CU*f 


15709 
J J f uz 


9 
z 


u 


ft 

U 


o 


0 


o 


M00001669B:F02 


6468 


4 


3 


3 


8 


1 


0 


M00001670C:H02 


14367 


3 


0 


0 


0 


0 


0 


M00001673C:H02 


7015 


6 


3 


1 


2 


1 


1 


M00001675A:C09 


8773 


4 


1 


4 


4 


4 


6 


M00001676B:F05 


114.60 


4 


2 


0 


0 


0 


0 


M00001677C:E10 


14627 


1 


2 


1 


0 


1 


0 


M00001677D:A07 


7570 


5 


3 


0 


0 


0 


0 


M00001678D:F12 


4416 


9 


5 


2 


6 


1 


3 
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Table 5 All Differential Data for Libs 1-4 and 8-9 



Clone Name 


Cluster 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 


Clones 




ID 


Libl 


Lib2 


Lib3 


Lib4 


Lib8 


Lib9 


MUUUU 1 0 /VA.AUo 


ooou 


•7 


A 
U 


A 

4 


2 


1 


A 
U 


\>taaaa i ato A.rin 
MUUUU I o /y A .r I U 


ZOo / J 


1 
1 


A 
U 


A 
U 


A 
U 


1 
1 


A 
U 


NifAAAA 1 /C7AD .cn 1 

MUUUU lo/VD.rUl 


o2yo 


a 

2 


A 

4 


c 

J 


3 


1 


A 
U 


\ A*AAAA 1 iC"7Q/^.tTA 1 

MUUUU I o /yc.ru I 


/ouy l 


i 
1 


• A 

U 


A 
U 


A 
U 


A 
0 


A 
U 


MUUUU 1 o /yD.UVJ 


1 A*7< 1 
lU/jl 


3 


a 
2 


A 
0 


1 


A 

0 


t 
I 


MUUUU 10 /VD.UUj 


lU/Jl 


3 


a 

2 


A 
U 


1 


A 

u 


i 
1 


n >iaaaa i £OArvrAQ 
MUUUU 1 OoUDrr Uo 


lUjjy 


a 

2 


i 
1 


1 


A 
U 


1 


A 
U 


MUUUU 1 682C:B 1 2 


1 *7ACC 

1 /Ujj 


A 

4 


a 
U 


A 
0 


A 

0 


A 

0 


A 

u 


X 4AAAA 1 £.0£. A .T?A^ 

MUU001686A:E06 


4622 


7 


6 


4 


A 

2 


3 


0 


MOUOU 1 688C:FU9 


5382 


6 


a 
2 


6 


A 

2 


A 

0 


3 


X /TAAAA 1 ^A1/^.PA1 

MOUOU 1693C:G01 


J7AT 

4393 


1 A 

10 


6 


2 


4 


1 


1 


M0000l7l6D:H05 


67252 


1 

1 


0 


A 

0 


1 


0 


0 


M0000374 1 D:C09 


/t A1 AO 

40108 


A 

2 


A 

0 


A 

0 


A 

0 


0 


0 


X /AAAAnil"jr»./^Af 

M00003747D:C05 


1 1476 


6 


0 


0 


0 


0 


0 


X jfAAAAnCAD .FJ AA 

M00003759B:B09 


697 


76 


C A 

52 


30 


72 


21 


30 


X JfAAftA')')^/ , '.nAO 

M00003762C:B08 


17076 


4 


A 

0 


0 


0 


0 


0 


M00003763A:F06 


1 1 AO 

3108 


1 A 

14 


1 1 


7 


5 


0 


1 


X iAAAA1TT>ir'. A A*J 

M00003774C:A03 


67907 


1 


0 


0 


0 


0 


0 


X ilAAAAMA^r'.nAf 

M00003796C:D05 


CC 1 A 

5619 


3 


5 


3 


3 


0 


4 


X JAAAATOA^n. A r\/" 

M00003826B:A06 


1 1350 


3 


3 


0 


0 


1 


0 


lifAAAATCJI A .T~!f\C 

M00003833A:E05 


r\ i 0*7*7 

21877 


A 

2 


1 


0 


A 

0 


0 


1 


X AAAAA'^O-iTT'*. A A1 

M00003837D:A0l 


7899 


5 


A 

4 


0 


2 


1 


0 


M00003839A:D08 


7798 


5 


a 
2 


2 


0 


0 


1 


AJAAAA^OvlilP.DI 1 

M00003844C:B1 1 


/CIA 

6539 


o 

8 


3 


0 


0 


0 


1 


x xaaaao oa £n .T\n/z 

MUUUU3 846B :DUo 


6874 


6 


3 


A 

0 


A 

0 


0 


A 

0 


X /fAAAAIOC 1 1~> . T~\ 1 A 

M000Q3851B:D10 


13595 


4 


A 

0 


1 


A 

0 


0 


1 


tla nnnAio<i a .r\A/i 
MUUUU38j3A:L>U4 


5619 


3 


c 
J 


3 


3 


A 

0 


A 

4 


MUUUU3o53A:r 12 


1 AC 1 C 

1U515 


5 


1 
1 


A 

0 


1 


1 


A 

2 


MUUUU3 8 56B :CU2 


4622 


7 


r 

0 


4 


A 

2 


3 


A 

0 


KvinAAAIDCl A ./~* 1 A 

MUUUU3857A:01U 


3io9 


4 


1 1 


13 


A 

2 


0 


A 

0 


X/AAAA10C7 A .1 I AO 

MUUUU3857A:HU3 


ah i o 
4/18 


A 

4 


5 


5 


A 

2 


4 


6 


X/fAAAAOOTI /^.ITAO 

MUUUU3 8 / 1 CbU2 


4573 


c 

J 


7 


A 

2 


5 


A 

0 


1 


\A A AAA1 Q*7CD .t?A/l 

MUUUUJo /JDlr U4 


1 OQT7 

ily / / 


J 


A 

u 


A 
0 


A 
U 


A 

0 


A 
0 


IV >l A A AA1B T C D • CA /I 

MUUUUJo /JD.rU4 


\Ly 1 1 


c 

J 


A 

u 


A 
U 


A 
U 


A 

0 


A 
0 


K/f A AA A1 »r5 AT 

MUUUUJo / JCUU / 


o4/y 


A 

4 


J 


1 


1 


A 

2 


A 

4 


Avf AAAmQ*7£T"VP 10 

MUUUUjo /OU.clZ 




c 
J 


o 


2 


A 
U 


A 
0 


i 
1 


KyfAAAAICTOTJ./^l 1 
MUUUUJo / VDlVvl 1 


jJ4j 


1 


i 
1 


/ 


• A 

4 


6 


27 


\A r AAAA'?fl70Q'Fl1 A 
JViUUUUJo /yD.D 1 U 


J 1 JO / 


i 
1 


1 


A 
U 


A 
U 


1 

I 


A 

u 


\/f AAAA1 Q*70rV A AO 

MUUUUJo /yU.AU2 


1 A CAT 
145U/ 


•5 
J 


i 
1 


A 

u 


A 

u 


3 


1 


MAAAATOOCP. A AO 

MUUUU3oojL:AU2 


13576 


5 


A 

0 


A 

0 


A 

0 


12 


1 1 


M00003885C:A02 


13576 


5 


0 


0 


0 


12 


11 


M00003906C:E10 


9285 


4 


3 


0 


0 


1 


2 


M00003907D:A09 


39809 


I 


0 


0 


0 


2 


1 


M00003907D:H04 


16317 


3 


0 


0 


0 


0 


0 


M00003909D:C03 


8672 


4 


4 


0 


0 


0 


0 


M00003912B:D01 


12532 


4 


1 


0 


1 


0 


1 


M00003914C:F05 


3900 


9 


6 


8 


1 


7 


13 


M00003922A:E06 


23255 


3 


0 


1 


1 


0 


0 
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Table 5 All Differential Data for Libs 1-4 and 8-9 



Clone Name 


Cluster 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 


Clones 




ID 


Libl 


Lib2 


Lib3 


Lib4 


Lib8 


Lib9 


R/f AAAA1QCQ A »Uf\0 

MUUUUj95oA.riU2 


1 QACO 

1895/ 


3 


A 
U 


A 
0 


A 
0 


0 


0 


X AAAAAOACO A .LinO 

M00003958A:H02 


18957 


3 


0 


0 


0 


0 


0 


K/f AAAA7 ACO/"<./'> 1 A 

MUUUU395oC.U I U 


AAA C C 

4U455 


o 
2 


A 
U 


A 

0 


A 

0 


0 


A 

0 


MUUUU395oC.lJ 1 u 


A A A C C 

4U455 


2 


A 
U 


A 

0 


A 

0 


0 


0 


MUUUUjyooD.r uo 


OA A CG 

z44oo 


o 

2 


A 
U 


1 

1 


A 

4 


0 


A 

0 


K /f A A A A 1 Q*7 A C* • Q A Q 


A A1 OO 

4U I LL 


2 


A 
U 


A 

0 


A 

0 


0 


0 


\ /f A A A A 1 0*7 /I n • t? AT 
MUUUU jy / 4L/.I1U / 




o 

3 


A 
U 


A 
0 


A 
0 


0 


A 

0 


IV A A A A A 1 Q*7 /I P\ « U AO 

MUUUU jy /4U.HU2 


23358 


o 

3 


A 

0 


0 


0 


1 


0 


X/IAAAAIQTC A ./^ 1 1 

MUUUU39 /jAiOI 1 


12439 


A 

4 


0 


A 

0 


A 

0 


0 


0 


N/IAAAAIAOOQ./^ AC 

Muuuujy /oB:vjU-> 


5oy3 


7 


A 

4 


1 


3 


1 


1 


x aaaaaoao i a ,r?i a 

M00QQ3981A:E10 


3430 


9 


1 A 

10 


7 


3 


0 


0 


M00003982C:C02 


2433 


10 


13 


21 


18 


8 


8 


M00003983A:A05 


9105 


5 


1 


1 


1 


0 


0 


x aaaaaa a^ orv. k f\r 

M00004028D:A06 


6124 


4 


8 


1 


9 


1 


0 


M00004028D:C05 


A AAT1 

40073 


2 


0 


1 


0 


0 


1 


x aaaaaa ao i a * a n 

MUUUU4U3 1 A:A 1 2 


9061 


5 


2 


0 


0 


0 


0 


\4AAAA>IA^1 A . A 1 O 

MU0UU4U3 1 A: A 1 2 


906 1 


5 


2 


0 


0 


0 


0 


MUUUU4U35L:A07 


37285 


2 


0 


0 


1 


0 


1 


x a aaaa a ao crv.riA/: 

MUUUu4U35D:B06 


1 ?A1 /T 

17036 


4 


0 


0 


0 


0 


0 


x/taaaaaaca a . r^\r\zr 

MUUUU4U59A:UU6 


C A 1 *7 

54 1 7 


10 


4 


0 


9 


2 


0 


\A AAAA/1 A/COD ■ A A 1 

MUUUU4UooB:A01 


3706 


7 


1 A 

14 


4 


22 


1 


0 


M00Q04Q72B:B05 


17036 


4 


0 


0 


0 


0 


0 


X A AAAA A AO 1 /"« 1 A 

M0Q00408 1 C:D 1 0 


15069 


3 


0 


0 


1 


0 


0 


X AAAA A A AO 1 f~*.T\ 1 O 

M0000408 1 C:D 1 2 


1 A OA 1 

14391 


3 


1 


0 


0 


0 


0 


X AAAAAA AO/Tr**./^ A/T 

MUUUU4U 8oD :O06 


AOOC 

9285 


4 


3 


0 


0 


1 


2 


X./IAAAA/1 AQorv A A1 

MUUUU4U8 fU.AV I 


C QOA 

688U 


2 


6 


1 


1 


0 


0 


X./TAAAA/I ftOin>D 1 O 

MUUUU4U93D:B 1 2 


5325 


5 


5 


2 


0 


2 


1 


\A AAAA/I AQ7rVD 1 O 
MUUUU4UyjU.t> iz 


5325 


5 


c 

5 


2 


0 


2 


1 


lV/fAAAA/l 1 AC/™*. A A/1 

MUUUU4 1 U5CAU4 


*700 1 

722 J 


5 


2 


2 


2 


0 


0 


K/f AAAA/I 1 AQ a .CA/C 

MUUUU4 1 UoAlilUO 


A A1*7 

4937 


A 

4 


A 

9 


3 


1 


3 


1 


\A AAAA/I 111 I"V A AO 

MUUUU4 1 1 1 D.AUo 


05/4 


O 


3 


0 


0 


0 


0 


N/f AAAA/1 1 1 i4/^«U 1 1 

MUUUU41 14Cr 1 1 


1 O 1 QO 

13 183 


2 


3 


A 

0 


1 


0 


1 


MUUUU4 tJoo.rlUz 


1 00*70 

132/2 


3 


2 


A 

0 


3 


0 


0 


\/f AAAA/1 1 A&C^'C* 1 1 
JV1UUUU4 1 40CV-/ 1 1 


COC7 

525 / 


2 


Q 
0 


c 

5 


5 


5 


25 


\A AAAA/1 1 C 1 n.DHO 
MUUUU4 1 j l U.d\)o 


16977 


4 


A 

0 


0 


0 


0 


0 


\A AAAA/1 1 C*7rv A AA 

MUUUU4 1 j /L/. AUy 


/£/1 

6455 


3 


1 


0 


0 


0 


0 


\/f AAAA/1 f ZQ/^-r 1 1 o 

MUUUU4 1 Oy U.C 1 2 


C5 1 A 

5319 


6 


2 


8 


2 


2 


3 


MUUUU4 l / 1 U.dUj 


49Uo 


0 


7 


2 


2 


2 


0 


\A AAAA/1 1 TO/^.r^AQ 
MUUUU41 fZ\*.V\)o 


1 1 /t A/1 

1 14V4 


A 

4 


A 

0 


A 

0 


0 


0 


0 




I**! AO 


J 


A 

u 


n 
U 


A 
U 


A 
U 


A 

0 


M00004185C:C03 


11443 


5 


1 


0 


0 


0 


0 


M00004197D:H01 


S210 


2 


6 


0 


0 


0 


0 


M00004203B:C12 


14311 


4 


0 


0 


0 


1 


2 


M00004212B:C07 


2379 


26 


13 


4 


2 


2 


3 


M00004214C:H05 


11451 


3 


2 


1 


2 


1 


1 


M00004223A:G10 


16918 


4 


0 


0 


0 


0 


0 


M00004223B:D09 


7899 


5 


4 


0 


2 


1 


0 


M00004223D:E04 


12971 


4 


0 


0 


0 


1 


0 
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Table 5 All Differential Data for Libs 1-4 and 8-9 



Cione Name 


Cluster 


Clones in 


Clones 




ID 


LlDl 


Lib2 


ivi uuu y d . r u o 


04 J D 


-5 
J 


i 
i 




I L 1 Z 




c 

J 


\a nc\f\ (\a o/corv r\n& 
MUUUU4Z0VU.UUD 


4yuj 


n 
I 


0 


MUUUU4Z / DULI 1 


1 AO lid 


1 
J 


A 

u 


X Af\i\f\f\ <noi D . A AvI 


14286 


3 


1 


M00004285B:E08 


56020 


1 


0 


M00004295D:F12 


16921 


4 


0 


M00004296C:H07 


13046 


4 


1 


M00004307C:A06 


9457 


2 


0 


M00004312A:G03 


26295 


2 


0 


M00004318C:D10 


21847 


2 


1 


MOO004372A:AO3 


2030 


13 


10 


M00004377C:F05 


2102 


12 


20 



Clones in Clones in Clones in Clones in 



T iWX 

LID J 


LID 4 


T ikO 

LlDO 




D 




A 

U 


A 

u 


9 


] 

1 


-5 
J 


A 

u 


J 


1 
1 


•3 
J 


1 

I 


n 
u 


1 
J 


A 
U 


A 
U 


0 


1 


1 


1 


0 


0 


0 


0 


0 


1 


2 


1 


0 


1 


0 


0 


5 


0 


3 


0 


0 


0 


0 


0 


0 


0 


0 


0 


32 


4 


0 


0 


23 


21 


6 


5 
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Table 6 All Differential Data for Libs 15-20 



Clone Name 


duster 


Clones in 


Clones in 


clones in 


Pinnae in 

Clones in 


clones in 


Clones in 




m 




IjIUIvU 


JjlUl / 


Lib 18 


Lib 19 


Lib20 


M00001340B*A06 


17062 


0 


0 


0 


0 


0 


0 


M00001340D-F10 


1 1589 


0 


0 


0 


0 


0 


0 


M00001341A-E12 


4443 


0 


0 


0 


1 


0 


0 


M00001342B*E06 


39805 


0 


0 


0 


0 


0 


0 


M00001343C-F10 


2790 


0 


0 


0 


0 


0 


0 


M00001 343D-H07 

1V1V/WV/ 1 J*T«/1-/.J IV/ / 


23255 


0 


0 


0 


0 


0 


0 


M0000134SA-F01 


6420 


0 


0 


0 


0 


0 


0 


Iviv/v/Uv l j*tvj.rv.rv7 


5007 


0 


0 


0 


0 


0 


0 


M000.01 3461>F03 

1V1V/V/V/V7 1 «/*tV/i-/.l-*V/J 


6806 


0 


n 

V 


0 


0 


0 


0 




S77Q 


0 


n 

v/ 


0 


0 


0 


0 


Mno.no i ^4fin-nofi 


5779 


0 


0 


0 


0 


0 


0 


M00001 347A-R10 


13576 


0 


0 


0 


0 


0 


0 


M00001 348R-B04 


16927 


0 


0 


0 


0 


0 


0 


M00001 348R-fi06 

1Y1V/V/V/V/ 1 J*TOJJ«VJV/V7 


16985 


0 


0 


0 


0 


0 


0 


M00001 34QR-R08 
iviv/v/U v i jtyjj .duo 




A 

V 


0 


0 


0 

V/ 


0 

V/ 


0 


M00001 7^0A 'HOI 


7187 


0 


0 


0 


0 

V/ 


0 


0 


MonnonsiR'AOK 


3169 


n 


1 

1 


0 

V/ 


0 


1 

1 


0 


Mnnnonsi r-aok 


3169 

J 1 VJX 


0 


1 

1 


0 


0 

V/ 


1 

1 


0 


N/ioono 1 1 <\9 a 'F09 


1 6945 


0 

u 


0 


n 


n 

V 




0 


mooooi tsta -ni9 


8078 
OVJ / 0 


0 


n 


0 


0 


0 


0 


Moooon^n-nin 

IVIUV/V/V 1 J J JU%U 1 w 


14929 


0 




1 
1 


0 


5 


0 


mooooi 3ssR*fii o 

IVIUvV/i/ iJJJDiVJ 1 V 


14391 


0 


0 


0 


0 

V/ 


n 

V/ 


0 


mooooi ^S7n*r)i i 


40 S 9 


0 


0 

V/ 


0 


0 


0 

V/ 


0 


MOOOOI 361 A'AOS 


4141 


0 


0 


0 


0 


0 


0 


MOOOOI 36 1D-F08 


2379 


0 


0 


0 


0 


0 


0 


MOOOOI 362R-D 10 

IVIV/V/VV 1 JVAil/<i/ 1 V/ 


5622 


0 


0 


0 


0 


0 


0 


MOOOOI 362C'H1 1 

iviv/vvvsv/ i jUj&Vy t n j j 


945 


0 


0 


0 


0 


0 


1 


M0000136SC*C10 


40132 

TV 1 Ji< 


0 


0 


0 


0 


0 


0 


MOOOOI 370A-C09 

IVIVV/V/V/ U / \Jf\»\—\J7 


6867 


0 


0 


0 

V/ 


0 


0 


0 


MOOOOI 37 1C-E09 


7172 


0 


0 


0 


0 


0 


0 


MOOOOI 376R-G06 

1Y1V/VJVJV/ 1 J / UO.VJVU 


17732 


0 


0 


0 


0 


0 


1 


M00001378B-B02 


39833 


0 


0 


0 


0 


0 


0 


M00001379AA05 


1334 


0 


0 


0 


0 


0 


1 


MOOOOI 3 80D-B09 


39886 


0 


0 


0 


0 


0 


0 


M00001382C-A02 


22979 


0 


0 


0 


0 


0 


0 


MOOOOI 383A"C03 


39648 


0 


0 


0 


0 


0 


0 


MOOOOI 383AC03 


39648 


0 


0 


0 


0 


0 


0 


M00001386OB12 


5178 


0 


0 


0 


0 


0 


0 


M00001387A-C05 


2464 


0 


0 


0 


0 


0 


0 


ivlUUUU 1 Jo /D.VjUj 


7S87 






0 


0 

1/ 


0 




M00001388D:G05 


5832 


0 


0 


0 


0 


0 


0 


M00001389A:C08 


16269 


0 


1 


0 


0 


0 


0 


M00001394A:F01 


6583 


l 


4 


1 


0 


0 


0 


M00001395A:C03 


4016 


0 


0 


0 


0 


0 


0 


M00001396A:C03 


4009 


0 


0 


0 


0 


0 


0 


M00001402A:E08 


39563 


0 


0 


0 


0 


0 


0 


M00001407B:D11 


5556 


0 


0 


0 


0 


0 


0 


M00001409C:D12 


9577 


0 


0 


0 


0 ■ 


0 


0 



297 



WO 99/33982 



PCT/US98/27610 



Table 6 All Differential Data for Libs 15-20 



Clone Name 


Cluster 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 




ID 


Libl5 


Libl6b 


Libl7 


Libl8 


Libl9 


Lib20 


M00001410A:D07 


7005 


0 


0 


0 


0 


0 


0 


M00001412B:B10 


8551 


0 


0 


0 


0 


0 


0 


M00001415A:H06 


13538 


0 


0 


0 


0 


0 


0 


M00001416A:H01 


7674 


0 


0 


0 


0 


0 


0 


M00001416B:H11 


8847 


0 


0 


0 


0 


0 


0 


M00001417A:E02 


36393 


0 


0 


0 


0 


0 


0 


M00001418B:F03 


9952 


0 


0 


0 


0 


0 


0 


M00001418D:B06 


8526 


0 


0 


0 


0 


0 


0 


M00001421C:F01 


9577 


0 


0 


0 


0 


0 


0 


M00001423B:E07 


15066 


0 


0 


0 


0 


0 


0 


M00001424B:G09 


10470 


0 


0 


0 


0 


0 


0 


M00001425B:H08 


22195 


0 


0 


0 


0 


0 


0 


M00001426D:C08 


4261 


0 


0 


1 


0 


0 


1 


M00001428A:H10 


84182 


0 


0 


0 


0 


0 


0 


M00001429A:H04 


2797 


0 


0 


0 


0 


0 


0 


M00001429B:A11 


4635 


0 


0 


0 


0 


0 


0 


MOOOOI429D:D07 


40392 


0 


0 


0 


0 


0 


0 


M00001439C:F08 


40054 


0 


0 


0 


0 


0 


0 


M00001442C:D07 


16731 


0 


0 


0 


0 


0 


0 


M00001445A:F05 


13532 


0 


0 


0 


0 


0 


0 


M00001446A:F05 


7801 


0 


0 


0 


0 


0 


0 


M00001447A:G03 


10717 


0 


0 


0 


0 


0 


0 


M00001448D:C09 


8 


1 


6 


6 


1 


14 


1 


M00001448D:H01 


36313 


0 


3 


0 


0 


3 


0 


M00001449A:A12 


5857 


0 


0 


0 


0 


0 


0 


M00001449A:B12 


41633 


0 


0 


0 


0 


0 


0 


M00001449A:D12 


3681 


0 


0 


0 


0 


0 


0 


M00001449A:G10 


36535 


0 


0 


0 


0 


0 


0 


M00001449C:D06 


86110 


0 


0 


0 


0 


0 


0 


M00001450A:A02 


39304 


0 


0 


0 


0 


0 


0 


M00001450A:A11 


32663 


0 


0 


0 


0 


0 


0 


M00001450A:B12 


82498 


0 


0 


0 


0 


0 


0 


M00001450A:D08 


27250 


0 


0 


0 


0 


0 


0 


M00001452A:B04 


84328 


0 


0 


0 


0 


0 


0 


M00001452A:B12 


86859 


0 


0 


0 


0 


0 


0 


M00001452A:D08 


1120 


0 


0 


0 


0 


0 


0 


M00001452A:F05. 


85064 


0 


0 


0 


0 


0 


0 


M00001452C:B06 


16970 


0 


0 


2 


0 


1 


0 


M00001453A:E11 


16130 


0 


0 


0 


0 


0 


0 


M00001453C:F06 


16653 


0 


0 


0 


0 


0 


0 


M00001454A:A09 


83103 


0 


0 


0 


0 


0 


0 


M00001454B:C12 


7005 


0 


0 


0 


0 


0 


0 


M00001454D:G03 


689 


0 


2 


2 


0 


4 


2 


M00001455A:E09 


13238 


0 


0 


0 


0 


0 


0 


M00001455B:E12 


13072 


0 


0 


0 


0 


0 


0 


M00001455D:F09 


9283 


0 


0 


0 


0 


0 


0 


M00001455D:F09 


9283 


0 


0 


0 


0 


0 


0 


M00001460A:F06 


2448 


0 


0 


0 


0 


0 


0 


M00001460A:F12 


39498 


0 


0 


0 


0 


0 


0 
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Table 6 All Differential Data for Libs 15-20 



Clone Name 


Cluster 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 




ID 


Libl5 


Libl6b 


Libl7 


Libl8 


Libl9 


Lib20 


M00001461A:D06 


1531 


0 


0 


0 


0 


0 


0 


M00001463C:B11 


19 


2 


13 


13 


0 


69 


10 


M00001465A:B11 


10145 


0 


0 


0 


0 


0 


0 


M00001466A:E07 


4275 


0 


0 


0 


0 


0 


0 


M00001467A:B07 


38759 


0 


0 


0 


0 


0 


0 


M00001467A:D04 


39508 


0 


0 


0 


0 


0 


0 


M00001467A:D08 


16283 


0 


0 


0 


0 


0 


0 


M00001467A:D08 


16283 


0 


0 


0 


0 


0 


0 


M00001467A:E10 


39442 


0 


0 


0 


0 


0 


0 


M00001468A:F05 


7589 


0 


0 


0 


0 


0 


0 


M00001469A:C10 


12081 


0 


0 


0 


0 


0 


0 


M00001469A:H12 


19105 


0 


0 


0 


0 


0 


0 


M00001470A:B10 


1037 


0 


0 


0 


0 


0 


0 


M00001470A:C04 


39425 


0 


0 


0 


0 


0 


0 


M00001471A:B01 


39478 


0 


0 


0 


0 


0 


0 


M0000I481D:A05 


7985 


0 


0 


0 


0 


0 


0 


M00001490B:C04 


18699 


0 


0 


0 


0 


0 


0 


M00001494D:F06 


7206 


0 


0 


0 


0 


0 


0 


M00001497A:G02 


2623 


0 


0 


0 


0 


0 


0 


M00001499B:A11 


10539 


0 


0 


0 


0 


0 


0 


MO0OO1500A:CO5 


5336 


0 


0 


0 


0 


0 


0 


M00001500A:E11 


2623 


0 


0 


0 


0 


0 


0 


M00001500C:E04 


9443 


0 


0 


0 


0 


0 


0 


M00001501D:C02 


9685 


0 


0 


0 


0 


0 


0 


M00001504C:A07 


10185 


0 


0 


0 


0 


0 


0 


M00001504C:H06 


6974 


0 


0 


0 


0 


0 


0 


M00001504D:G06 


6420 


0 


0 


0 


0 


0 


0 


M00001507A:H05 


39168 


0 


0 


0 


0 


0 


0 


M00001511A:H06 


39412 


0 


0 


0 


0 


0 


0 


M00001512A:A09 


39186 


0 


0 


0 


0 


0 


0 


M00001512D:G09 


3956 


0 


0 


1 


0 


0 


0 


M00001513A:B06 


4568 


0 


0 


0 


0 


0 


0 


M00001513C:E08 


14364 


0 


0 


0 


0 


0 


0 


M00001514C:D11 


40044 


0 


1 


0 


0 


0 


0 


M00001517A:B07 


4313 


0 


0 


0 


0 


0 


0 


M00001518C:B11 


8952 


0 


0 


0 


0 


0 


0 


M00001528A:C04 


7337 


0 


0 


0 


0 


0 


0 


MOO001528A:F09 


18957 


0 


0 


0 


0 


0 


0 


M00001528B:H04 


8358 


0 


0 


0 


0 


0 


0 


M00001531A:D01 


38085 


0 


0 


0 


0 


0 


0 


M00001532B:A06 


3990 


1 


1 


0 


0 


0 


0 


M00001533A:C11 


2428 


0 


0 


1 


0 


0 


0 


M00001534A:C04 


16921 


0 


0 


0 


0 


0 


0 


M00001534A:D09 


5097 


0 


0 


0 


0 


0 


0 


M00001534A:F09 


5321 


0 


1 


0 


0 


2 


0 


M00001534C-.A01 


4119 


0 


0 


0 


0 


0 


0 


M00001535A:B01 


7665 


0 


0 


0 


0 


0 


0 


MOO001535A:C06 


20212 


0 


0 


0 


0 


0 


0 


M00001535A:F10 


39423 


0 


0 


0 


0 


0 


0 
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Table 6 AH Differential Data for Libs 15-20 



Clone Name 


Cluster 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 




ID 


LiblS 


Libl6b 


Libl7 


Libl8 


Libl9 


Lib20 


M00001536A:B07 


2696 


0 


0 


0 


0 


3 


0 


M00001536A:C08 


39392 


0 


0 


0 


0 


0 


0 


M00001537A:F12 


39420 


0 


0 


0 


0 


0 


0 


M00001537B:G07 


3389 


0 


0 


0 


0 


0 


0 


M00001540A:D06 


8286 


0 


0 


0 


0 


0 


0 


M00001541A:D02 


3765 


0 


0 


0 


0 


0 


0 


M00001541A:F07 


22085 


0 


0 


0 


0 


0 


0 


M00001541A:H03 


39174 


0 


0 


0 


0 


0 


0 


M00001542A:A09 


22113 


0 


0 


0 


0 


0 


0 


M00001542A:E06 


39453 


0 


0 


0 


0 


0 


0 


M00001544A:E03 


12170 


0 


0 


0 


0 


0 


0 


M00001544A:G02 


19829 


0 


0 


0 


0 


0 


0 


M00001544B:B07 


6974 


0 


0 


0 


0 


0 


0 


M00001545A:C03 


19255 


0 


0 


0 


0 


0 


0 


M00001545A:D08 


13864 


0 


0 


0 


0 


0 


0 


M00001546A:G11 


1267 


1 


0 


0 


0 


7 


0 


M00001548A:E10 


5892 


0 


0 


0 


0 


0 


0 


M00001548A:H09 


1058 


0 


0 


1 


0 


0 


0 


M00001549A:B02 


4015 


0 


0 


0 


0 


0 


0 


M00001549A:D08 


10944 


0 


0 


0 


0 


0 


0 


M00001549B:F06 


4193 


0 


0 


0 


0 


0 


0 


M00001549C:E06 


16347 


0 


0 


0 


0 


0 


0 


M00OO1550A:AO3 


7239 


0 


0 


0 


0 


0 


0 


M00001550A:G01 


5175 


0 


0 


0 


0 


0 


0 


M00001551A:B10 


6268 


0 


0 


0 


0 


0 


0 


M00001551A:F05 


39180 


0 


0 


0 


0 


0 


0 


M00001551A:G06 


22390 


0 


0 


0 


0 


0 


0 


M00001551C:G09 


3266 


0 


0 


1 


0 


0 


0 


M00001552A:B12 


307 


0 


0 


0 


0 


3 


0 


M00001552A:D11 


39458 


0 


0 


0 


0 


0 


0 


M00001552B:D04 


5708 


0 


1 


0 


0 


0 


0 


M00001553A:H06 


8298 


0 


0 


0 


0 


0 


0 


M00001553B:F12 


4573 


0 


0 


0 


0 


0 


0 


M00001553D:D10 


22814 


0 


0 


0 


0 


0 


0 


M00001555A:B02 


39539 


0 


0 


0 


0 


0 


0 


M00001555A:C01 


39195 


0 


0 


0 


0 


0 


0 


M00001555D:G10 


4561 


0 


0 


0 


0 


0 


0 


M00001556A:C09 


9244 


0 


0 


0 


0 


0 


0 


MG0001556A:F11 


1577 


0 


0 


0 


0 


0 


0 


M00001556A:H01 


15855 


3 


5 


5 


0 


3 


1 


M00001556B:C08 


4386 


1 


2 


0 


0 


0 


0 


M00001556B:G02 


11294 


0 


0 


0 


0 


0 


0 


M00001557A:D02 


7065 


0 


0 


0 


0 


0 


0 


M00001557A:D02 


7065 


0 


0 


0 


0 


0 


0 


M00001557A:F01 


9635 


0 


0 


0 


0 


0 


0 


M00001557A:F03 


39490 


0 


0 


0 


0 


0 


0 


M00001557B:H10 


5192 


0 


0 


0 


0 


0 


0 


M00001557D:D09 


8761 


0 


0 


0 


0 


0 


0 


M00001558B:H11 


7514 


0 


0 


0 


0 


0 


0 
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Table 6 All Differential Data for Libs 15-20 



Clone Name 


Cluster 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 




ID 


Libl5 


Libl6b 


Libl7 


Libl8 


Libl9 


Lib20 


M00001560D:F10 


6558 


0 


0 


0 


0 


0 


0 


M00001561A:C05 


39486 


0 


0 


0 


0 


0 


0 


M00001563B:F06 


102 


22 


38 


65 


7 


43 


10 


M00001564A:B12 


5053 


0 


0 


1 


0 


0 


0 


M00001571C:H06 


5749 


0 


0 


0 


0 


0 


0 


M00001578B:E04 


23001 


0 


0 


0 


0 


0 


0 


M00001579D:C03 


6539 


0 


0 


0 


0 


0 


0 


M00001583D:A10 


6293 


0 


0 


0 


0 


0 


0 


M00001586C:C05 


4623 


0 


0 


0 


0 


1 


0 


M0OO0I587A:Bll 


39380 


0 


0 


0 


0 


0 


0 


M00001594B:H04 


260 


0 


0 


0 


0 


1 


0 


M00001597C:H02 


4837 


0 


0 


0 


0 


0 


0 


M00001597D:C05 


10470 


0 


0 


0 


0 


0 


0 


M00001598A:G03 


16999 


1 


1 


1 


0 


0 


0 


M00001601A:D08 


22794 


0 


0 


0 


0 


0 


0 


M00001604A:B10 


1399 


0 


0 


0 


0 


0 


0 


M00001604A:F05 


39391 


0 


0 


0 


0 


0 


0 


M00001607A:E11 


11465 


0 


0 


0 


0 


0 


0 


M000016O8A:B03 


7802 


0 


0 


0 


0 


0 


0 


M00001608B:E03 


22155 


0 


0 


0 


0 


0 


0 


M00001614C:F10 


13157 


0 


0 


0 


0 


0 


0 


M00001617C:E02 


17004 


0 


0 


0 


0 


1 


0 


M00001619C:F12 


40314 


0 


0 


0 


0 


0 


0 


M00001621C:C08 


40044 


0 


1 


0 


0 


0 


0 


M00001623D:F10 


13913 


0 


0 


0 


0 


0 


0 


M00001624A:B06 


3277 


0 


0 


0 


0 


0 


0 


M00001624C:F01 


4309 


0 


0 


0 


0 


0 


0 


M00001630B:H09 


5214 


1 


0 


0 


1 


1 


0 


M00001644C:B07 


39171 


0 


0 


0 


0 


0 


0 


M00001645A:C12 


19267 


0 


0 


0 


0 


1 


0 


M00001648C:A01 


4665 


0 


0 


0 


0 


0 


0 


M00001657D:C03 


23201 


0 


0 


0 


0 


0 


0 


M00001657D:F08 


76760 


0 


0 


0 


0 


0 


0 


M00001662C:A09 


23218 


0 


0 


0 


0 


0 


0 


M00001663A:E04 


35702 


0 


0 


0 


0 


0 


0 


M00001669B:F02 


6468 


0 


0 


0 


0 


0 


0 


M00001670C:H02 


14367 


0 


0 


0 


0 


0 


0 


M00001673C-H02 


7015 


0 


0 


0 


0 


0 


0 


M00001675A:C09 


8773 


0 


0 


0 


0 


0 


0 


M00001676B:F05 


11460 


0 


0 


0 


0 


0 


0 


M00001677C:E10 


14627 


0 


1 


0 


0 


0 


0 


M00001677D:A07 


7570 


0 


0 


0 


0 


0 


0 


M00001678D:F12 


4416 


0 


0 


0 


0 


0 


0 


M00001679A:A06 


6660 


0 


0 


0 


0 


0 


0 


M00001679A:F10 


26875 


0 


0 


0 


0 


0 


0 


M00001679B:F01 


6298 


0 


0 


0 


0 


0 


0 


M00001679C:F01 


78091 


0 


0 


0 


0 


0 


0 


M00001679D:D03 


10751 


0 


0 


0 


0 


0 


0 


M00001679D:D03 


10751 


0 


0 


0 


0 


0 


0 
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Table 6 All Differential Data for Libs 15-20 



Clone Name 


Cluster 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 




ID 


LiblS 


Libl6b 


Libl7 


Libl8 


Libl9 


Lib20 


M00001680D:F08 


10539 


0 


0 


0 


0 


0 


0 


M00001682C:B12 


17055 


0 


0 


0 


0 


0 


0 


M00001686A:E06 


4622 


0 


0 


0 


0 


0 


0 


M00001688C:F09 


5382 


0 


0 


0 


0 


0 


0 


M00001693C:G01 


4393 


0 


0 


0 


0 


0 


0 


M00001716D:H05 


67252 


0 


0 


0 


0 


0 


0 


MO0003741D:C09 


40108 


0 


0 


0 


0 


0 


0 


M00003747D:C05 


11476 


0 


0 


0 


0 


0 


0 


M00003759B:B09 


697 


0 


0 


0 


0 


1 


0 


M00003762C:B08 


17076 


0 


0 


0 


0 


0 


0 


M00003763A:F06 


3108 


0 


0 


0 


0 


0 


0 


M00003774C:A03 


67907 


0 


0 


0 


0 


0 


0 


M00003796C:D05 


5619 


0 


0 


0 


0 


0 


0 


M00003826B:A06 


11350 


0 


0 


0 


0 


0 


0 


M00003833A:E05 


21877 


0 


0 


0 


0 


0 


0 


M00003837D:A01 


7899 


0 


0 


0 


0 


0 


0 


MO0003839A:D08 


7798 


0 


0 


0 


0 


0 


0 


M00003844C:B11 


6539 


0 


0 


0 


0 


0 


0 


M00003846B:D06 


6874 


0 


0 


1 


0 


0 


0 


M00003851B:D10 


13595 


0 


0 


0 


0 


0 


0 


M00003853A:D04 


5619 


0 


0 


0 


0 


0 


0 


MO0O03853A:F12 


10515 


0 


0 


0 


0 


0 


0 


M00003856B:C02 


4622 


0 


0 


0 


0 


0 


0 


M00003857A:G10 


3389 


0 


0 


0 


0 


0 


0 


M00003857A:H03 


4718 


0 


0 


0 


0 


0 


0 


MO0O03871C:EO2 


4573 


0 


0 


0 


0 


0 


0 


M00003875B:F04 


12977 


0 


0 


0 


0 


0 


0 


M00003875B:F04 


12977 


0 


0 


0 


0 


0 


0 


M00003875C:G07 


8479 


0 


0 


0 


0 


0 


1 


M00003876D:E12 


7798 


0 


0 


0 


0 


0 


0 


M00003879B:C11 


5345 


0 


0 


0 


2 


0 


1 


M00003879B:D10 


31587 


0 


0 


0 


0 


0 


0 


M00003879D:A02 


14507 


0 


0 


0 


0 


0 


0 


MO00O3885C:AO2 


13576 


0 


0 


0 


0 


0 


0 


M00003885C:A02 


13576 


0 


0 


0 


0 


0 


0 


M00003906C:E10 


9285 


0 


0 


0 


0 


0 


0 


M00003907D:A09 


39809 


0 


0 


0 


0 


0 


0 


M00003907D:H04 


16317 


0 


0 


0 


0 


0 


0 


M00003909D:C03 


8672 


.0 


0 


0 


0 


0 


0 


M00003912B:D01 


12532 


0 


0 


0 


0 


0 


0 


M00003914C:F05 


3900 


0 


0 


0 


0 


1 


0 


M00003922A:E06 


23255 


0 


0 


0 


0 


0 


0 


M00003958A:H02 


18957 


0 


0 


0 


0 


0 


0 


M00003958A:H02 


18957 


0 


0 


0 


0 


0 


0 


M00003958C:G10 


40455 


0 


0 


0 


0 


0 


0 


M00003958C:G10 


40455 


0 


0 


0 


0 


0 


0 


M00003968B:F06 


24488 


0 


0 


0 


0 


0 


0 


M00003970C:B09 


40122 


0 


0 


0 


0 


0 


0 


M00003974D:E07 


23210 


0 


0 


0 


0 


0 


0 
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Table 6 All Differential Data for Libs 15-20 



Clone Name 


Cluster 


Clones in 




ID 


Libl5 


M00003974D:H02 


23358 


0 


M00003975A:G11 


12439 


0 


M00003978B:G05 


5693 


0 


M00003981A:E10 


3430 


0 


M0O0O3982C:CO2 


2433 


0 


M00003983A:A05 


9105 


0 


M00004028D:A06 


6124 


0 


M00004028D:C05 


40073 


0 


M00004031A:A12 


9061 


0 


M00004031A:A12 


9061 


0 


M00004035C:A07 


37285 


0 


M00004035D:B06 


17036 


0 


M00004059A:D06 


5417 


0 


M00004068B:A01 


3706 


0 


M00004072B:B05 


17036 


0 


M00004081C:D10 


15069 


0 


M00004081C:D12 


14391 


0 


M00004086D:G06 


9285 


0 


M00004087D:A01 


6880 


0 


M00004093D:B12 


5325 


1 


M00004093D:B12 


5325 


1 


M00004105C:A04 


7221 


0 


M00004108A:E06 


4937 


0 


M00004111D:A08 


6874 


0 


M00004114C:F11 


13183 


0 


M00004138B:H02 


13272 


0 


M00004146C:C11 


5257 


0 


M00004151D:B08 


16977 


0 


M00004157C:A09 


6455 


0 


M00004169C:C12 


5319 


0 


M00004171D:B03 


4908 


0 


M00004172C:D08 


11494 


0 


M00004183C:D07 


16392 


0 


M0O004185C:C03 


11443 


0 


M00004197D:H01 


8210 


0 


M00004203B:C12 


14311 


0 


M00004212B:C07 


2379 


0 


M00004214C:H05 


11451 


0 


M00004223A:G10 


16918 


0 


M00004223B:D09 


7899 


0 


M00004223D:E04 


12971 


0 


M00004229B:F08 


6455 


0 


M00004230B:C07 


7212 


0 


M00004269D:D06 


4905 


0 


M00004275C:C11 


16914 


0 


M00004283B:A04 


14286 


0 


M00004285B:E08 


56020 


0 


M00004295D:F12 


16921 


0 


M00004296C:H07 


13046 


0 



Clones in Clones in Clones in Clones in Clones in 



Libl6b 


Libl7 


Libl8 


Libl9 


Lib20 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


1 


0 


1 


0 


1 


1 


0 


1 


0 


1 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 
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Table 6 All Differential Data for Libs 15-20 



Clone Name 


Cluster 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 




ID 


Libl5 


Libl6b 


Libl7 


Libl8 


Libl9 


Lib20 


M00004307C:A06 


9457 


0 


0 


0 


0 


0 


0 


M00004312A:G03 


26295 


0 


0 


0 


0 


0 


0 


M00004318C:D10 


21847 


0 


0 


0 


0 


0 


0 


M00004372A:AO3 


2030 


0 


0 


0 


0 


0 


0 


M00004377C:F05 


2102 


0 


0 


0 


0 


0 


o' 
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Table 7 All Differential Data for Libs 12-14 
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Clone Name 


Cluster ID 


Clones in 


Clones in 


Clones in 






Libl2 


Libl3 


Libl4 


M00001340B:A06 


17062 


0 


0 


0 


M00001340D:F10 


11589 


0 


0 


0 


M00001341A:E12 


4443 


4 


2 


0 


M00001342B:E06 


39805 


0 


0 


0 


M00001343C:F10 


2790 


0 


0 


0 


M00001343D:H07 


23255 


0 


0 


0 


M00001345A:E01 


6420 


0 


0 


0 


M00001346A:F09 


5007 


0 


0 


0 


M00001346D:E03 


6806 


0 


1 


1 


M00001346D:G06 


5779 


0 


0 


0 


M00001346D:G06 


5779 


0 


0 


0 


M00001347A:B10 


13576 


0 


0 


0 


M00001348B:B04 


16927 


0 


0 


0 


M00001348B:G06 


16985 


0 


0 


0 


M00001349B:B08 


3584 


0 


0 


0 


M00001350A:H01 


7187 


0 


0 


0 


M00001351B:A08 


3162 


0 


.0 


1 


M0000135iB:A08 


3162 


0 


0 


1 


M00001352A:E02 


16245 


0 


0 


0 


M00001353A:G12 


8078 


0 


0 


0 


M00001353D:D10 


14929 


0 


1 


0 


M00001355B:G10 


14391 


0 


0 


0 


M00001357D:D11 


4059 


0 


0 


0 


M00001361A:A05 


4141 


1 


2 


1 


M00001361D:F08 


2379 


0 


0 


0 


M00001362B:D10 


5622 


0 


2 


1 


M00001362C:H11 


945 


0 


0 


0 


M00001365C:C10 


40132 


0 


0 


0 


M00001370A:C09 


6867 


0 


0 


0 


M00001371C:E09 


7172 


0 


0 


1 


M00001376B:G06 


17732 


2 


0 


0 


M00001378B:B02 


39833 


0 


0 


0 


M00001379A:A05 


1334 


0 


0 


0 


M00001380D:B09 


39886 


0 


0 


0 


M00001382C:A02 


22979 


1 


0 


0 


M00001383A:C03 


39648 


0 


0 


0 


M00001383A:C03 


39648 


0 


0 


0 


M00001386C:B12 


5178 


0 


0 


0 


M00001387A:C05 


2464 


0 


0 


0 


M00001387B:G03 


7587 


0 


0 


0 


MO0001388D:G05 


5832 


0 


0 


0 


M00001389A:C08 


16269 


2 


0 


0 


M00001394A:F01 


6583 


0 


0 


0 


M00001395A:C03 


4016 


0 


0 


0 


M00001396A:C03 


4009 


2 


0 


0 


M00001402A:E08 


39563 


0 


0 


0 


M00001407B:DU 


5556 


0 


0 


0 
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Table 7 All Differential Data for Libs 12-14 



PCT/US98/27610 



Clone Name 


Cluster ID 


Clones in 


Clones in 


Clones it 






Libl2 


Libl3 


Libl4 


M00001409C:D12 


9577 


0 


0 


0 


M00001410A:D07 


7005 


0 


0 


0 


M00001412B:B10 


8551 


0 


0 


0 


M00001415A:H06 


13538 


0 


0 


0 


M00001416A:H01 


7674 


0 


0 


0 


M00001416B:H11 


8847 


1 


0 


0 


M00001417A:E02 


36393 


0 


0 


0 


M00001418B:F03 


9952 


0 


0 


0 


M00001418D:B06 


8526 


0 


0 


0 


MO0O01421C:FOl 


9577 


0 


0 


0 


M00001423B:E07 


15066 


0 


0 


0 


M00001424B:G09 


10470 


0 


0 


0 


M00001425B:H08 


22195 


0 


0 


0 


M00001426D:C08 


4261 


0 


0 


0 


M00001428A:H10 


84182 


0 


0 


0 


M00001429A:H04 


2797 


0 


0 


0 


M00001429B:A11 


4635 


0 


0 


0 


M00001429D:D07 


40392 


0 


0 


0 


M00001439C:F08 


40054 


0 


0 


0 


M00001442C.D07 


16731 


0 


0 


0 


M00001445A:F05 


13532 


0 


0 


0 


M00001446A:F05 


7801 


0 


1 


0 


M00001447A:G03 


10717 


0 


0 


0 


M00001448D:C09 


8 


7 


6 


9 


M00001448D:H01 


36313 


1 


0 


0 


M00001449A:A12 


5857 


0 


0 


0 


M00001449A:B12 


41633 


0 


0 


0 


M00001449A:D12 


3681 


1 


0 


0 


M00001449A:G10 


36535 


0 


0 


0 


M00001449C:D06 


86110 


0 


0 


0 


M00001450A:A02 


39304 


0 


1 


0 


M00001450A:A11 


32663 


0 


0 


0 


M00001450A:B12 


82498 


0 


0 


0 


M00001450A:D08 


27250 


0 


0 


0 


M00001452A:B04 


84328 


0 


0 


0 


M00001452A:B12 


86859 


0 ' 


0 


0 


M00001452A:D08 


1120 


0 


0 


0 


M00001452A:F05 


85064 


0 


0 


0 


M00001452C:B06 


16970 


1 


0 


0 


M00001453A:E11 


16130 


0 


0 


0 


M00001453C:F06 


16653 


0 


0 


0 


M00001454A:A09 


83103 


0 


0 


0 


M00001454B:C12 


7005 


0 


0 


0 


M00001454D:G03 


689 


0 


0 


1 


M00001455A:E09 


13238 


0 


0 


0 


M00001455B:E12 


13072 


0 


0 


0 


M00001455D:F09 


9283 


0 


0 


0 


M00001455D:F09 


9283 


0 


0 


0 
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Table 7 All Differential Data for Libs 12-14 



Clone Name 


Cluster ID 


Clones in 


Clones in 


Clones in 






Libl2 


Libl3 


Libl4 


M00001460A:F06 


2448 


0 


0 


0 


M00001460A:F12 


39498 


0 


0 


0 


M00001461A:D06 


1531 


0 


0 


1 


M00001463C:B11 


19 


17 


32 


31 


M00001465A:B11 


10145 


0 


0 


0 


M00001466A:E07 


4275 


0 


0 


0 


M00001467A:B07 


38759 


0 


0 


0 


M00001467A:D04 


39508 


0 


0 


0 


M00001467A:D08 


16283 


0 


0 


0 


M00001467A:D08 


16283 


0 


0 


0 


M00001467A:E10 


39442 


0 


0 


0 


M00001468A:F05 


7589 


0 


0 


0 


M00001469A:C10 


12081 


0 


0 


0 


M00001469A:H12 


19105 


0 


0 


0 


M00001470A:B10 


1037 


0 


0 


0 


M00001470A:C04 


39425 


0 


0 


0 


M00001471A:B01 


39478 


0 


0 


0 


M00001481D:A05 


7985 


0 


0 


0 


M00001490B-.C04 


18699 


0 


0 


0 


M00001494D:F06 


7206 


0 


0 


0 


M00001497A:G02 


2623 


1 


0 


0 


M00001499B:A11 


10539 


0 


1 


0 


M00001500A:C05 


5336 


0 


0 


0 


M00001500A:E11 


2623 


1 


0 


0 


M00001500C:E04 


9443 


0 


0 


0 


M00001501D:C02 


9685 


0 


0 


0 


M00001504C:A07 


10185 


0 


0 


0 


M00001504C:H06 


6974 


0 


0 


0 


M00001504D:G06 


6420 


0 


0 


0 


M000015O7A:H05 


39168 


0 


0 


0 


M00001511A:H06 


39412 


0 


0 


0 


M00001512A:A09 


39186 


0 


0 


0 


M00001512D:G09 


3956 


0 


0 


0 


M00001513A:B06 


4568 


0 


0 


0 


M00001513C:E08 


14364 


0 


0 


0 


M00001514C:D11 


40044 


0 


0 


0 


M00001517A:B07 


4313 


0 


0 


0 


M00001518C:B11 


8952 


0 


0 


0 


M00001528A:C04 


7337 


1 


2 


2 


M00001528A:F09 


18957 


0 


0 


0 


M00001528B:H04 


8358 


0 


0 


0 


M00001531A:D01 


38085 


0 


0 


0 


M00001532B:A06 


3990 


0 


0 


0 


M00001533A:C11 


2428 


0 


0 


0 


M00001534A:C04 


16921 


0 


0 


0 


M00001534A:D09 


5097 


0 


0 


0 


M00001534A:F09 


5321 


4 


7 


6 


M00001534C:A01 


4119 


0 


0 


0 
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Table 7 All Differential Data for Libs 12-14 



Clone Name 


Cluster ID 


Clones in 


Clones in 


Clones in 






Libl2 


Libl3 


Libl4 


M00001535A:B01 


7665 


0 


2 


4 


M00001535A:C06 


20212 


0 


0 


0 


M00001535A:F10 


39423 


0 


0 


0 


M00001536A:B07 


2696 


0 


0 


0 


M00001536A:C08 


39392 


0 


0 


0 


M00001537A:F12 


39420 


0 


0 


0 


M00001537B:G07 


3389 


0 


0 


0 


M00001540A:D06 


8286 


0 


0 


0 


M00001541A:D02 


3765 


0 


0 


0 


M00001541A:F07 


22085 


0 


0 


0 


M00001541A:H03 


39174 


0 


0 


0 


M00001542AA09 


22113 


0 


0 


0 


M00001542A:E06 


39453 


0 


0 


0 


M00001544A:E03 


12170 


0 


0 


0 


M00001544A:G02 


19829 


0 


0 


0 


M00001544B:B07 


6974 


0 


0 


0 


M00001545A:C03 


19255 


0 


0 


0 


M00001545A:D08 


13864 


0 


0 


0 


M00001546A:G11 


1267 


0 


0 


0 


M00001548A:E10 


5892 


0 


1 


0 


M00001548A:H09 


1058 


1 


3 


0 


M00001549A:B02 


4015 


0 


1 


0 


M00001549A:D08 


10944 


1 


0 


0 


M00001549B:F06 


4193 


0 


0 


0 


M00001549C:E06 


16347 


0 


0 


0 


M00001550A:A03 


7239 


0 


1 


0 


M00001550A:G01 


5175 


1 


0 


0 


M00001551A:B10 


6268 


0 


0 


1 


M00001551A:F05 


39180 


0 


0 


0 


M00001551A:G06 


22390 


0 


0 


1 


M00001551C:G09 


3266 


0 


0 


0 


M00001552A:B12 


307 


6 


11 


4 


M00001552A:D11 


39458 


0 


0 


0 


M00001552B:D04 


5708 


0 


0 


0 


M00001553A:H06 


8298 


0 


0 


0 


M00001553B:F12 


4573 


0 


0 


0 


M00001553D:D10 


22814 


0 


0 


0 


M00001555A:B02 


39539 


0 


0 


0 


M00001555A:C01 


39195 


0 


0 


0 


M00001555D:G10 


4561 


0 


0 


0 


M00001556A:C09 


9244 


0 


1 


0 


M00001556A:F11 


1577 


0 


0 


2 


M00001556A:H01 


15855 


1 


1 


0 


M00001556B:C08 


4386 


3 


0 


1 


M00001556B:G02 


11294 


0 


0 


0 


M00001557A:D02 


7065 


0 


0 


0 


M00001557A:D02 


7065 


0 


0 


0 


M00001557A:F01 


9635 


0 


0 


0 
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Table 7 All Differential Data for Libs 12-14 



Clone Name 


Cluster ID 


Clones in 


Clones in 


Clones in 






Libl2 


Libl3 


Libl4 


M00001557A:F03 


39490 


0 


0 


0 


M00001557B:H10 


5192 


0 


0 


0 


M00001557D:D09 


8761 


0 


0 


0 


M00001558B:H11 


7514 


0 


0 


0 


M00001560D:F10 


6558 


0 


0 


0 


M00001561A:CO5 


39486 


0 


0 


0 


M00001563B:F06 


102 


2 


1 


2 


M00001564A:B12 


5053 


0 


0 


0 


M00001571C:HO6 


5749 


0 


0 


0 


M00001578B:E04 


23001 


0 


0 


0 


M00001579D:C03 


6539 


0 


0 


0 


M00001583D:A10 


6293 


0 


0 


0 


M000O1586C:CO5 


4623 


0 


0 


0 


M00001587A:B11 


39380 


0 


0 


0 


M00001594B:H04 


260 


1 


0 


0 


M00001597C:H02 


4837 


1 


0 


0 


M00001597D:C05 


10470 


0 


0 


0 


M00001598A:G03 


16999 


4 


2 


6 


M00001601A:D08 


22794 


0 


0 


0 


M00001604A:B10 


1399 


6 


3 


3 


M00001604A:F05 


39391 


0 


0 


0 


M00001607A:E11 


11465 


0 


0 


0 


M00001608A:B03 


7802 


0 


0 


0 


M00001608B:E03 


22155 


0 


0 


0 


M00001614C:F10 


13157 


0 


0 


0 


M00001617C:E02 


17004 


0 


0 


0 


M00001619C-.F12 


40314 


0 


0 


0 


M00001621C:C08 


40044 


0 


0 


0 


M00001623D:F10 


13913 


0 


0 


0 


M00001624A:B06 


3277 


0 


0 


0 


M00001624C:F01 


4309 


0 


0 


0 


M00001630B:H09 


5214 


0 


1 


2 


M00001644C:B07 


39171 


0 


0 


0 


M00001645A:C12 


19267 


0 


0 


0 


M00001648C:A01 


4665 


0 


0 


0 


M00001657D:C03 


23201 


0 


0 


0 


M00001657D:F08 


76760 


0 


0 


0 


M00001662C:A09 


23218 


0 


0 


0 


M00001663A:E04 


35702 


0 


0 


0 


M00001669B:F02 


6468 


0 


0 


0 


M00001670C:H02 


14367 


0 


0 


0 


M00001673C:H02 


7015 


0 


0 


0 


M00001675A:C09 


8773 


0 


0 


0 


M00001676B:F05 


11460 


2 


0 


0 


M00001677C:E10 


14627 


0 


0 


0 


M00001677D:A07 


7570 


0 


0 


0 


M00001678D:F12 


4416 


1 


2 


0 


M00001679A:A06 


6660 


0 


0 


0 
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Table 7 All Differential Data for Libs 12-14 



PCT/US98/27610 



Clone Name 


Cluster ID 


Clones in 


Clones in 


Clones in 






Libl2 


Libl3 


Libl4 


M00001679A:F10 


26875 


0 


0 


0 


M00001679B:F01 


6298 


0 


0 


0 


M00001679C:F01 


78091 


0 


0 


0 


M00001679D:D03 


10751 


0 


0 


0 


MOOOOI679D:D03 


10751 


0 


0 


0 


M00001680D:F08 


10539 


0 


1 


0 


M00001682C:B12 


17055 


0 


0 


0 


M00001686A:E06 


4622 


0 


0 


0 


M00001688C:F09 


5382 


0 


0 


0 


M00001693C:G01 


4393 


0 


0 


0 


M00001716D:H05 


67252 


0 


0 


0 


M00003741D:C09 


40108 


0 


0 


0 


M00003747D:C05 


11476 


0 


0 


0 


M00003759B:B09 


697 


0 


0 


0 


M00003762C:B08 


17076 


0 


0 


0 


M00003763A:F06 


3108 


0 


0 


0 


MO0003774C:AO3 


67907 


0 


0 


0 


M00003796C:D05 


5619 


0 


1 


0 


M00003826B:AO6 


11350 


0 


0 


0 


MO0003833A:EO5 


21877 


0 


0 


0 


M00003837D:A01 


7899 


0 


0 


0 


M00003839A:D08 


7798 


0 


0 


0 


M00003844C:B11 


6539 


0 


0 


0 


M00003846B:D06 


6874 


0 


0 


0 


M00003851B:D10 


13595 


0 


0 


0 


M00003853A:D04 


5619 


0 


1 


0 


M00003853A:F12 


10515 


0 


0 


1 


M00003856B:C02 


4622 


0 


0 


0 


M00003857A:G10 


3389 


0 


0 


0 


M00003857A:H03 


4718 


0 


0 


0 


M00003871C:E02 


4573 


0 


0 


0 


M00003875B:F04 


12977 


0 


0 


0 


M00003875B:F04 


12977 


0 


0 


0 


M00003875C:G07 


8479 


1 


0 


0 


M00003876D:E12 


7798 


0 


0 


0 


M00003879B:C1 1 


5345 


4 


8 


3 


M00003879B:D10 


31587 


0 


0 


0 


M00003879D:A02 


14507 


0 


0 


0 


M00003885C:A02 


13576 


0 


0 


0 


M00003885C:A02 


13576 


0 


0 


0 


M00003906C:E10 


9285 


0 


0 


0 


M00003907D:A09 


39809 


0 


0 


0 


M00003907D:H04 


16317 


0 


0 


0 


M00003909D:C03 


8672 


0 


0 


0 


M00003912B:D01 


12532 


0 


0 


0 


M00003914C:F05 


3900 


0 


1 


0 


M00003922A:E06 


23255 


0 


0 


0 


M00003958A:H02 


18957 


0 


0 


0 
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Table 7 All Differential Data for Libs 12-14 



PCT/US98/27610 



Clone Name 


Cluster ID 


Clones in 


Clones in 


Clones in 






Libl2 


Libl3 


Libl4 


M00003958A:H02 


18957 


0 


0 


0 


M00003958C:G10 


40455 


0 


0 


0 


M00003958C:G10 


40455 


0 


0 


0 


M00003968B:F06 


24488 


0 


0 


0 


M00003970C:B09 


40122 


0 


0 


0 


M00003974D.E07 


23210 


0 


0 


0 


M00003974D:H02 


23358 


0 


0 


0 


M00003975A:G11 


12439 


0 


0 


0 


M00003978B:G05 


5693 


0 


0 


0 


M00003981A:E10 


3430 


0 


0 


0 


M00003982C:C02 


2433 


2 


4 


0 


MO0O03983A:AO5 


9105 


0 


0 


0 


M00004028D:A06 


6124 


0 


0 


0 


M00004028D:C05 


40073 


0 


1 


0 


M00004031A:A12 


9061 


0 


0 


0 


M00004031A:A12 


9061 


0 


0 


0 


M00004035C:A07 


37285 


0 


0 


0 


M00004035D:B06 


17036 


0 


0 


0 


M00004059A:D06 


5417 


0 


0 


0 


M00004068B:A01 


3706 


0 


0 


0 


M00004072B:B05 


17036 


0 


0 


0 


M00004081C:D10 


15069 


0 


0 


0 


M00004081C:D12 


14391 


0 


0 


0 


M00004086D:G06 


9285 


0 


0 


0 


M00004087D.-A01 


6880 


0 


0 


0 


M00004093D:B12 


5325 


0 


0 


0 


M00004093D:B12 


5325 


0 


0 


0 


M00004105C:A04 


7221 


0 


0 


0 


M00004108A:E06 


4937 


0 


0 


0 


M0000411!D:A08 


6874 


0 


0 


0 


M00004114C:F11 


13183 


0 


0 


0 


M00004138B:H02 


13272 


0 


0 


0 


M00004146C:C11 


5257 


0 


0 


1 


M00004151D:BO8 


16977 


0 


0 


0 


M00004157C:A09 


6455 


0 


0 


0 


M00004169C:C12 


5319 


0 


0 


0 


M00004171D:B03 


4908 


0 


0 


0 


M00004172C:D08 


11494 


0 


0 


0 


M00004183C:D07 


16392 


0 


0 


0 


M00004185C:C03 


11443 


2 


0 


0 


M00004197D:H01 


8210 


0 


0 


0 


M00004203B:C12 


14311 


0 


0 


0 


M00004212B:C07 


2379 


0 


0 


0 


M00004214C:H05 


11451 


0 


0 


0 


M00004223A:G10 


16918 


0 


0 


0 


M00004223B:D09 


7899 


0 


0 


0 


M00004223D:E04 


12971 


0 


0 


0 


M00004229B:F08 


6455 


0 


0 


0 
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Table 7 All Differential Data for Libs 12-14 



PCT/US98/27610 



Clone Name 


Cluster ID 


Clones in 


Clones in 


Clones in 






Libl2 


Libl3 


Libl4 


M00004230B:C07 


7212 


0 


0 


1 


M00004269D:D06 


4905 


0 


0 


0 


M00004275C:C1 1 


16914 


0 


0 


0 


M00004283B:A04 


14286 


0 


0 


0 


M0O004285B:E08 


56020 


0 


0 


0 


M00004295D:F12 


16921 


0 


0 


0 


M00004296C:H07 


13046 


0 


0 


0 


M00004307C:A06 


9457 


1 


0 


0 


M00004312A:G03 


26295 


0 


0 


0 


M00004318C:D10 


21847 


0 


0 


0 


M00004372A:A03 


2030 


0 


0 


0 


M0OOO4377C:FO5 


2102 


0 


0 


0 
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1 . A library of polynucleotides, the library comprising the sequence information of 
at least one of SEQ ID NOS: 1-844. 

2. The library of claim 1, wherein the library is provided on a nucleic acid array. 

3. The library of claim 1, wherein the library is provided in a computer-readable 

format. 

4. The library of claim 1, wherein the library comprises a differentially expressed 
polynucleotide comprising a sequence selected from the group consisting of SEQ ID NOS:9, 
39, 42, 52, 62, 74, 1 19, 172, 317, and 379. 

5. The library of claim 1, wherein the library comprises a polynucleotide 
differentially expressed in a human breast cancer cell, where the polynucleotide comprises a 
sequence selected from the group consisting of SEQ ID NOS: 4, 9, 39, 42, 52, 62, 65, 66, 68, 
74,81, 114, 123, 144, 130, 157, 162, 172, 178, 183,202,214,219,223, 258, 298,317, 338, 
379, 384, 386, and 388. 

6. The library of claim 1, wherein the library comprises a polynucleotide 
differentially expressed in a human colon cancer cell, where the polynucleotide comprises a 
sequence selected from the group consisting of SEQ ID NOS: 1, 39, 52, 97, 119, 134, 172, 
176, 241, 288, 317, 357, 362, and 374. 

7. The library of claim 1, wherein the library comprises a polynucleotide 
differentially expressed in a human lung cancer cell, where the polynucleotide comprises a 
sequence selected from the group consisting of SEQ ID NOS: 9, 34, 42, 62, 74, 106, 119, 
135, 154, 160, 260, 308, 323, 349, 361, 369, 371, 379, 395, 381, and 400. 

8. An isolated polynucleotide comprising a nucleotide sequence having at least 90% 
sequence identity to an identifying sequence of SEQ ID NOS: 1-844 or a degenerate variant 
thereof. 

313 



WO 99/33982 



PCT/US98/27610 



9. An isolated polynucleotide according to claim 8, wherein the polynucleotide 
comprises a seqeuence encoding a polypeptide of a protein family selected from the group 
consisting of: 4 transmembrane segments integral membrane proteins, 7 transmembrane 
receptors, ATPases associated with various cellular activities (AAA), eukaryotic aspartyl 
proteases, GATA family of transcription factors, G-protein alpha subunit, phorbol 
esters/diacylglycerol binding proteins, protein kinase, protein phosphatase 2C, protein 
tyrosine phosphatase, trypsin, wnt family of developmental signaling proteins, and 
WW/rsp5/WWP domain containing proteins. 

1 0. The polynucleotide of claim 9, wherein the polynucleotide comprises a 
sequence of one of SEQ ID NOS: 24, 41, 101, 157, 291, 305, 315, 341, 63, 1 16, 134, 136, 
151, 384, 404, 308, 213, 367, 188, 251, 202, 315, 367, 397, 256, 382, 169, 23, 291, 324, 330, 
341,353,188, 379, and 395. 

1 1 . The polynucleotide of claim 8, wherein the polynucleotide comprises a 
seqeuence encoding a polypeptide having a functional domain selected from the group 
consisting of: Ank repeat, basic region plus leucine zipper transcription factors, 
bromodomain, EF-hand, SH3 domain, WD domain/G-beta repeats, zinc finger (C2H2 type), 
zinc finger (CCHC class), and zinc-binding metalloprotease domain. 

12. The polynucleotide of claim 11, wherein the polynucleotide comprises a 
sequence of one of SEQ ID NOS: 116,251,374,97, 136,242,379,306,386, 18,335,61, 
306,386, 322, 306, and 395. 

13. A recombinant host cell containing the polynucleotide of claim 8. 

14. An isolated polypeptide encoded by the polynucleotide of claim 8. 

15. An antibody that specifically binds a polypeptide of claim 14. 

16. A vector comprising the polynucleotide of claim 8. 
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17. A polynucleotide comprising the nucleotide sequence of an insert contained in a 
clone deposited as ATCC accession number xx, xx, xx, xx, xx, xx, xx, xx, or xx. 

18. A method of detecting differentially expressed genes correlated with a cancerous 
state of a mammalian cell, the method comprising the step of: 

detecting at least one differentially expressed gene product in a test sample derived 
from a cell suspected of being cancerous, where the gene product is encoded by a gene 
corresponding to a sequence of at least one of SEQ ID NOS:4, 9, 39, 42, 52, 62, 65, 66, 68, 
74, 81, 1 14, 123, 144, 130, 157, 162, 172, 178, 183, 202, 214, 219, 223, 258, 298, 317, 338, 
379,384,386,388, 1,39, 52, 97, 119, 134,172, 176, 241,288,317,357, 362,374, 9, 34,42, 
62, 74, 106, 119, 135, 154, 160, 260, 308, 323, 349, 361, 369, 371, 379, 395, 381, and 400; 

wherein detection of the differentially expressed gene product is correlated with a 
cancerous state of the cell from which the test sample was derived. 

19. The method of claim 18, wherein said detecting step is by hybridization of the 
test sample to a reference array, wherein the reference array comprises an identifying 
sequence of at least one of SEQ ID NOS: 1 -844. 

20. The method of claim 18, wherein the cell is a breast tissue derived cell, and the 
differentially expressed gene product is encoded by a gene corresponding to a sequence of at 
least one of SEQ ID NOS: 4, 9, 39, 42, 52, 62, 65, 66, 68, 74, 81, 1 14, 123, 144, 130, 157, 
162, 172, 178, 183, 202, 214, 219, 223, 258, 298, 317, 338, 379, 384, 386, and 388. 

2 1 . The method of claim 1 8, wherein the cell is a colon tissue derived cell, and the 
differentially expressed gene product is encoded by a gene corresponding to a sequence of at 
least one of SEQ ID NOS: 1, 39, 52, 97, 119, 134, 172, 176, 241, 288, 317, 357, 362, and 
374. 

22. The method of claim 18, wherein the cell is a lung tissue derived cell, and the 
differentially expressed gene product is encoded by a gene corresponding to a sequence of at 
least one of SEQ ID NOS: 9, 34, 42, 62, 74, 106, 119, 135, 154, 160, 260, 308, 323, 349, 
361, 369, 371, 379, 395, 381, and 400. 
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SEQUENCE LISTING 

<110> Lewis T. Williams 
Jaime Escobedo 
Michael A. Innis 
* Pablo Dominiguez Garcia 
Julie Sudduth-Klinger 
Christoph Reinhard 
Klause Giese 
Filippo Randazzo 
Giulia C. Kennedy 
David Pot 
Altaf Kassan 
George Lamson 
Radoje Drmanac 
Radomir Crkvenjakov 
Mark Dickson 
Snezana Drmanac 
Ivan Labat 
Dena Leshkowitz 
David Kita 
Veronica Garcia 
William Lee Jones 
Birjit Stache-Crain 

<120> Novel Human Genes and Gene Expression 
Products I 

<130> 2300-1480P 

<140> 09/ 

<141> 1998-12-21 

<150> 60/068,755 
<151> 1997-12-23 

<150> 60/080,664 
<151> 1998-04-03 

<150> 60/105,234 
<151> 1998-10-21 

<160> 844 

<170> FastSEQ for Windows Version 3.0 

<210> 1 

<211> 300 

<212> DNA 

<213> Homo sapiens 
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<400> 1 

tctcccctga gctgcaggcc tgcatatcca gtaggtctac tggacatctg tactggttgt 60 

tgnggaggaa cctctggctt gctcattaag tcctactgat tttcactatc ccctgaattt 120 

ccccacttat ttttgtcttt cactatcgca ggccttagaa gaggtctacc tgcctccagt 180 

cttacctagt ccagtctacc ccctggagtt agaatggcca tcctgaagtg aaaagtaatg 24 0 

tcacattact cccttcagtg atttcttgta gaagtgccaa tccctgaatg ccaccaagat 300 

<210> 2 
<211> 299 
<212> DNA 

<213> Homo sapiens 
<400> 2 

cccagctgct caggaggctg aggcaggaga attgcttgaa cccaagaggc ggaggttgtg 60 
gtgagccgag attgcacctt tgtactccag cctgggcaac gagcaaaaaa ctctgtctca 120 
aacaaaaaga agaaaaaaaa aaaaaaaann nnnnnnnnnn nnnnnnnnnn nnnttnttct 18 0 

ggcgncnagt cccaaantcn taccttgtaa gacctttann tnncctgngg tntttntnna 240 
cncttanata nnnntntttn ctatcaanta tagggagant tttcntttng gggcaactt 299 

<210> 3 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 3 

atacgattcg aacnnggaca agacgagtat ggaataatat cccactnnnt ttacaatact 60 
ganattatgc ngngatagng cttgttccat tcnaccagcg aatnatgcat tnacncnaca 120 
cnngagttac tatccaaaca cacgttttca cgntacctga ngctggtnga naattatgcg 180 
accatgaggc tttccangat ntttctannt ancagacngn gnacaatgnt gaanaagcng 24 0 
tacacaccgc nctngncnnc cnncactgan cangtnacnn ngctcactgn ngcctctttc 300 

<210> 4 
<211> 287 
<212> DNA 

<213> Homo sapiens 
<400> 4 

aaancngcac gangccacgt ncgnnnngnt nntactnnnc natngccncn tcantggcng 60 
ncagctagac gcctaacagc cgangancca nccntnntgt ganccngtcn tgacngnnag 120 
cntgccggtc ttgctcnttt tgtctaccnn gagganannn ntntgggaca tcccagactg 180 
agtgaggaga tctgcngctg cnnctgtact tggttacanc ncacacgang actntncctt 240 
ggactanana cactagccta anattcngca ctacctantc ctctggc 287 

<210> 5 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 5 

gtccttttga accaccccaa agaactcaac atggcaaagc aaatggtaaa agcttcccga 60 
ctgttctact ttgggtccgc gcgaagccca ctcacgtgtg atctgtgttg cccctgggag 12 0 
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gcccggggcg accggaaaag ggctctctca agttctgaaa agagaatctg ccaccagatc 180 
gaatttcgac ccctgagctt gttcggacgt atggtccaaa ttcagattaa ggtggtcacc 240 
caacccgaga tgtcaggaaa ggccttctgc agagaaaatg tccccccacc cgccatctgc 300 

<210> 6 

<211> 284 

<212> DNA 

<213> Homo sapiens 



<400> 6 

tntccccttt gacgccttan tgccctnncg ctacnngtcc nttaggcctt atcccatcgn 60 

ccntcgtttn gcattctgcc nnagantgac tttncnatca tgcntnatnn gtnnttacna 120 

ggggctnggg tgaattntta caccctgcna ntccatanca cantgccntg cnagctncac 180 

cctcntgaat aaatgcaata aantttcngt tgatcttata caccttatgc nccntantta 24 0 

atcagccctn tnttacnana tcnanttatg cnggtattaa aaca 284 

<210> 7 

<211> 277 

<212> DNA 

<213> Homo sapiens 

<400> 7 

gtgctgcaga caacacacct tcctgatgga ggtgtccggc tgatggagaa gtctgtgggc 60 

ttgtaaatca tctttgatgt taaccaggcc gacgctgtgg ccacattccg aaagattaac 120 

cctgtcaaac cctannnnnn nnnnnnnnnn nnnggatttg atnagcctgt nccanacctc 180 

tgcagcctcn ancggtngtn ntaccatagt ggggatgacc ctctgatact ttgncctggt 240 

ngancatgnt gacanntgct tctacagctt nngggac 277 

<210> 8 

<211> 292 

<212> DNA 

<213> Homo sapiens 



<400> 8 

cttgggaggc tgagtcagga gaattgcttg agcccaggag atggaggttg cagtgagcca 60 

agatcatgcc actgcactcc agactgggca acagagggag actccgtctc aaaaactaaa 120 

aaaaaaaata catttagtat agcggggggt gggcgggaga aataatgtta tttcctatgc 180 

aaatgacgnn nnnnnnnnnn cccatggtaa atgtnaatat actgcgtctn ttttgggana 24 0 

gccttttant aaangagtct tanatgaatc tctanntnat gantttaact tg 292 

<210> 9 

<21l> 300 

<212> DNA 

<213> Homo sapiens 



<400> 9 

ccaggttagc tgctgaatca aagcttcaaa cagaagttaa agaaggaaaa gaaacttcaa 60 

gcaaattgga aaaagaaact tgtaagaaat cacaccctat tctatatgtg tcttctaaat 120 

ctactccaga gacccagtgc cctcaacagt aaagactttt ctttaataag agtacggtgc 180 

cacttgcctc aaaagttact atggtgctta agattgtctt gatctgacat atatcacctt 240 

ctgggttatt tactcattgt gccaggacct ggcattttca tgtgcctttg accaagtgtt 300 
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<210> 10 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 10 

aggaggcgga gcttgcagtg agctgagatc gcgccactgc actccagcct gggcaacaga 60 
gtgagactct gtctcaaaaa aaaaaaaaaa nnnnnnnnnn nnnnnnnnaa nctcgtnttn 120 
gnaaggaaan ggggnaangg accggtntta tncctatgtn gtntttgcag gcaaangaaa 180 
nggacccntt tttgtaaaaa aaagtctttt gnncaantaa acggggtntg ngggtncagg 24 0 
ccctggnggg gcncncantt gcctggnggc ttntgnnaaa tcggnaaagg gaggaaaggc 300 

<210> 11 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 11 

cgtctgtaat cccagctgct tgggaggctg aggcaggaga atcacttgaa ccctggaggt 60 
ggcggtttca gtgagcacag atcatgccac tgcactccag cctgggcaac aaaacgagac 12 0 

ttcgtctcaa aaaaaaaaan nnnnnnnnnn nnnnnnnnnn nncgggttct cccaaattnt 180 
ttnnaggggn ccatggncaa ctgnttnacn tttgtttngg naaccccntg cccnaagncg 240 
cananaggct gtnnttnncc ttgttnccaa ggntgaggan caaaaagtac cctntgtttt 300 

<210> 12 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 12 

caaagatggt cgtattacta aaggtgaata accagcgcgg ggggcacgtg gagtcactgg 60 
aacatttgtg caatgctggt gggaatgtca acccgtgcgg ccctctggaa taagcctggc 120 
agctcctcca agagttaccg tgtgacccag caattccact cctagctcca cccacaggaa 180 
ttgaaagcaa agacgcaaac agatgcctgt gcaccaaagt tcacggcagc atccttcgcc 240 
atagtggcag catccgtcgt cacagcggca tcatccttca tcatagcggc agcatccgtc 3 00 

<210> 13 

<211> 278 

<212> DNA 

<213> Homo sapiens 

<400> 13 

cctgcagcca ctaatgcatt gtgtatgata acaaaaactc tggtatgaca cattttctgt 60 

gatcattgtt aattagtgac atagtaacat ctgtagcagc tggttagtaa acctcatgtg 120 

ggggaggtgt gggaggtttt nncgnnannn nnngcnnnnn annccccggn nngnnngaag 180 

ctgnnnnttn naannngcnn nnnannatga naannnccnn ngactggnnn nangaggcct 24 0 

ancccntgnt ttananaaac nnncnncagn ntctctca 278 

<210> 14 
<211> 300 
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<212> DNA 

<213> Homo sapiens 
<400> 14 

gtgtcttcat cttacccagt ggaacctaag aaattaaatt ctccagaaga aactgctttt 60 
cagacaccaa aatctagcca gatgcctcgg ccttcagtgc caccattagt taaaacatca 120 
ctgttttctt caaaattatc tacacctgat gttgtgagcc catttgggac cccatttggc 180 
tctagtgtaa tgaatcggat ggctggaatt tttgatgtaa acacctgcta tgggtcaccg 240 
caaagtcctc agctaataag aagggggcca agattgtgga catcagcttc tgatcagcaa 300 

<210> 15 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 15 

gttatattaa attattcttt gtttttcttt ttcttttaat aaagcctgca agttactaaa 60 
ttgtagtttc ataaattctg tagtaaagta tcatcttggc agtgtgccaa aggtgaaaat 120 
gatgctttct ctaacagaga aattcttagt gactccagtc gtagaaaaac gtctttacaa 180 
cctgaataag attgaagaat tgtgaacata ccatggccta ttggatgaat catttgccgt 24 0 

aggctaaatc agactgtagg gtttgcgatg gatttatgga gtatgtgggt atagaaatca 300 

<210> 16 
<211> 276 
<212> DNA 

<213> Homo sapiens 
<400> 16 

gtttcattta agaagaatga gctagataaa tgtgctcttc tggttacccc accctgacag 60 
agtgcatttt tacacggcta gcaggggttg agactgcagc ctggcctnnn nnnnnnnnnn 120 
ngnnnnnngc nnacttnact tcccngaanc actataattg gnanacnttn ctaannggtn 180 
atctngccga cctgnnagat anactcnnga taaaanccnn tgcagaaagc gcccttccat 24 0 

gtcangcncn tnaganacnn ncntaccncc tangna 276 

<210> 17 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 17 

ggtgcccatc accacaccca gctaactttt gtatttttag tagagacggg gtttcaccat 60 

gttggccagg ctggtcttga actcctgacc tcgtgatccg cccgccttgg ccccgcaaag 120 

tgctgggatt acaagcatga gcccagcgcc tggctgtatc tttcatttta cccaagtcac 180 

tttacccaag taagtaatta ggggaaagcc tgagtcttgt accacctgtt catttgggga 240 

actgtgggaa acggagccaa cggacctaag tgccctttga cagtgagttt cataccattt 300 

<210> 18 
<211> 273 
<212> DNA 

<213> Homo sapiens 
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<400> 18 

ctcagctgag gcaattaaac tggaaaagaa atagattgaa aagatactac agaagaagca 60 

gtacagaagt tgggggactg aaggagaggg agccactgca ggtgctagct gcttaagggg 120 

ataccagtcc ttttacagat ataatagata cagcttctga ggtggagggt gataggagtg 180 

tgtagagaaa ttgcagttca gaactggagc atgcagttag gcaagaggca tcccatgtga 24 0 

agatgtcaag caagtactgg aaaatgctga act 273 

<210> 19 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 19 

gggtcctggt gggagttcca tccagcagtg agtgcatttt ttccccagag cagttaaggg 60 

tcttattaaa agccaccact ttgctgaggc ctgtacaggc cttgggggtt tggggaagag 120 

aaataaggca ggcacttgtc ccttcaggga gggacttgtc cctcactggg aggtttgggg 180 

ttgaccttgg ctccagcaga gatacccagc ctggcgtgga aggggcaggt ctgagcttac 24 0 

gcttgactgc agggcaagct gcaggcctct tctgccttcc cctgcattca ccaaggacag 300 

<210> 20 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 20 

atggcatgca ctgacctctt cttggagccc agaactttat agagttgcct accagggtta 60 

ctgtaatgga atttatgatc ttaagaaatt actagttgta ttatttatcc tatgattcat 120 

tcattcaata agcttttact gcataaactt tacatccagc actgtagtta agtacccaaa 180 

attgaataga aataatggct tttgaaaatc gcacaaagca ggccaggcac ggtggctcac 24 0 

gcctgtaatc ccagcatttt gggaggccga ggcaggcgga tcacgaggtc aagagatcca 300 

<210> 21 

<211> 293 

<212> DNA 

<213> Homo sapiens 

<400> 21 

cgtctgtaat cccagctgct tgggaggctg aggcaggaga atcacttgaa ccctggaggt 60 

ggcggttgca gtgagcacag atcatgccac tgcactccag cctgggcaac aaaacgagac 12 0 

ttcgtctcaa aaaaaaaaan nnnnnnnnnn nnnnccttng gncgggttnt cccaaattnt 18 0 

tttgaggngn ccatggncaa ctgcttnanc tttgttttgg caaccccntg cccnaagtcg 240 

catataggct gtncttcacc ttgtttccaa ggctgnggaa canaaagtaa cct 293 

<210> 22 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 22 

ctggtctcga acacctgacc tcaggtgatc cattcgtctt ggcctctcga agtgctggga 60 
ttccaggcgt gagccactgc ggccagcaca tttccacttt tagatcctac tccataccac 120 
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aggtttcatt taagaagaaa gagctagata aatgtgctct tctggttacc ccaccctgac 180 
agagtgcatt tttacacggc tagcaggggt tgagactgca gcctggcctg ccagccattg 240 
gaggtgttta aggaagggca gataatgtga ctctttgcgg ggtgccatct gcttacccat 300 



<210> 23 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 23 

gaaccaaaga cgtgtatgga gtgttctctt 
tccaagcgac cggatctgag tgatgcttct 
agtagaaagg gtccccattc ctgctcagca 
atgcagacac acacatgcag acaacacgca 
cccatgcaca cacacgtgca cacacatgca 

<210> 24 
<211> 300 
<212> DNA 
<2X3> Homo sapiens 

<400> 24 

cctcccacaa cacgtgggaa ttcaagatga gatttggttg gggacacagc caacccatat 60 

cacccatgcc tggatgccct tctcatgctt gggttctgtc atctgcacca ggccttctgc 120 

tgcccgtctg tcttacccac caggactctg actctccacg ctgggccacc tctcttctcc 180 

aacactgcta tggattgaat gtttatgtta tccccaaatt tgcatgttgc aatcccaatc 240 

tccaatgcca tagtattagg aggtgggggc ctttgggagg tgatttggtc atgaaggtgg 300 

<210> 25 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 25 

ggaaaatgaa atctgactat ctgctagttg 
ttagttggga aagaaggcag cacttgaaaa 
ggaaggggcg tgggacgcac gcggtcactc 
ctgtagctgt gcctctacag actcccgctg 
ccagtaggca aagcttggcc ctattagctt 

<210> 26 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 26 

ctgcagtgag attctctgca atgactggcc tcagcaaggg ggcagcttag gaccctgaca 60 

tcccaggtca ctaagccaca taggataagt aatgggtgga cagaagcggg aaaggagaag 120 

ggcagggcac atgtttaaaa cttgaacttt ctgaggctaa gactggaaaa ggaatggttt 180 

cagctgatat atttggatac cagttgacta tttttaggaa aaaaacacaa atggctttta 240 



gtccttatcg acttgctctg 
agaacatttg ggtgttgggg 
ccgcacctct ctaccccccc 
gacacacaca tgcaggcact 
gagacatgca gacacgcagg 



ctcccagctt 60 

ggttcccaat 120 

acagacacac 180 

cacatgcagg 240 

cacacatgca 300 



ccaaaaccca gaaacattcc 
aatttaccag gttcctcact 
cctctcagcc ccccacattt 
cctggcctcc acagatcctg 
tttctctcca tggctctgtg 



tgtgtaatgg 6 0 

gggagatgt g 120 

ctagaacaca 180 

ctcagattca 24 0 

ggaatgtgcg 300 
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aacatcacag tgtgatacag tctaactcag aattagagac aggcaaaaca gaactccatc 300 

<210> 27 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 27 

gtactgcttc tgtggctctt cacagacctc acggatgtga ccggagatga gtgccgatga 60 
ccacgtttta aaggagaaag agagctcctg gtggggccct cggggtggtc tcaggtccca 120 
tttgcagtct gcaacagtga cgcgcagccc ggtccggagc gtggtgagct ttgtttgcct 18 0 
tctgggtcag ctttcgctgt gtctcctgtg tgtgttagaa tccagagccc agaggaagtg 240 
caagcgggtc ctccgccaac ggggagagcc tcttcgcggc gctgttggcg acagcacgct 300 

<210> 28 

<211> 298 

<212> DNA 

<213> Homo sapiens 

<400> 28 

aangnaannn nggggngttg antcnacctn ngaaccgtgt anaaacccat ggaaacagct 60 

antaganntt gggcagganc agagngaggc caagntacgg gggaggcnag gagcngagan 120 

tggggnnnnn nnangnnaan tnnngaaggg gngngannga gggggggana naagggggga 180 

ngagggcgaa ngncaggann nagaaaannn ggggacgana nggngaacag ggnnnaaacg 240 

gaannnngga gnnnnnanag atgncgggca gngncngngn aggnganann ngagacgg 298 

<210> 29 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 29 

cctcagcccc acaccagctc tatttcaggg gtgagagtca gagagcactg caatatgtgc 60 

ttcatgggat ttcgattcga agatcctaga ccagggagac actgtgagcc agggatacaa 120 

caaaatacta ggtaagtcac tgcagaccga cctccctgca gtttgggaaa gaagctgggt 180 

ttgtggagaa tcagagcatc ttgacatgac tgctgaccta aagatccctg gcattggcca 240 

gggatcctgt ggaacctctt ctagttcagg ggtgtgagca ttagactgcc agttgtctag 300 

<210> 30 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 30 

gtttgtttcc ccgagatgtg aacttgctga aggaaaacag tgtaaagagg aaggccatac 60 

agagaactgt cagctcttca ggatgtgaag gcaagaggaa tgaagacaag gaagcagtga 120 

gcatgttggt taactgccct gcctactaca gtgtgtctgc tcccaaggct gagctactga 180 

acaaaatcaa agagatgcca nnnnnnnnnn nntgaggaag aggaacaggc anatgtcaat 240 

gaaaagaagg ctgatctcat tggaagtctc acccacaagc tggagaccct ccaggaggcg 300 

<210> 31 
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<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 31 

tttaaactga gctccaaatg acgttcaaac acccctctcg ggtagagttt tcatggtgga 60 
acggttgcgc ccaccaaaca gaagcttatg tttttggcac agaaggcctg ggccattttc 120 
atggacacct ggctggacct cggtggaagt gaactccgta ggttgttgcg ttcactgcag 180 
cacctcacat gataccgtcc cctctcatgg aacggagcct cccccatgca gcccccactc 240 
aaatggagtt ttaaaggctg ggttcaggtt acgggggcgt ttctcaccgt ctgaatgcgg 300 

<210> 32 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 32 

gtgaaacaga aagtggagat gctttccttg acctgaagaa gcctcctgcc tccaaatgcc 60 
cccatcgcta tacaaaagaa gaactcttgg atataaaaga actcccccat tccaaacaga 120 
ggccttcatg cctttctgaa aaatatgaca gtgatggtgt ctgggaccct gagaagtggc 180 
atgcctctct ctacccagct tcagggcgga gctcaccagt ggaaagtctg aagaaagagt 240 
tggatacaga ccggccttcc ctggtgcgca ggatagtaga tccacgagag cgtgtgaaag 300 

<210> 33 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 33 

gtctgattga agctgttcag gtttatcatg caaatcctcg cctctggcta cggctggctg 60 
aatgctgcat tgctgccaat aaggggactt ctgaacaaga aactaaaggc cttcccagca 120 
aaaaaggaat tgtacagtct attgttggtc aaggctatca tcgtaaaata gttttggcat 180 
cacagtctat acagaatact gtttataatg atgggcagtc ttcggccatt cctgtagcca 240 
gtatggagtt tgcagccata tgtctcagaa atgccttgtt gctgctacct gaagaacagc 300 

<210> 34 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 34 

tgacagagct gttcagcgta caccagatcg atgagctggc caagtgcaca tcagacactg 60 
tgttcctgga gaagaccagt aagatctcgg accttatcag cagcatcacg caggactacc 12 0 

acctggatga gcaggatgct gagggccgcc tggtacgcgg catcattcgc attattaccc 180 
gaaagagccg tgctcgccca cagacctcgg agggtcgttc aactcgggct gctgccccaa 240 
ccgctgctgc ccctgacagt ggccatgaga ccatggtggg ctcaggtctc agccaggatg 300 

<210> 35 

<211> 300 

<212> DNA 

<213> Homo sapiens 
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<400> 35 

cttttttaag caaagcagtt tctagttaat gtagcatctt ggactttggg gcgtcattct 60 

taagcttgtt gtgcccggta accatggtcc tcttgctctg attaaccctt ccttcaatgg 120 

gcttcttcac ccagacacca aggtatgaga tggccctgcc aagtgtcggc ctctcctgtt 180 

aaacaaaaac attctaaagc cattgttctt gcttcatgga caagaggcag ccggagagag 240 

tgccagggtg ccctggtctg agctggcatc cccatgtctt ctgtgtccga gggcagcatg 300 

<210> 36 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 36 

gctggccaaa gccaaatctc ctaagtccac cgcccaggag ggaaccctga agcctgaagg 60 

agttacggag gccaaacatc cagctgcagt tcgcctccaa gaaggggtcc atggccctag 120 

tcgagtccat gtgggctctg gggaccatga ctattgtgtc cggagcagga cccccccaaa 180 

aaagatgcct gccctagtca ttccagaggt gggctcccga tggaatgtca agcgccatca 240 

ggacatcacc atcaaacctg tcttgtcctt gggcccagct gcccttcgcc ccatgcatag 300 

<210> 37 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 37 

gtccaaggac aacttcgaga catttctttt tgccaccgta tctaacaggg agcaggaaga 60 
tctctgccga ggaattgtcc agctctgctt caatgagcaa agccaacagc tgctagcaga 120 
ggtccagccc tctgactctt tcctcatggt agagacaact gcatactttg aggcctacag 180 
gcacgtcctg gaaggactcc aggaggtcca ggaggaagat gttcccttcc agaggaatat 240 
cgtggagtgt aactctcatg tgaaggagcc aaggtacttg ctaatggggg gcagatatga 300 

<210> 38 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 38 

catccaggga gaacctcggg gctgggacac ctcctggccc tcaccctggg tcatgtttac 60 
agtcctcagt gccccacacc ggtggccccc tgaggacacc tccaccctga ccttgatttt 120 
cccaaacgct gcctcttggt gacagactca gcccaaaacc ccttccttct gtctctggag 180 
acccttgagc ttggggaaat atggaggggt gtgtgtctgc aatcaaggcc tctgcagctc 240 
acggctggcc cggtgggctg ggacttccgt atgaattnta aatacttagg gttcattttt 300 

<210> 39 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 39 

gggaaggagc gggcgtgagg ccagctgagg catggtgacc cctgggaagg agcgggcgtg 60 
aggccagctg aggcatggcg acccctggga aggagcgggc gtgaggccag ctgaggcatg 120 
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gtgacccctg ggaaggagcg ggcgtgaggc 
gggacttggg ggccgcacct tggtttgccc 
ggaggacggc gtgggatagg ctccctgggt 

<210> 40 

<211> 300 

<212> DNA 

<213> Homo sapiens 



cagctgaggc atggtgaccc ctgggtacgg 
agggcccctc ctgcaccacg ggccacatgc 
ccacagcttc tgcccgtgta tggggaaccc 



<400> 40 

ccaaaagctt gtggcaaatt tgaaatttct gccattaggg accttacaac tggctatgat 60 

gatagccaac ctgataaaaa agctgttctt cccactagta aaagcagcca aatgatcacc 120 

ttcacctttg ctaatggagg cgtggccacc atgcgcacca gtgggacaga gcccaaaatc 18 0 

aagtactatg cagagctgtg tgccccacct gggaacagtg atcctgagca gctgaagaag 240 

gaactgaatg aactggtcag tgctattgaa gaacattttt tccagccaca gaagtacaat 3 00 



<210> 41 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 41 

aaaaggtccc ccttctggga aagaccgagt gaagaaaggt ggatcctaca tgtgccatag 60 

gtcttattgt tacaggtatc gctgtgctgc tcggagccag aacacacctg atagctctgc 120 

ttcgaatctg ggattccgct gtgcagccga ccgcctgccc actatggact gacaaccaag 180 

gaaagtcttc cccagtccaa ggagcagtcg tgtctgacct acattgggct tttctcagaa 240 

ctttgaacga tcccatgcaa agaattccca ccctgaggtg ggttacatac ctgcccaatg 300 

<210> 42 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 42 

ttctaagtca ggagtacagt acaaaggaca tgtggagatc cccaatttgt ctgatgaaaa 60 

cagcgtggat gaagtggaga ttagtgtgag ccttgccaaa gatgagcctg acacaaatct 12 0 

cgtggcctta atgaaggaag aaggggtgaa acttctaaga gaagcaatgg gaatttacat 180 

cagcaccctc aaaacagagt tcacccaggg catgatctta cctacaatga atggagagtc 24 0 

agtagaccca gtggggcagc cagcactgaa aactgaggag cgcaaggcta agcctgctcc 300 



<210> 43 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 43 

gccaccgaag cttcaggatg acatcttaga ctctcttggt caggggatca atgagttaaa 60 

gactgcagaa caaatcaacg agcatgtttc aggccccttt gtgcagttct ttgtcaagat 120 

tgtgggccat tatgcttcct atatcaagcg ggaagcaaat gggcaaggcc acttccaaga 180 

aagatccttc tgtaaggctc tgacctccaa gaccaaccgc cgatttgtga agaagtttgt 240 

gaagacacag ctcttctcac ttttcatcca ggaagccgag aagagcaaga atcctcctgc 300 
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<210> 44 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 44 

ggcttataca acatagtggg gaacgcatgg gaatggactt cagactggtg gactgttcat 60 

cattctgttg aagaaacgct taacccaaaa ggtccccctt ctgggaaaga ccgagtgaag 120 

aaaggtggat cctacatgtg ccataggtct tattgttaca ggtatcgctg tgctgctcgg 180 

agccagaaca cacctgatag ctctgcttcg aatctgggat tccgctgtgc agccgaccgg 240 

ctgcccacta tggactgaca accaaggaaa gtcttcccca gtccaaggag cagccgtgtc 300 

<210> 45 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 45 

gtggaagaaa attttttgct gcttctggtt cccagaaaag ggagccattt taacagacac 60 
atctgtcaaa agaaatgact tgtcgattat ttctggctaa tttttcttta tagcagagtt 120 
tctcacacct ggcgagctgt ggcatgcttt taaacagagt tcatttccag taccctccat 180 
cagtgcaccc tgctttaaga aaatgaactt atgcaaatag acatccacag cgtcggtaaa 24 0 
ttaaggggtg atcaccaagt ttcataatat tttcccttta taaaaggatt tgttggccag 300 

<210> 46 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 46 

gtggaagaaa attttttgct gcttctggtt cccagaaaag ggagccattt tangngacac 60 
atctgtcaaa agaaatgact tgtcgattat ttctggctaa tttttcttta tagcagagtt 120. 
tctcacacct ggcgagctgt ggcatgcttt taaacagagt tcatttccag taccctccat 180 
cagtgcaccc tgctttaaga aaatgaactt atgcaaatag acatccacag cgtcggtaaa 240 
ttaaggggtg atcaccaagt ttcataatat tttcccttta taaaaggatt tgttggccag 3 00 

<210> 47 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 47 

acacagataa ttttaataca atgtgaaaaa gtgtatgggt gtgtagaaga ggggttctta 60 

gagtttctgg agagaatgat tctgagctcg gttttgacaa aagaggagct gctgaggcta 120 

aaagtggatg aaaagggcct tataattaaa agaaacaaga caggactcag aggtgtgaaa 180 

caaatattat gcatggtgaa ttacaatgag ttgggggtat tctgtagccc taaagtacaa 240 

ggtataaaga gacagaaaat gatcctggaa tatagacaga ggatacttca tctctcatga 3 00 

<210> 48 
<211> 300 
<212> DNA 
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<213> Homo sapiens 
<400> 48 

gatggaacat gagtggaagt gggcagtctt tttctttccc tatcagctga gtgaatgaag 6 0 

atttagaggg cagcagagtc atgacatgga tgacgttggg tctctggatg gctaaatgga 120 

agacccgccc cccaacgcca ctctaccccc ctgctttgaa ctatgctttg agaaatgagc 180 

ttatgagacc actgagactt gggggctgtt tgttcagcag ttcacctaca cttattagga 240 

aaggttgact tcttgtaact acgcctttcc ttaaatcatc ttttgtataa ttctcagaag 300 



<210> 49 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 49 

ccctccccgg cttcccccgg agtgggtcac 
ttcattgact gtcacatgtg gtttgctggt 
atatgctcga tggatagcat tcactggaac 
agactagagg actctgggtt ccttttatgc 
agcaacaaaa ataaagagag gaccaggacg 

<210> 50 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 50 

ctcctgtctc agcctcctgg gtagctggga ctacaggtgc atgccaccat gcctggctaa 60 
cttttgtatt tttagtacag acagggtttc accacattgg tcaggctggt ctcgaactcc 120 
taacctcagg tgatccacct gccttggcct cccaaagtgc tgagattaca ggcgtgagcc 180 
accgcgcctg gcctgattgg ttttttaaca tgatttttct ctaagcttaa ataccacaag 240 
gccaaagaga aatggtcata atttaaacca ttattatatt ggtgaggtat ccctagctat 300 

<210> 51 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 51 

ggaggctaga ctcaagctgt ctggagagtg 
gtgtgactga gcttgatggc caagttgaaa 
gtaaccagga agaccttagt aaggactctc 
gagctggtac cagtatctca gagcctccgt 
gtggaacgct acctcttcct ttgagacctt 

<210> 52 
<211> 300 
<212> DNA 
<213> Homo sapiens 



cacactgttt tttatcatca 
ggcttcccac tggcgaagag 
cactatgaga agattatagg 
aaagtcaact cttctgggtc 
atgccagcac cccgtttatc 



tgggaatcat 60 

aagctacaaa 120 

aaaaacacca 180 

acagttaccc 240 

ctgagtgaac 3 00 



tgaaacaaaa gtgtgtgaag agttgtaact 60 

atcttcattt ggatctgtgc tgccttgctg 120 

taggtcctac caaatcaagc aaaattgaag 180 

ctcctatcag tccgtatgct tcagaaagct 240 

gtggagaagg gtctgaaatg gtaggcaaag 300 
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<400> 52 



atatggtata gttggaaata ggttattgtg agttatttgt 
ttgcatggtg tctaacttct gcaataaatg atctgccagt 
aatttgtttt cctttgtgga tgaagtggga gtaagacttg 
agtggctagg atatggacac actttacttg aattggaaaa 
tgcaaaatgt ggcatgtcac acgtgtaatc tctgaggttt 



agtcatgtct 
cctagtgtct 
ttgctgtgag 
caagccatgt 
agtttttgct 



ttaatggccc 
ggctttatgc 
gattagatga 
atccctaatc 
caagattgca 



120 
180 
240 
300 



60 



<210> 53 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 53 

aagaagctct gcttggtact actattatga acaacattgt tatttggaat ttaaaaactg 60 

gtcaactcct gaaaaagatg cacattgatg attcttacca agcttcagtc tgtcacaaag 120 

cctattctga aatggggctt ctctttattg tcctgagtca tccctgtgcc aaagagagtg 180 

agtcgttgcg aagccctgtg tttcagctca ttgtgattaa ccctaagacg actctcagcg 240 

tgggtgtgat gctgtactgt cttcctccag ggcaggctgg caggttcctg gaaggtgacg 300 

<210> 54 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 54 

ccaagatgcc aatttccatg aagtcttgat ttatatatat gtacacatgt tatgcacata 60 

catgtttgtt ttctaacagt tattttttaa gcttttgaga taattttaga cttacagaag 120 

agttgtaaaa gtagtagagt tcttgtatac tctgcaccca ccttgccctt atgttaacat 180 

cttacgtaac aatagaacat ttgtcaaaat taagaaatta accttgatat aatactaact 240 

aaagtagaaa gtttaaaaag tagagatttt agtcttttca ctaatgtcct tttactgttc 300 

<210> 55 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 55 

gggagggacc cttgggggca ggttgtgggt agccagttgc agtctgtggc ctccctcaga 60 

ggtttggagt cgggcgtggc atgctgctgt tggcctcttt ccgagggagt gccatccact 120 

ccctgtccca ccgctgtccg cggtgaggac agtgagggca gtgctacgtg gtggggaggt 180 

gtgtgagaag ccacggaagg gcttcacagg gcagatgcca aggccagtgg gccccggaca 240 

gagtcaggct ccctgggcgg ccttgtgtct tggtggccct gatcatcctg ccaatgcaaa 300 

<210> 56 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 56 

ctttgctctc tccattccaa gttgttctct gttctagaaa gcagatgtag tagacatcta 60 
ctgtttttgc ctaaacagaa tccctttttc ctttttttgt taaaagtact catccctaat 120 
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attacattgt tctggaagga ctgaaaataa cagaactcag caeca tgatc ggacegggae 180 
aatcagatta tttcattcct cagcaaaegg agatcgatcc gaaaagtgga aatatgagct 240 
cttctttggt gttggcatat ggaccctgag agaaagaact ttaatttttt ctcttggact 3 00 

<210> 57 

<211> 276 

<212> DNA 

<213> Homo sapiens 

<400> 57 

cctccctgga tgtgcagaca tggaggagga cagaaggccc agctcagtgg cccccgctcc 60 

ccacccccca cgcccgaaca gcaggggcag agecagnnnn nnntcgaagt gtgtccnngt 120 

tgtcttttga nccttgttnt ggngccttgc ctanatgtat ntnntntnnn tntnntnatt 180 

tnnnnnntnn ntnnnttnct nttnntaaat tgnttnnaan ttntnntann ttnttnnatt 24 0 

nnnannnnnn ntantgtnnt gnattgntat nnatca 276 

<210> 58 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 58 

ctgtaagtct ctttcttgcc catcaccaca tccctagtac tgggtatcag tctggccact 60 
tggctttctg gtttgcccca atgtggtcta ttcttgatgc agctaccaaa gtaatgtttt 120 
aaaaccatta taccaagtta ctatccttgt caaaaccccc agtaactgee aatctcactt 180 
agaataaaat ccggactcct gtgaagcaca gcataaactg gccactgcct atgeagcaac 24 0 
ctcatcttta ccgtttcctg ccttgctcac tcccttccag cgccgttatt cttcctgatg 300 

<210> 59 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 59 

gacccaggta gaccagctca agagttcatg ttctttgtca tcctngtgtg agctctctgt 60 

aagtctcttt cttgcccatc accacatccc tagtactggg tatcagtctg gccacttggc 12 0 

tttctggttt gccccaatgt ggtctattct tgatgeaget accaaagtaa tgttttaaaa 180 

ccattatacc aagttactat ccttgtcaaa acccccagta actgccaatc tcacttagaa 240 

taaaatcegg actcctgtga agcacagcat aaactggcca ctgcctatgc agcaacctca 300 

<210> 60 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 60 

gggtcctggt gggagttcca tccagcagtg agtgcatttc ttccccagag cagggtaagg 60 

gtcttattaa aagccaccac tttgetgagg cctgtacagg ccttgggggt ttggggaaga 120 

gaaataaggc aggcacttgt cccttcaggg agggacttgt ccctcactgg gaggtttggg 180 

gttgaccttg gctccagcag agatacccag cctggcgtgg aaggggcagg tctgagctta 240 

cgcttgactg cagggcaagc tgcaggcctc ttctgccttc ccctgcattc accaaggaca 300 
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<210> 61 

<211> 292 

<212> DNA 

<213> Homo sapiens 



<400> 61 

caaggcccga ggtgccatcc cctctgggaa gcagaagcct ggnggcaccc agagtgggta 60 

ctgtngnggt aaagngntca ccctctcaca gcaccaccag cggcgagaca gaccccacca 12 0 

ccatcttccc ctgcaaggag tgnggcaaag tcttcttcaa gatcaaaagc cgaaatgcac 18 0 

acatgaaaac tcacaggcag caggaggaac aacagaggcn aaaggctcag aaggcggctt 240 

tngcagctga gatggcagcc acgattgaga ggactacggg gcccgtgggg gc 292 



<210> 62 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 62 

agcaaatcaa gatcttcagg tacagttgga ccaggcactc cagcaagcct tggatcccaa 60 

tagtaaaggc aactctttgt ttgcagaggt ggaagatcga agggcagcaa tggaacgtca 12 0 

gcttatcagt atgaaagtca agtatcagtc actaaagaag caaaatgtat ttaacagaga 180 

acagatgcac agaatgaagt tacaaattgc cacgttgcta cagatgaaag ggtctcaaac 240 

tgaatttgag cagcaggaac ggttgcttgc catgttggag cataataatg gtgaaataaa 300 

<210> 63 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 63 

caggcctgga cttcgccccc aggcctagga 
aacagggact ccaatcaatc ggagttctcc 
cccgagacag tcataaggga tggacttagt 
gtttcttaag gatgctgagg gcatggggcc 
gagcagcctc tcaccactgc cacaaggctc 

<210> 64 
<211> 294 
<212> DNA 
<213> Homo sapiens 

<400> 64 

gctgcatctg caatgaggat gccaccctac 
gtgcccgctg cttccggtgg gtgcaggtgg 
tggatccctg acttgtatcc ctttgttcca 
agagcaccag acatctgcct actctcctcc 
tggtcctccc ggaagggcag tcccacaggc 



ccgcggaggg tggaaccctg 
ccttgccgga gctgcccttc 
tttcttgcag ggaaaaaggt 
aggaccaggg gagaggcaca 
cctaatgctg gtctctgctc 



ctactgcccc 60 

acctttgggg 120 

ggacagccgt 180 

gctccttcct 240 

cactccccgg 300 



gctgcgctgg ctgcgatggg gacctcttct 60 

aatgttctgt gcgagagctc aagggctgcc 120 

cagagagggc catgatgcct ttgagcttaa 180 

acgtgcaggc caagagcact gaagacaccc 240 

agcggcaccc atttctgggc cccg 294 



<210> 65 
<211> 300 
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<212> DNA 

<213> Homo sapiens 

<400> 65 

aattgatgag ccttattaac tatcttttca ttatgagaca aaggttctga ttatgcctac 60 

tggttgaaat tttttaatct agtcaagaag gaaaatttga tgaggaagga aggaatggat 120 

atcttcagaa gggcttcgcc taagctggaa catggataga ttccattcta acataaagat 180 

ctttaagttc aaatatagat gagttgactg gtagatttgg tggtagttgc tttctcggga 24 0 

tataagaagc aaaatcaact gctacaagta aagaggggat ggggaaggtg ttgcacattt 300 

<210> 66 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 66 

agcagatttg tgataaactt gctgtagaag aaaccaaagg ggaacttctg ttgcaactat 60 

gtcgtttgga agatgctgca gatgtttata gaggattgca agagagaaat cctgaaaact 120 

gggcctatta caaaggcttg gaaaaagcac tcaagccagc taatatgtta gaacggctaa 180 

aaatttatga ggaagcctgg actaaatatc ccaggggact ggtgccaaga aggctgccgt 240 

taaacttttt atctggtgag aagtttaaag aatgtttgga taagttccta aggatgaatt 300 

<210> 67 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 67 

tgttcttgta gtgtttgttg ctattgttag aaagattatt agtgatatgt ggggtgtctt 60 

anctaaacaa cagacacatg taagaaaaca ccagtttgat catggagagc tggtttacca 120 

tgcattgcaa ttgttagcat atacagccct tggtatttta attatgagac taaaactctt 180 

cttgacacca cacatgtgtg ttatggcatc actgatctgc tcaagacagc tatttggatg 240 

gctcttttgc aaagtacatc ctggtgctat tgagtttgct atattagcag caatgtcaat 300 

<210> 68 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 68 

agacaaagaa aaggtggcaa tcatagaaga gttagtagta ggttatgaaa 
aagctgccgg ttatttaacc ccaatgatga tggaaaggag gaaccaccaa 
ttgggtccag tactacttgg cacaacatta tgacaaaatt ggtcagccat 
ggagtacata aatactgcta ttgaaagtac acctacatta atagaactct 
agctaaaatc tataagcatg ctggaaatat taaagaagct gcaaggtgga 

<210> 69 
<211> 300 
<212> DNA 
<213> Homo sapiens 



cctctctaaa 60 

ccacattact 120 

ctattgcttt 180 

ttctcgtgaa 240 

tggatgaggc 300 
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<400> 69 

aattcnacac gaggtggccc ataagtttta ccttttaaac atccggctgc 
gaagaataaa atcaatgtgg gaattgggga gataaaggat atccggttgg 
ccacaatgga ggcttcacca aggcgtggtt tgccatgaag acctttctta 
cttcatcatt atggtgtggt attggaggag gatcaccatg atgtcccgac 
tctggaaaaa gtcatctttg cccttgggat ttccatgacc tttatcaata 

<210> 70 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 70 

cccaaggcaa gctgttaaca aaatcaacct gggccaatca tcaaagggtt ggacctaagg 60 
ttgctatact caatagaaca agcattttaa ataaatttct cgtaagttgt tgctttcttt 120 
a tgtggtggg tgtggcttta aagagcacaa aaccacaaca aatcaaagag tagctcgggc 180 
ttgtcttttg ctttatggct gagggtttga aggatgattc atggacttgt gaatgccagc 240 
cccagtcccg gcttaggtct atctgccaat accaccaggg ccaacaaatt cacgcaacaa 300 

<210> 71 . 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 71 

ggaaatgcaa gtcaaaacag ctttgtaggt ctcagagttt gcttttaaga agtagtacaa 60 
gaaggaatag ttatatcaat acaccagtgg ctgaaattat catgaaacca aatgttggac 120 
aaggcagcac aagtgtgcaa acagctatgg aaagtgaact cggagagtct agtgccacaa 180 
tcaataaaag actctgcaaa agtacaatag aactttcaga aaattcttta cttccagctt 240 
cttctatgtt gactggcaca caaagcttgc tgcaacctca tttagagagg gttgccatcg 300 

<210> 72 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 72 

ggattctttc actgagcaca aagagttgtt ggggctttag catctgactg attttgttac 60 
ggggttgatt ctgaccatag gaagtatgca atgtgaatca ctatttacag agaaacctac 120 
aacagatgct tgatgttgta gaaactggga catatagata ccaagcaaaa ttataagaaa 180 
cctataaggt gttcaatacg cttgtgtttc caaaattcac tgtacatgat cagtttggtg 240 
ttcttgtacc acagttttta actgaaggaa ccagttgtaa cagtctcaat tttaactaaa 300 

<210> 73 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 73 

ataacacaca tcacagtatg ctctcagaaa tttctttatt tgaaccctat accaatatct 60 
gttgatcaat gaccattttt gctcagcatg gagaaacagt gccctgcatg aagggtagtg 120 



ctgtgaatga 60 

tggggatcca 120 

cgcccagcat 18 0 

ccccagtgct 24 0 

tcccagtagg 3 00 
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agaataaaaa ggatcttacc acctttatca tgagggtggc tttgctctct ccattccaag 180 
ttgttctctg ttctagaaag cagatgtagt agacatctac tgtttttgcc taaacagaat 240 
ccctttttcc tttttttgtt aaaagtactc atccctaata ttacattgtt ctggaaggac 300 

<210> 74 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 74 

cagagtcaac atggagcatc tcactgtgaa atgatccatg gattgaagga tatggtaaaa 60 
tgtttatagg ttactttgaa agtaaaatat actatgtctt ggttttgagg atattggata 120 
caaaactctc ttcctttagg gctactgaga cttgattcct gatcatcaga aatttcacca 180 
gaaacaactt gcttccaata tacccaattc tatatgaaga attcatggag agtgtactgg 24 0 

cactgnnnnn nnnnnnngan ncntgctgct ncgaanntnt nntattnact gannntgaat -300 



<210> 75 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 75 

caagagagag tgatagaatt ggcagtgaaa tatacgaacc accctcctgc cctctgggtt 60 
cacaatacgt gtacacttga ctgtgaagtg gctgtgagag tgggtggaga gttcttcttt 120 
gaccctcagc ctgcggatgc ctctagaaac ctcgtgttga ttgcaggagg agtcggaatt 180 
aaccctctgc tttccatcct gcggcacgca gcagatctcc tcagagagca ggcaaacaaa 240 
agaaatggat atgagatagg aacaataaaa ctattctaca gtgcaaaaaa taccagcgaa 300 

<210> 76 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 76 

gctagacgaa gtggtgaagc ccaaggactt atttttgagc tcgctgtaag actgagaaat 60 

cacgtactcc ttcctgaaac cactaagagg aaaaatgtct gtgacactgc atacagatgt 12 0 

aggtgatatt aaaattgaag tcttctgtga gaggacaccc agaacatgtg agatggagtc 180 

tcgctgtgtc ccccaggctg gagtacaatg gcgcgatctc ggctcactgc aacctccgcc 240 

tactgggttc aagcaagtct tctgcctcag cctcccgaga actgcaagag gaggcaactg 300 

<210> 77 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 77 

agagactttt gtttgtgttt aattagggct atgagagatt tcaggtgaga agttaaacct 60 
gagacagaga gcaagtaagc tgtccctttt aactgttttt ctttggtctt tagtcaccca 120 
gttgcacact ggcattttct tgctgcaagc ttttttaaat ttctgaactc aaggcagtgg 180 
cagaagatgt cagtcacctc tgataactgg aaaaatgggt ctcttgggcc ctggcactgg 240 
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ttctccatgg cctcagccac agggtcccct tggaccccct ctcttccctc cagatcccag 300 

<210> 78 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 78 

caggagcaat caattcctgt cgaagtgaat accatgcagc ttttaacagt atgatgatgg 60 
aacgcatgac cacagatatc aatgcactga agcggcagta ctctcgaatt aaaaagaagc 120 
aacagcagca ggttcatcag gtgtacatca gggcagacaa agggccagtg accagcattc 180 
tcccgtctca ggtaaacagt tctccagtta taaaccacct tcttttagga aagaagatga 24 0 
aaatgactaa cagagctgcc aagaatgctg tcatccacat ccctggtcac acaggaggga 300 

<210> 79 
<211> 278 
<212> DNA 

<213> Homo sapiens 
<400> 79 

gtgctgcaga ggaagacagc ctgtcaggat actgacgagg aggaggaaga ggaagatgat 60 
gatcaggctg aatacgacgc catgttgctg gagcacgctg gagaggccat ccctgccctg 120 
gcagccgcgg ctgggggaga ctcctttgcc ccattctttg ccggtttcct gccattattg 180 
gtgtgcaaga caaaacaggg ctgcacagtg gcagagaagt cctttgcagt ggggaccttg 240 
gcagagacta ttcagggcct gggtgctgct cagcccag 278 

<210> 80 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 80 

ggaacttctg agtaattggt atcatttcct agtgactcgg ctcttgtact ccaatcccac 60 

agtaaaaccc attgatctgc actactatgc ccagtccagc ctggacctgt ttctgggagg 120 

tgagagcagc ccagaacccc tggacaacat cttgttggca gcctttgagt ttgacatcca 180 

tcaagtaatc aaagagtgca gcatcgccct gagcaactgg tggtttgtgg cccacctgac 240 

agacctgctg gaccactgca agctcctcca gtcacacaac ctctatttcg gttccaacat 300 

<210> 81 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 81 

acctgtaccg cctggccact ggctgtcacc ggcgtgatga gctgccggtg ttngaacgca 60 

acctatgctg gactctcccg gcagactgcc tggatatggt cgccatgcag gaagccgccc 120 

agcacctcct cggcacacac gacttcagcg ccttccagtc cgctggcagc ccggtgccga 180 

gccccgtgcg aacgctgcgc cgggtctccg tttccccagg ccaagccagc cccttggtca 240 

cccccgagga gagcaggaag ctgcggttct ggaacctgga gtttgagagc cagtctttcc 300 

<210> 82 
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<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 82 

cccagctgga cctggtggcc ctttcctagt gcctctgctg ggggaggaga acctgggtcc 60 

acgtggaggc taggaggtct caggtgctgc cctggcagca ccagagtgtg ggccgggccc 120 

gagtgtctgc ccctcggccc tcagggtggg gcacttagca cccagaaggg accaaaagca 180 

gggcatggcg gtgcagagga gtttgggagg tgtaaacagc cccatgcacg tggaggagga 240 

gctggctttc agccccagac cccacgctag cactttccac gctgcttgcc cgctgttgat 300 

<210> 83 

<211> 272 

<212> DNA 

<213> Homo sapiens 

<400> 83 

tctagatatt gcccaatcgc tgcccacagt gcacatacct ttccaccagt cacatgtgag 60 

agggcagatt ttccaaatgc tcatcaccac ttggcactgt gtggactata attttggcca 120 

gttaggaaat ggcatctcat tgttttcatc ttaatttgcg tcagcctgat tactcattga 180 

aacttgtgag gttgagaaac ttttcttaag cttattggcc attcaagttt cctcctttat 240 

gaaatggttg ttcatgtcat ttgctcattt tt 272 

<210> 84 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 84 

cccactgccc ccggtcaaca aacccacttt tatgacagtt ttcttccgca gcttggctct 60 

taaattttac tggcaggtgt atggttgttg gagggttcct agtgagttgg gggacctggc 120 

aatagagctg cttggttgga ggaagtgaag ctggcttagt accagcagct gatctcttcc 180 

acgtgctgct gctttttttg ccactctgat actaaaccag agaaagctgc aggtggataa 240 

agaagctgtg gctgtttttt gcttttgggt ggcaatgaga aagagtcaca gtgtgggtta 300 

<210> 85 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 85 

ctgggagcca ccaacatagc agattaccat gtgaagttgc cactgctgca tctcctgaaa 60 

cctggctgat gggagaggtc tcattttgtg tctgagaatg tccaggttgt ctgcagacca 120 

cagcactgat ttcccattag cagttattat ttcctggcca tttcttcctg aaggttttgt 180 

ggttaaactc cctgtcctca atattttatc agcagtaggg ctgtcattct tctggttatc 240 

aacctctaca ttatgaagta aggttcaacc cttctgcttt tctcaggccc ccaaaacggt 300 

<210> 86 

<211> 300 

<212> DNA 

<213> Homo sapiens 
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<400> 86 

agaacattgg tgtgtgagtg ttttttgatg gtgcaggacc 
agaatagaaa catccagaat gctcctcccc atcccccaat 
gccctgtaag gcattgcctg ctcttgaccc tttggcccat 
cccatgtcac agatgccctg tctatgcaga gggtggcgtg 
taggctggtg aaggtggtga gcccttctga ggccctgata 

<210> 87 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 87 

ctccaaggaa aatccacctc gcagcttgta aatctacagc ctgattacat caaccccaga 60 
gccgtgcagc tgggctccct tctcgtccgc ggcctcacca ctctggtttt agtcaacagc 120 
gcatgtggct tcccctggaa gacgagtgat ttcatgccct ggaatgtatt tgacgggaag 180 
ctttttcatc agaagtactt gcaatctgaa aagggttatg ctgtggaggt tcttttagaa 240 
caaaatagat ctcggctcac caaattccac aacctgaagg cagtcgtctg caaggcctgc 300 



<210> 88 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 88 

ctgaaacaaa agatgtattt caattaaaag acttggagaa 
gcattactgc tatgtcagta aaagaagtcc ttcaaagctt 
actgtgagag gatcggaact tctaattatt attgggcttt 
caaggaaaca taagttggag gttctggaat ctcagttgtc 
caagcctaca gaaaagcatt gagaaagcta aaattggccg 

<210> 89 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 89 

ggggacatgt gtccctcagc tcagcagagg ctgtggtaca acatggtcct tggtgaagac 60 
ctgcacccct ggaacctccc accatcgtca caactgtagt ctcatttgca gtggagaaaa 120 
gaacccgatg tcccacagcc agatatacac ccagctccat gccagccctt catgtttacc 180 
ttttgctttg ttaattacat gtcagactcc tagagggcct ccagactaat aggaagcatt 240 
tctgtaacca acctgccacc cactgattca gaaatggaaa tcacattcca caatctatgg 300 

<210> 90 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 90 

ctcatacaga aagtcagatc aacaaagagt ccaagaaaaa tgcgacccag ctagaccatt 60 



cggaggtgct 


ttccttgcca 


60 


cccagacagc 


aattatgtca 


120 


ctttttattt 


ttaaaaaatt 


180 


ggatgggtga 


ccactaagtt 


240 


gaactttcca 


ggagttcatg 


300 



gattgctccc 


aaagagaaag 


60 


agttgatgat 


ggtatggttg 


120 


tccaagtaaa 


gctcttcatg 


180 


tgagggaagt 


caaaagcatg 


240 


atgtgaaacg 


gaagagcgaa 


300 
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tgatcccagg cttagcacac gattgcatgg catccccttt agccacttca accactgcag 120 

acatccagga agctggactc tctcctcagt ccctccagac ttctggccac cacagaatga 180 

aaaccccatt ttcaactgag ctatctttgc tccagcctga tactccagac tgtgctggag 24 0 

atagtcatac cccactggct ttttccttca ccgaggactt ggaaagttct tgtttgctag 300 

<210> 91 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 91 

aatgcaaagg gctgcagttc tcattcaggc tactttcagg atgcacagaa catatattac 60 

atttcagact tggaaacatg cttcaattct aattcagcaa cattatcgaa catatagagc 120 

tgcaaaattg caaagagaaa attatatcag acaatggcat tctgctgtgg ttattcaggc 18 0 

tgcatataaa ggaatgaaag caagacaact tttaagggaa aaacacaaag cttctattgt 24 0 

aatacaaggc acctacagaa tgtataggca gtattgtttc taccaaaagc ttcagtgggc 300 

<210> 92 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 92 

aagatatgca gagatattcc aggatctttt 
tattcgccaa cagtgtgttg aatatgtcac 
cattgcactt atcttaccaa gctcttctga 
caattctcta ccaaataaag aattgatgac 
agctaactct gagagcagtt acaactgttt 

<210> 93 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 93 

cgattcgcca gttctccatt ctgagagtca atcacgttcc tgataggttg 
ttttcttcgt tggttttaac cttctaaaca tctccaggcc actttcttag 
ggtactaaaa agaggtccta cccacacctg cctcacactt ctcctttcca 
agtttggagg ggcttgggtg tgtgtgaaca agggccctgc attgtctagg 
ccaggcttgg gttcactttc accatgcatt ggcaaaacta gaaaagtaag 

<210> 94 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 94 

tttgtgcctg agcacccaca atttcaggat 
ggatgtaacc actccttggt gagagaggga 
aggcaatctt cattctgctt ggctttagtc 
aactttgctg ggtgatccca ctgccttgat 



agctttggtg cggtctcctg 
atccattttg cagtctctct 
aggttctatt tctgaactgg 
ctcaatctgt gactgtctgt 
actgacatgt gtcagaacaa 



gagacagtgt 60 

gtgatcagga 120 

agcagctctc 180 

tggctacgct 240 

tgatgtttct 300 



tcattgattt 60 

cctttttcta 120 

aggctgcctg 180 

cctgcagttc 24 0 

cttgtgacaa 300 



ttagactgtg tggcacctca gctttcctct 60 

actcctcacc aatcccattt gacaaaggct 120 

attcttgtca ttgggctgca gaagaaaaac 180 

ttcacctcgg agcgaggctg ggccatgtcc 24 0 
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aagtcttatg aggtcaccct gactagaaaa aattgaactc acctacaaat agtctgaaag 300 

<210> 95 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 95 

gtgagtccga gcatcagtgg cttctggagc agaccagcca cgtggaagag aagccttaca 60 
gagatgggtc ggcagagccc tgctgatggc tgggccttgt gggcagccac tctgtgtgag 120 
cagggtgttg ggcccataca cttcaaagac cagagccctg cactgggaga gtgctcctgg 180 
cccaggctgg gaatcacctt tcgaggccct tcagactctg gcggggcttg ctgtggcctc 24 0 

cctccagcta gtggtgtggc tgagcagact ccagggccag ggccagttcc cttctcccct 300 

<210> 96 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 96 

acaactccag acataattaa agactggccc 
ggctcctctt ccgggagggg cgaggtcggt 
gagacagagg gcaaggacca cggccttgaa 
gattttgagc tcgagggagt gtgccagctc 
cctaaggccg aggaagcctc ttcctgggga 

<210> 97 
<211> 286 
<212> DNA 

<213> Homo sapiens 
<400> 97 

gtccagggcc cangttttaa tttnttttta 
tcacncnnnn nnnnnnnnnn nnnnnnnagg 
tcaagaccag cctgaccagc atggtgagac 
gctgggcgag gtggcaggca cctgtggtcc 
ctcttgaaac tggaaggcag aggttgcggt 

<210> 98 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 98 

caccattttt attttgatgc ttacactcat 
atttaaaaat ccttccagtt aatagagctt 
ctttgttttt cccactttaa agccacaggg 
tgcatgatgt tcaagaccgg cccttggctg 
aggtgtttca ttgtgaactc tagctttgat 

<210> 99 
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aggaggaaga gggcggtggg 
gcagaccttc ctgggagcct 
ctcagcatcc acaggacgcc 
ccagaccagt cgcctcccag 
cagtttgggt tgagttccag 



ctgtggcgcc 60 

gtcactgctt 120 

catcttggag 180 

gaacagcatg 240 

gaagagagtc 3 00 



aaaagcttta ggtcttgccg 
cctaggcggg tggatcacaa 
cctgtctcta ctggaaatac 
cagctacctg ggaggctgag 
gagccgagat tgcgcc 



ggacggtggt 60 
ggtcagcagt 120 
aaaaaaattg 180 
gcgggagagt 240 
286 



ttattctgtt tttgtaaaac agtttcggga 60 

ttgttattat attataattt tgtaaaccca 120 

tcgactcatg gatgatacct ctattgctgc 180 

ttgttacaga gatgttgggc agagctatgc 240 

catggtaaaa agttaaccct ttctattttt 300 
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<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 99 

agcctcgcct gggccggcct gtggctccca ttttcctttc agcgggacaa aggggacttg 60 

ttaccaggcc attttctgga tggcctgtga gatctctgcc cctccaagac cctccaagtc 120 

tgagcctgac ccacagctgg gacactgaat tcagccctgg gaaccatggg ggcttctatc 180 

tggcaccagg ctgcagcctc cccaatccca gcccactttg ctgtgtctct ggcgggctgt 240 

cctccttggt gggagctgtc ctgcacactg taggatgctt aaaggtatcc ctggcctcca 300 



<210> 100 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 100 

tccaaccctg gcgatgtcac cagcatggtg gctcaggtta gagctctctg aggacccagc 60 

atagagcact ggtgccaggg accaaactga gaccccacca ccgtcatcaa cacttacata 120 

ccataaaggt cttcagagtg ccttggccct agacctccct tcattctttg tagagatgga 180 

atctaagaat gaaacatctc cactcagtcc tgcaaatatg gaagttcttg agataccttt 240 

ttttggtaga tacttgtgct ggtattctga gagtcacttt actctgatgg tttgcaagat 300 

<210> 101 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 101 

gtgtttcttc tacctcccct gcacaacatt gtttatatgc cccctaaaat gtaacttctt 60 

tagattctgt tgttacgtgc aacactgtat atctctccat agcacttaat cagagtttgt 120 

aattaggcat ctttttgtgt gattatttgg taaatgtcca tatcccctac tagcctataa 180 

gctccatgac ttctaggtac cctgtctgac tacgtgtatc actggttcta ccgcctaaca 240 

ttgcctagca cattcattgc ttcacaggca tctgaatatg ggtttataaa atacattgct 300 

<210> 102 
<211> 270 
<212> DNA 
<213> Homo sapiens 



<400> 102 

cctggccctg ctgcccctcc tgaatctcgt atgatggtca cagtccggtg gccgtggggg 60 

tgctctgcct tccctggtcc ccactgccca tatctgtgga ctgccccttc caaagacccc 120 

tggaggaggt gtnnnnnnnn nnnttnntgn ncccactacc ntgcactgaa ctggccntgt 180 

tacancaann actgnncccn nttgttatna cacctntnac aaacacctgc tgctgtacat 240 

gncnctactt taaggactnn anacctgtgc 270 



<210> 103 
<211> 300 
<212> DNA 
<213> Homo sapiens 
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<400> 103 



gctggagcac gctggagagg ccatccctgc 
tgccccattc tttgccggtt tcctgccatt 
agtggcagag aagtcctttg cagtggggac 
tgcctcagcc cagtttgtgt ctcggctgct 
agaccccgag gtgcgaagca atgccatctt 



cctggcagcc gcggctgggg 
attggtgtgc aagacaaaac 
cttggcagag actattcagg 
ccctgtgctg ttgagcaccg 
cgggatgggc gtgctggcag 



gagactcctt 
agggctgcac 
gcctgggtgc 
cccaagaggc 
agcatggggg 



60 
120 
180 
240 
300 



<210> 104 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 104 

ctcgcgtctc ttcactgcac attgcaatgc atttgcgatt cccatttctc tgctaggagc 60 

cagcctgggt ggcgctgctc ccagagccgt gggtcccaga ccttgcgttc cttttgttcc 120 

tgtccgttta tcaggacacg ggccccacct gtcacgtgcc cgaggccacc caagcccagc 180 

ctgcggggcg ttcccactgc ctggatgccg gcttgagttc tgcgcacgca ggattcagtg 24 0 

tggggacggc ccctgccgga taggcctagc cctggcccag gtggtgagcg gtttgcagtg 300 

<210> 105 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 105 

gggcactgtg gggctctccc cgcctctcct gccttgtttg cccctcagcg tgccaggcag 60 
actgggggca ggacagccgg aagctgagac caaggctcct cacagaaggg cccaggaagt 120 
ccccgccctt gggacagcct cctccgtagc ccctgcacgg caccagttcc ccgagggacg 180 
cagcaggccg cctcccgcag cggccgtggg tctgcacagc ccagcccagc ccaaggcccc 240 
caggagctgg gactctgcta cacccagtga aatgctgtgt cccttctccc ccgtgcccct 300 

<210> 106 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 106 

gctcaacgcc tatgtgaccc atctccatgc cgaatacaat cgacagaagg acatctacct 60 

agcacatcgt gtggcccaag cttgggaatt ggcccagttc atccaccaca catccaagaa 120 

ggcagacgtg gttctgttgt gtggagacct caacatgcac ccagaagacc tgggctgctg 180 

cctgctgaag gagtggacag ggcttcatga tgcctatctt gaaactcggg acttcaaggg 240 

ctctgaggaa ggcaacacaa tggtacccaa gaactgctac gtcagccagc aggagctgaa 300 

<210> 107 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 107 

tgtgagtttc ctatctgttc cagactagta tcgccaatct ctcccagctc tcttctttcc 60 
tccctggcct ttgtcctgca ggaggtagca tcacctcttg gcattttgta catgctttta 120 
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aacaattgga ggagctgccc aggcagtttt atggcctcct ggttgtgtgc cttcacaccc 180 
gcctacagcc ccacctcacc atcaagcgct gagccaatgc gggtgtggct ggccctgagt 240 
tcctgagtca gctccttgcc agggccagag ctggtaacag cggggcagca gggtgggtag 300 

<210> 108 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 108 

aggttgctca cctgaaggag cacaggaggg ttttccaggc catgtggctc aggttcctca 60 

agcacaagct gcccctcagc ctctacaaga aggtgctgct gattgtgcat gacgccatcc 12 0 

tgccgcagct ggcgcagccc acgctcatga tcgacttcct cacccgcgcc tgcgacctcg 18 0 

ggggggccct cagcctcttg gccttgaacg ggctgttcat cttgattcac aaacacaacc 24 0 

tggagtaccc tgacttctac cggaagctct acggcctctt ggacccctct gtctttcacg 300 

<210> 109 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 109 

cacaaggaga agaaagttaa ttaacattga aagatgagaa gacatcttgg aagaacttga 60 

attgggcctt ggaagaagaa cagccattca aatagataga attgtggtag caaaggcata 120 

gaggtaggaa agtatagatc tccagggaca gtagtcatgg ggttggggca ctgttggaat 180 

ttaaggttgg aaggatatat tggagcccct tgaatacggt aacaaggcac accttgggca 240 

gtggagagtt atcagagtgt ttgaaaagga gggttattga gtaaataaat agactggtac 300 

<210> 110 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 110 

gacaccccag atgcagccac caccagcaga agcgatcagc tgaccccaca agggcacgtg 60 

gctgtggccg tgggctcagg tggcagctat ggagccgagg atgaggtgga ggaggagagt 12 0 

gacaaggccg cgctcctgca ggagcagcag cagcagcagc agccgggatt ctggaccttc 180 

agctactatc agagcttctt tgacgtggac acctcacagg tcctggaccg gatcaaaggc 240 

tcactgctgc cccggcctgg ccacaacttt gtgcggcacc atctgcggaa tcggccggat 300 

<210> 111 
<211> 271 
<212> DNA 
<213> Homo sapiens 

<400> 111 

cctggccctg ctgcccctcc tgaatctcgt atgatggtca cagtccggtg gccgtggggg 60 

tgctctgcct tccctggtcc ccactgccca tatctgtgga ctgccccttc caaagacccc 120 

tgggsggggt ggggnnttcc ttctannccn ntacnctatg tgtttaatnn ncntantnct 180 

ttantantat ttnccantgn tnntnatatn nttnnanana nncntnctta nnnacattat 240 

tttanntang ngatnntacc ttnntgnaan g 271 
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<210> 112 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 112 

gttccctcac cttattcctc caagttcccc cttgggaacc tctgagatta acttgataag 60 
ctccttgggc aagctcttta tcctaagatt cctcagtgag ccttatagag ttgctgcgag 120 
aattacattt gttcatgatg tcaagtgtct ggtatgtagc taatgcttat tgaacacata 180 
gtaatttatt gaataattgt catgatcact ggatgagata tagccactgt ggaggtaggc 240 
acaccagggt tttagaggct tgggatcttg caacaggatt ttcctcttgc ctctccaaac 300 

<210> 113 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 113 

cccacatgta ccaggttgag tttgaagatg gatcccagat agcaatgaag agagaggaca 60 
tctacacttt agatgaagag ttacccaaga gagtgaaagc tcgattttcc acagcctctg 120 
acatgcgatt tgaagacacg ttttatggag cagacattat ccaaggggag agaaagagac 180 
aaagagtgct gagctccagg tttaagaatg aatatgtggc cgaccctgta taccgcactt 240 
ttttgaagag ctctttccag aagaagtgcc agaagagaca gtagtctgca tacatcgctg 300 



<210> 114 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 114 

acagttagtg taaaggatct gaatggcata gacttaactc ctgtgcaaga tactcctgtg 60 

gcttcaagaa aagaagatac atatgtacat tttaatgtgg acattgagct ccagaagcat 120 

gttgaaaaat taaccaaagg tgcagctatc ttctttgaat tcaaacacta caagcctaaa 180 

aaaaggttta ccagcaccaa gtgttttgct ttcatggaga tggatgaaat taaacctggg 240 

ccaattgtaa tagaactata cacgaaaccc actgacttta aaagaaagaa attgcaatta 300 

<210> 115 
<211> 288 
<212> DNA 

<213> Homo sapiens 
<400> 115 

gtgatctgcc tgccttggtc tcccaaagtg ctgggaatac aggcatgagc caccgcactc 60 
ggccaggagc tagttttatc agcatcctgc tccactgcct tcctctagtg cagcctggaa 120 
gacatggcag cgggtagctc ctggggctga gccagaagca tcactgcagt gaaagtctct 180 
gcttacctgt ctggctcagc ttgggcaagg gctgggccat atgtgctcag ggacgtgctt 240 
ctcttgtaag gcaggaggat anaanaggac cannaanggn gggagctg 288 

<210> 116 
<211> 300 



28 



WO 99/33982 



PCT/US98/27610 



<212> DNA 

<213> Homo sapiens 



<400> 116 



tcaattagta acatctgaaa aaacagcttt 
ctggaaaaga gcagtgagag gagtccggga 
taacattcac tgggtctgcc aaaaatgtgg 
aaaggaaagg aagagttcta gagataaaga 
acagcctcat gatcacaaac atttaatgcc 



gtcctgggtg 
gatgtgtgat 
atttgtggtc 
actatatgct 
aacccaaatt 



aaaaaggatg 
gcatgtgaag 
tgcttagatt 
tggatgaagt 
atacctggtt 



ccaaaattgc 
caacattgtt 
gttacaaggc 
gtgtgaaggg 
ctgttttgac 



60 
120 
180 
240 
300 



<210> 117 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 117 

gcactttcca gaattctctc atatttgtgg gctgggatca agcctgcagc ttgaggaaag 60 

cacaaggaaa ggaaagaaga tctggtggaa agctcaggtg gcagcggact ctgactccac 120 

tgaggaactg cctcagaagc tgcgatcaca actttggctg aagcccctgc ctcactctag 180 

ggcacctgac ctggcctctt gcctaaacca caaggctaag ggctatagac aatggtttcc 240 

ttaggaacag taaaccagtt tttctaggga tggcccttgg ctgggggatg acagtgtggg 3 00 

<210> 118 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 118 

agaacgttct caggttgacc agctgctgaa tatttcttta agggaggaag aacttagtaa 60 
gtcattgcag tgcatggata acaatcttct gcaagcccgt gcagcccttc agacagctta 120 
tgtggaagtt cagaggctac ttatgctcaa gcagcagata actatggaga tgagtgcact 180 
gaggacccat agaatacaga ttctacaggg attacaagaa acatatgaac cttctgagca 240 
cccaggtttg gcatagaaat ggtacccctt gttcaaaatg aacaagaagc cttagatttg 300 

<210> 119 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 119 

gaacaaagaa ggaatgtctt cctcatgttt gggtctatag aagacgttaa agaaaacttc 60 

cagaaagtgg gtttgaggca tgagccacca cgcctggcca aaggatttaa tgaattaatg 120 

gatgtacagt gctggggctg gtattctagg gcctgcattg agactcacat tttgccatca 180 

aaagcctttt aagaggtgga ggttgcggtg agctgacatg gtgccactgc actccggcct 240 

gagtgacaga gtgagactct gtctcacaaa aaaaataatg ccctttaaat aatgaataat 300 

<210> 120 
<211> 273 
<212> DNA 

<213> Homo sapiens 
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<400> 120 

cctcagcctt ctaaaaagct ggggctacac ccagctgaag aaattgtaac taaagataga 60 

ttgtttaaag caaagcaaga aacttctgaa gaaatggaac aaagtggaga agcctcagga 120 

aagcccaaca gagagtgtgc accccagatt ccttgtagta ctcctattgc tactgaaagg 180 

acagttgcac atttgaacac tctgaaggac cgtcacccag gtgatttgtg ggcccgcatg 24 0 

cacatctcat cccttggaat atgctgcagg aga 273 

<210> 121 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 121 

agaacgttct caggttgacc agctgctgaa tatttcttta agggaggaag aacttagtaa 60 
gtcattgcag tgcatggata acaatcttct gcaagcccgt gcagcccttc agacagctta 120 
tgtggaagtt cagaggctac ttatgctcaa gcagcagata actatggaga tgagtgcact 180 
gaggacccat agaatacaga ttctacaggg attacaagaa acatatgaac cttctgagca 240 
cccaggtttg gcatagaaat ggtacccctt gttcaaaatg aacaagaagc cttagatttg 300 

<210> 122 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 122 

gttgcaagca gccttggaat agtaactctt ctcatttgtt tgggatctgg ccaccaagtt 60 
ccagaatgat acacggatca gtgcagaagt tcatcaggct ctcggacctt agggctgttg 120 
gagaaggctt cagcagcaga actgatggtg aaggctcgtg ttctccatcc tcaactttct 180 
ttgcttcgat catacacaag aatacatttg gaagggcaaa aaaatgaaca ctgtcgttca 240 
ttgcagccgt gttttgtgac acagatgcac agtctgctgt gaagaccttc tctcaagtgg 300 

<210> 123 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 123 

gtgatttcag cttccaaact ggtatacatt ccaaactgat agtacattgc catctccagg 60 
aagacttgac ggctttggga ttttgtttaa acttttataa taaggatcct aagactgttg 120 
cctttaaata gcaaagcagc ctacctggag gctaagtctg ggcagtgggc tggcccctgg 180 
tgtgagcatt agaccagcca cagtgcctga ttggtatagc cttatgtgct ttcctacaaa 240 
atggaattgg aggccgggcg cagtggctca cgcctgtaat cccagcactt tgggaggcca 300 

<210> 124 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 124 

catgctggcc agcatccctg cctgtgcaag ctctggatga gctgtgagcc cctgccaccc 60 
acacccccac tccctgccag cctggcctca gggcctctga tccatgtgca ctggagagga 120 
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gatgactgac agggccactg gggcatttcc acgttaacag cagctgccac tggcaaaaga 180 
agtgactcgc caatggaggc atctcagatg tgggcccagg agtctgggga gctactttga 240 
acagggctat ccattcattg tcccaccaaa ggctatggag cccacccacc atgtgctgga 300 

<210> 125 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 125 

ggtaaattgg ttgaattatt gtattgaagc ttgagctgta gctaaaagta atttaggttt 60 
cccctaagat gttattatgt tagggacata acacttttgg gaggttgttg tgggagatgg 120 
ttgatttagg ttttcaaaag ctagaaataa aatttacatg ccttagattt cataaaattc 180 
tgctctaatt gggtggaagg tgctgtatct aacttgtgtt cctcctaagg ttatgtccta 240 
ataactattc ttttaggagt atacttctac tttatagaag gttgcttttc tttttaattt 300 

<210> 126 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 126 

tgaagaggag atcggtgacc tgggctcctt atgtgcctga aagagtttga gtttcctgtt 60 
aactccaaat caacagtatt ttcaacaaga aatgtgcaat tgaaatcaag tgctgtttaa 120 
gtgcagctag gatttccaca ggaagacact tgcagtgaac agagttatgg agcagcaaaa 180 
acacagatct atttggaaaa agagaaaaca tatgcgttgt attttgcttc aattataaaa 240 
taccatcctc tcaaaggtgg ttctaaatta caaaggactt tgatttctag gtagattctg 300 

<210> 127 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 127 

ggtgattccc atgctgaaca gtttgatctc ctgccagagt gtcgggccac aaactgggca 60 
gcacatcagg atcacctggg ggccttcaaa aatcaaaaat ccacccccag gccatgccct 120 
ggacccactg caccaggaca agaaatccac cccaggcctc tccccagacc cactgcacca 180 
ggacaagaaa tccaccccca ggccacgccc cagacccact gccctaggat gtgggggtgg .240 
gaaccaggtg gtgctttgta aagacgtgca ggtggtaacc ccaggccccc acgctcggaa 300 

<210> 128 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 128 

tgagctggga gaaggggaga aagtttgtga agaggagatc ggtgacctgg gctccttatg 60 
tgcctgaaag agtttgagtt tcctgttaac tccaaatcaa cagtattttc aacaagaaat 120 
gtgcaattga aatcaagtgc tgtttaagtg cagctaggat ttccacagga agacacttgc 180 
agtgaacaga gttatggagc agcaaaaaca cagatctatt tggaaaaaga gaaaacatat 240 
gcgttgtatt ttgcttcaat tataaaatac catcctctca aaggtggttc taaattacaa 300 
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<210> 129 
<211> 285 
<212> DNA 

<213> Homo sapiens 
<400> 129 

ggaaagcaca aggaaaggaa agaagatctg gtggaaagct caggtggcag cggactctga 60 
ctccactgag gaactgcctc agaagctgcg atcacaactt tggctgaagc ccctgcctca 120 
ctctagggca cctgacctgg cctcttgcct aaaccacaag gctaagggct atagacaatg 180 
gtttccttag gaacagtaaa ccagtttttc tagggatggc ccttggctgg gggatnimnn 240 
nnnnnnnnnn nnnnnnnnnn nnaggaagat accatttctt gacgg 285 

<210> 130 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 130 

ccggacgcag gccctcgggc aggagcatct ggcagagtgg ggggcgtggc aggcaccctc 60 
ctttgcaggg cgaggtgggg cctctgcagc catcctggac aggccggggt ggcggcagct 120 
ttgcccacgt ggaagcgggg tgggtctcac ttgcgtggtg gcccctggcc ccatcttgcc 180 
tgctgcggcc tggggagcag gcgctgggtg gtggttctgc ctgcttgctg ctcgttcccc 24 0 

gggcatgcgt gggcagcggg gggcatgcgt gggcagcagg gggccgtggg cagcgggggc 300 

<210> 131 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 131 

gatctctata ctagtgaaca gtgccagttc cacactttgg acttagaact gttctctagt 60 
tattgtaaca cagaatactg tcaatcccta atttacttaa tgttacttat tggaagtggg 120 
gctgatgaaa tacgcacagg agggaaatct actgtgttta ggcacaggca gccccagtgt 180 
ataaggagat catattccaa aaggttgtca gttggttgtt tgcaacctgg aatgtatttt 240 
cctttagaga ccaggttatc catggtggtt aggcccctag agcagctgga aaagatgatc 300 

<210> 132 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 132 

ctcccatgga ggtggtggga atggcaccga gaagtttgat gacagttatc taatggacta 60 
gaggttggca aactttctgt aaatggccag gtagtaaata gttctgcttt tgaaggcata 120 
tggtctcttg cacctactcg aggctgaaag cagctataga caatacataa atgaatgagc 180 
gtgagtgtgt tccaataaga aaaaaacatg gctgtttgct tcggccccag ggttgtagct 240 
taccagtcct gtaacagatc acagtttgct cttttggtca caaatacttg aacccctccc 300 

<210> 133 
<211> 269 
<212> DNA 
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<213> Homo sapiens 



<400> 133 

atgctatgcc aaagcctgct gccagctcca tagcctggac ctacagcact gcatggtgga 60 

gtccacagct gtggtgagct tcttggagga ggcagggtcc cgaatgcgca agttgtggct 120 

gacctacagc tcccagacga cagccatcct gggcgcactg ctgggcagct gctgccccca 180 

gctccaggtc ctggaggtga gcaccggcat caaccgtaat agcattcccc ttcagctgcc 240 

tgtccaggct ntgcaaaaag gctgccctc 269 



<210> 134 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 134 

gatggatgag actgttgctg agttcatcaa gaggaccatc ttgaaaatcc ccatgaatga 60 

actgacaaca atcctgaagg cctgggattt tttgtctgaa aatcaactgc agactgtaaa 120 

tttccgacag agaaaggaat ctgtagttca gcacttgatc catctgtgtg aggaaaagcg 180 

tgcaagtatc agtgatgctg ccctgttaga catcatttat atgcaatttc atcagcacca 240 

gaaagtttgg gatgtttttc agatgagtaa aggaccaggt gaagatgttg acctttttga 3 00 



<210> 135 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 135 

ggcgagcggg aacagctctt gaggagtgag actgcaggag atgtgggccg tgccaaagag 60 

atggatgaga ctgttgctga gttcatcaag aggaccatct tgaaaatccc catgaatgaa 120 

ctgacaacaa tcctgaaggc ctgggatttt ttgtctgaaa atcaactgca gactgtaaat 18 0 

ttccgacaga gaaaggaatc tgtagttcag cacttgatcc atctgtgtga ggaaaagcgt 24 0 

gcaagtatca gtgatgctgc cctgttagac atcatttata tgcaatttca tcagcaccag 3 00 



<210> 136 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 136 

gacttctaaa tatatcttgg atataatagg tgataagttc tgtcaattag taacatctga 60 

aaaaacagct ttgtcctggg tgaaaaagga tgccaaaatt gcctggaaaa gagcagtgag 12 0 

aggagtccgg gagatgtgtg atgcatgtga agcaacattg tttaacattc actgggtctg 180 

ccaaaaatgt ggatttgtgg tctgcttaga ttgttacaag gcaaaggaaa ggaagagttc 24 0 

tagagataaa gaactatatg cttggatgaa gtgtgtgaag ggacagcctc atgatcacaa 300 

<210> 137 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 137 



33 



WO 99/33982 



PCT/US98/27610 



ttgacaaatt gctggaacac acttattgtg gtttacccgg ttttaattat gtcagagatt 6 0 

gcatcatcct tatgcttgtt tacatctata atcttctatg aaatggtggt accaaggggc 120 

gcccaacagc ttttatcccc attcttagag catattcttt attataatga ttatccaaca 180 

tatttcttta attttaatac aaaaaataca tcatttaatt tttgttacat atgaacattc 240 

atttttaaat gctcagcctc aagtgcaggc atttttgagt ggcctgatta catattcctc 300 

<210> 138 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 138 

ggaaggggag ggttggtgag tcccagacct taaaaataca aggttaagag ggaccccaaa 60 

gcaaaaaatt ccaacccttt tcctcccagt cattgaaaca ccaaaactat tataccggag 12 0 

ggtgtaatag ttttgctgcc cagttgtggt aggccagtag tggcctccca agatgcccat 180 

gtcctaatcc caggaacctg tcaaaattac cttgtatggc caaaggggct ttgcagatgt 24 0 

aatgaagtta aggatctttc gccaggaaga ttatcccagc ttgttcagga gggcttgatg 300 

<210> 139 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 139 

gacatcattt tcttattcta gtaagagaaa gtacacagat tcaactttag agaggacttt 60 

tttttttctg gagctaaatc aaggaaggat tatcacgtgg cctcccttga atataatttt 120 

gaagctgtga acagtaccat cagtaacatt ttatggacag ctctgatggt ttttatacca 180 

cggcactctt cttacctttg ggggaagcta tctggagtta tgactgatgt gtaaagtggt 240 

ttactgttag aatcctggtg tgctaggatt ctgggagagt cactttcagg aagttacctg 300 

<210> 140 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 140 

gctgcccagg cagttttatg gcctcctggt tgtgtgcctt cacacccgcc tacagcccca 60 

cctcaccatc aagcgctgag ccaatgcggg tgtggctggc cctgagttcc tgagtcagct 120 

ccttgccagg gccagagctg gtaacagcgg ggcagcaggg tgggtagcct ctaccagcca 180 

gggcagtccc tgaggggcca gcaggggggc tgactgccta gtggctcaac ctcctgaacc 240 

cacccactcc cagcgatgct acccagaacc ccaacggcat gaatcctgca cagtgccggg 300 

<210> 141 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 141 

cccaaactta tcgggggtgc cagaggcaga gtagacaagc cttagtggcc gccatttgtt 60 

gaatatctac tgtgcgccaa gcagtgcgtc acaactttat gaagtaggta ttattatcat 120 

ccccatttta caggtgaaga aactgagtct ctgagagacc aacttttcca aggtcacaca 180 
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gaggtgggat ccagcccact tccgtctgac cccaagcccc tgctgttaac ccctgcccca 240 
ttgtggggag gttccggccc actctggagt tctctggtct gcgtcagtcc tcaggagaag 3 00 

<210> 142 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 142 

gaaaggtggc gcgcttctca cggctgagtt gctgcgcctg cagacggaag ctccccacag 60 

gcagagctgc ttggatgtgt gagtcatgaa gccagagaag ccccgctcca tgagcagtga 120 

ctccccaggc cctgtgacct ccctcctgtc ttgcagctcc tcctggcacc agtccccagg 180 

gctctcctgt tggtagttcc tgcttttctt cttggaaatt cctcgtggac ctcgagatct 24 0 

ttaccctaaa atagttctgt tgaatttcac cctggcaatg taaattgata gcttatcttc 300 

<210> 143 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 143 

cttggccttg cttctctgag aaaactttgg tcacacctcc agagccaggg tgggtgcctc 60 
cctggaggag ggggctttcc tggttggtgg cacagcagga gtccaggctt tgtaccgtgg 120 
acaccatggg ctatggcaac accttcctca ccatccttcc atgaggacct cgggagagag 180 
tggacatgaa accctttgtg ctctgaagca ttcaacagaa gctttctggt tctgtgccta 240 
tttctttggc acttgagcgt gtttgcaggt tcattacaca catgatgaaa gctctggccc 300 

<210> 144 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 144 

cctgactgag tgcctgacgg tggaccccct cagtgccagc gtctgaaggc agctgtaccc 60 

taagcacctg tcacagtcca gccttctgct ggagcacttg ctcagctcct gggagcagat 12 0 

tcccaagaag gtacagaagt ctttgcaaga aaccattcag tccctcaagc ttaccaacca 180 

ggagctgctg aggaagggta gcagtaacaa ccaggatgtc gtcacctgtg acatggcctg 24 0 

caagggcctg ttgcagcagg ttcagggtcc tcggctgccc tggacgcggc tcctcctgtt 3 00 

<210> 145 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 145 

gccagagcct agaggagaga tcaaagaccc tggccgaagt gaagcccatt ctgcaagcaa 60 

ctgggttccc atggcatgtg gtggccttag aggaggtgtt cagcctgcca ccgtcggtgc 120 

tttggtgctc tgcccaggag ctggtgggat ccgagggggc ctacaaggcg gccgtggaca 180 

gcttcctcca gcagcagcat gtgctggggg ccgggggtgg tcctggnccg actcaagggg 240 

annnnnnnnn nnncncaacc cccgctggac ccccngaanc tggcaagacc ngctgcccct 300 
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<210> 146 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 146 

tgactttgta cctggtccaa gctgttgggg aattgctgct gttgacccag gcaggagtct 60 

gactagagaa caaactaagg ttgctgcaac aaacaaggac ctcttccaag aagggctccc 120 

aggcctggcg cagtgactca tgcctgtgat cccagcactt gggaggccga ggcgggtgga 180 

tcatttgagg ccaggagttc gagaccagct tggccaacat gatgagaccc cgtctctatt 240 

aaaaatacaa aaattagcca ggcgtggtgg cgcctgtagt cccagctact caggaggttg 300 

<210> 147 
<211> 295 
<212> DNA 
<213> Homo sapiens 

<400> 147 

ggnaangcna nngnaggaga nagagaagna ncagtnnagn cccangaaac ccnntgaaac 60 

ccttagaagn cagaggagng aaaggangaa aaananggnn ggangagaac nnannnnggn 12 0 

caaannaagg anganngnta ggngngaaaa anaanaacaa anggggaaaa ngggaaaaaa 18 0 

ggcganaaag gnaanannag nanaaggngg aananannnn annagaaagg ncaanaaaag 240 

aagnacaaag aaaaangana anaagnaann annanannga cagagacaag aagga 295 

<210> 148 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 148 

cgctgtgctt gagaccaacc tgacgggtac cttctacatg tgcaaagcag tttacagctc 60 

ctggatgaaa gagcatggag gatctatcgt caatatcatt gtccctacta aagctggatt 120 

tccattagct gtgcattctg gagctgcaag agcaggtgtt tacaacctca ccaaatcttt 180 

agctttggaa tgggcctgca gtggaatacg gatcaattgt gttgcccctg gagttattta 240 

ttcccagact gctgtggaga actatggttc ctggggacaa agcttctttg aagggtcttt 300 

<210> 149 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 149 

agtgtcagtt ttcctaatct cagtccaggt aggaattaag aaatatctca agtgttgatg 60 

ctatccaagc atgttggggt ggaagggaat tggtgcccag aaaatgggac tggagtgagg 120 

aatatctttt cttttgagag tacccccagt ttatttctac tgtgctttat tgctactgtt 180 

ctttattgtg aatgttgtaa cattttaaaa atgttttgcc atagcttttt aggacttggt 240 

gttaaaggag ccagtggtct ctctgggtgg gtactataat gagttattgt gacccacagc 300 

<210> 150 
<211> 300 
<212> DNA 
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<213> Homo sapiens 



<400> 150 

tgtagacttt atgtcagttc tgtgtagact ttatgtcagt ttttgtcatt atttgaaaat 60 

ctattctgac aactttttaa ttcctttgat cttataagtt aaagctgtaa caactgaaat 120 

tgcatggatc aagtaagcat agttttatcc agggagaaaa ataaaaggaa gccatagaat 180 

tgctctggtc aaaaccaagc acaccatagc cttaactgaa tatttaggaa atctgcctaa 24 0 

tctgcttata tttggtgttt gttttttgac tgttgggctt tgggaagatg ttatttatga 300 

<210> 151 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 151 

gcgggcccgg ccagcggaag cccctgcgcc cgcgccatgt caaagaaaaa aggactgagt 60 

gcagaagaaa agagaactcg catgatggaa atattttctg aaacaaaaga tgtatttcaa 12 0 

ttaaaagact tggagaagat tgctcccaaa gagaaaggca ttactgctat gtcagtaaaa 180 

gaagtccttc aaagcttagt tgatgatggt atggttgact gtgagaggat cggaacttct 240 

aattattatt gggcttttcc aagtaaagct cttcatgcaa ggaaacataa gttggaggtt 3 00 

<210> 152 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 152 

gatattcaca cagtatgtat tatattaacc atatcacact taagttatta aattcagact 60 
atttgtaact tattgttata gggcctgccg tatggcttag gatatttgag taatcatata 120 
tttaaagtaa aaactttggg ctgggcacag tggctcacac ctgtaatccc agcacttggg 180 
gaagctgagg tgggcagatc agttgaggtc aggagttcta gaccagcctg gtcaacatgg 240 
cgaaacccca tctctactaa aaatacaaaa attagctggg cgtggtggca cacacctgta 300 



<210> 153 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 153 

cagagaccag ccttctccag aggctgtcac tgcaggagcc gtgggcctgg gaagacttgg 60 
aagcggcctc tctcaactgg tttctgtctc cgtggagctg gaactgcctg cacttgcctt 120 
cagagggagg cacagtccac ccagatccac ctttccagca agacccccag tggctgccca 180 
gcctgggagc acctctttgc ttttcacacc aaaccaaaac tggcgagagc ccctcctagc 240 
caccagtgat ccccaagcat ccagtacaga accaggcatc gagctagctc cctgcacggc 300 

<210> 154 

<211> 300 

<212> DNA 

<213> Homo sapiens 
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<400> 154 

cttgacctct gtactttaaa gaaatcacta accaaatttt caaagtttcc ttttaaatgc 60 

gtttagctag aaatctatgt atttatccct ttcctatttt gcattcttct cccactattt 120 

ttaaaaactc atttacagta gaaaccattc ttctttctcc caacagtatc ctttgccaag 180 

accatgagaa cagtatggga gcatgttgtt ggtcagggtt tcagaatacg cgtgatgtca 240 

ctgagaatgt ttgctcacag tcaataattg tctttgtgga tgtgataatt ttggagatac 3 00 

<210> 155 

<211> 81 

<212> DNA 

<213> Homo sapiens 

<400> 155 

gatcattgtt aattagtgac atagtaacat ctgtagcagc tggttagtaa acctcatgtg 60 
ggggaggtgt gggaggtttt a 81 

<210> 156 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 156 

ggcagcacaa gtgtgcaaac agctatggaa agtgaactcg gagagtctag tgccacaatc 60 

aataaaagac tctgcaaaag tacaatagaa ctttcagaaa attctttact tccagcttct 120 

tctatgttga ctggcacaca aagcttgctg caacctcatt tagagagggt tgccatcgat 180 

gctctacagt tatgttgttt gttacttccc ccaccaaatc gtagaaagct tcaactttta 240 

atgcgtatga tttcccgaat gagtcaaaat gttgatatgc ccaaacttca tgatgcaatg 3 00 

<210> 157 * 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 157 

ctggtgagga gtctttgcga gagcgaggag cagcggttac tggaacaggt gcatggcgaa 60 

gaggagcggg cccaccagag catcctgaca cagcgggtgc actgggccga ggcgctgcag 120 

aaacttgaca ccatccgcac tggcctggtg ggcatgctta ctcacctgga tgacctccag 180 

ctgattcaga aggagcaaga gattttcgag aggaccgaag aagcagaggg cattttggat 240 

ccccaggagt cggaaatgtt aaactttaat gagaagtgca ctcggagccc actactgacc 300 

<210> 158 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 158 

cgacagctct ccaatactca ggttaatgct gaaaaatcat ccaagacagt tattgcaaga 60 

gtttaatttt tgaaaactgg ctactgctct gtgtttacag acgtgtgcag ttgtaggcat 120 

gtagctacag gacattttta agggcccagg atcgtttttt cccagggcaa gcagaagaga 180 

aaatgttgta tatgtctttt acccggcaca ttccccttgc ctaaatacaa gggctggagt 240 

ctgcacggga cctattagag tattttccac aatgatgatg atttcagcag ggatgacgtc 300 
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<210> 159 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 159 

agtacccaga gttgcgagga gttttttaac tgatttagcc aggtggcaat catgagtgaa 60 
tggatgaaga aaggcccctt agaatggcaa gattacattt acaaagaggt ccgagtgaca 120 
gccagtgaga agaatgagta taaaggatgg gttttaacta cagacccagt ctctgccaat 180 
attgtccttg tgaacttcct tgaagatggc agcatgtctg tgaccggaat tatgggacat 240 
gctgtgcaga ctgttgaaac tatgaatgaa ggggaccata gagtgaggga gaagctgatg 300 

<210> 160 
<211> 294 
<212> DNA 

<213> Homo sapiens 
<400> 160 

ctttgagcta ggataaaaat tgggtaaagg acatttgctt acctgcaaat gaatcactgt 60 
ggaaatgtga tcttcccata tcatcaagaa acttgttttc tggatgaata ctgggagaat 120 
aaaatgagaa ctctggagtg agctaaattg atcccaatta agtttttctg cttagcagac 180 
agaaggtata attttttgac accctttccc acctggtgcc tatgctaggc ttgtnctgat 24 0 

aacatccctc actnactnga tnntcacatn gnncttncnc tgangtccca tttt 294 

<210> 161 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 161 

cttcctcaaa gcatggttgc tgagtaccca gagttgcgag gagtttttta actgatttag 60 
ccaggtggca atcatgagtg aatggatgaa gaaaggcccc ttagaatggc aagattacat 120 
ttacaaagag gtccgagtga cagccagtga gaagaatgag tataaaggat gggttttaac 180 
tacagaccca gtctctgcca atattgtcct tgtgaacttc cttgaagatg gcagcatgtc 240 
tgtgaccgga attatgggac atgctgtgca gactgttgaa actatgaatg aaggggacca 3 00 

<210> 162 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 162 

gccccgtgtg gggagacgga cagcaccctc ctcatctggc aggtgccctt gatgctatag 60 

cgcctcccct ctcccctcag agggcacagc tgcaggcctg accaaggcca cgcccggctc 120 

tcgtgctcta ggacctgcac gggacttgtg gatgggcctg gactctccag aaactacttg 180 

ggccagagca aaggaaaacc tcttgtttta aaaaaatttt tttcagagtg ttttggggag 240 

gagttttagg gcttggggag agggaggaca catctggagg aaatggcctt ctttttaaaa 300 

<210> 163 
<211> 300 
<212> DNA 
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<213> Homo sapiens 
<400> 163 

gaccggctgg gcctacaaaa agatcgagct ggaggatctc aggtttcctc tggtctgtgg 60 

ggagggcaaa aaggctcggg tgatggccac cattggggtg acccgaggct tgggagacca 120 

cagccttaag gtctgcagtt ccaccctgcc catcaagccc tttctctcct gcttccctga 180 

ggtacgagtg tatgacctga cacaatatga gcactgccca gatgatgtgc tagtcctggg 240 

aacagatggc ctgtgggatg tcactactga ctgtgaggta tctgccactg tggacagggt 300 

<210> 164 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 164 

aaaatttata ngtaatgaca aatgacttat cagtgttcat catctgaaag ctaagtggtt 60 

cgttcaatca ctttttcaaa gttgatagta gattgcatgg tttcatgttt cctcatattg 120 

gtttattaat tctatttaat caaggaaaat aacttcagat tccataaagt ttcagtttat 180 

ttttagttta ctactaggtg agatagcaca ttacatactt ttactatcaa atattatttt 240 

agcagcttcc catagtacca aatgatttga ttccctactc tcatttttta aagcatataa 300 

<210> 165 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 165 

ctggactctg agtcgtcttg gtcccaggag ccagtagtga aggcaacagt ctgcccacct 60 

gtggacacca gatcctggga gctcctggtt agcaagtgag atctctggga tgtcagtgag 120 

gctggttgaa gaccagaggt aaactgcaga ggtcaccacc cccaccatgt cccaggtgat 180 

gtccagccca ctgctggcag gaggccatgc tgtcagcttg gcgccttgtg atgagcccag 240 

gaggaccctg cacccagcac ccagccccag cctgccaccc cagtgttctt actacaccac 300 

<210> 166 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 166 

cttctgttga ttggtttgtt taaagtacct aagtactacc ctttgactcc ctaccaaaag 60 
ttcttttgtt ttttaaacaa cttttatttg tgacttactt tcttgagaag tgttcttaat 120 
gaattgcata aaatagtggt agcagcttat ttcttaagta ctttattatt tgtgctttac 180 
catttcaggt tcttatcttt aacccttatt tactcagttt tccatctgaa tgatcctatc 240 
tctaaattaa ggatttaata aatgctgcaa attgtccact ttgcaaattg tccaaaagct 300 

<210> 167 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 167 
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gcgagatgaa gctacactgt gaggtggagg tgatcagccg gcacttgccc 
ttaggaaccg gggcaagggc gtccgagccg tgttgagcct ctgtcagcag 
gtcagccgcc ggtccgagcc ttcctgctca tctccaccct gaaggacaag 
gctatgagct aagggagaac attgagcaat tcttcaccaa atttgtagat 
ccactgttcg gttaaaggag cctcctgtgg atatctgtct aagtaaggat 

<210> 168 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 168 

gtctgggcag cctacgcttt ccggataaaa atggcagaat gaaagaatta 
tagagaatag gaaagacatg aaccaacgcc caaaatgaga aagaaggaca 
aagacaaata caagtgaaaa aaatatacta atggattaac gtccctgtcg 
tctgactatg gaaatgatat tagacaaaaa gcaacttcaa gtgggtttct 
caaaatgggt cataacgcag catagataac ttgaaacatg aacagcgcat 

<210> 169 
<211> 296 
<212> DNA 
<213> Homo sapiens 

<400> 169 

gagatctctg ggatgtcagt gaggctggtt gaagaccaga ggtaaactgc 
accctcacca tgtcccaggt gatgtccagc ccactgctgg caggaggcca 
ttggcgcctt gtgatgagcc caggaggacc ctgcacccag cacccagccc 
ccccagtgtt cttactacac cacggaaggc tggggagccc aagccctgat 
cctncattgg gncccctggc tanttcancn agncccncag gtngagncca 

<210> 170 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 170 

gggtgttgga gcagattgta gttgatccac agcaaagagc atcaccaaag 
aggaactaga tccaccactt cctctgctgg gcatgctcca aaaatggttg 
agaggactcc aaaagaaagc acaaaaacta gacagtggga gggcataccc 
agtttctgaa aaaatattga aagtttctat ggtgaaatag gaagttaatg 
gaaaaaagtg gtaatgattc aaggaaacat aatcacacac ggttttagtt 

<210> 171 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 171 

atggaggcac cagcaggtag tggcccctgt aagcagggcc agagtcggga caaagagcag 60 

gagtgaagca gccaagagac agaggaccag gctggagcca gtgggcacgc aggagcctgc 120 

ctgggaaaag ccggggggca aggctggcat gggaatgaac acctgctggt gacacctctc 180 



gccttggggc 60 

acttccagga 120 

cgcgggaccc 18 0 

gaggggaaag 240 

tccatatggc 300 



tgagtggaac 60 

tataaagaaa 120 

agtgacattt 180 

tatttgagtt 240 

ttggcccagg 300 



ggaggtcacc 60 

tgctgtcagc 120 

cagcctgcca 180 

ggccccgtgc 240 

aagcca 296 



ccattccagg 60 

tggcttccag 120 

aaaagccctg 180 

tgcttaggaa 24 0 

ttaatggaca 300 
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tgagcttcag ttcccttaac tagaaaaata gaacaggccc ggtgcggtgg ctcatacctg 240 
taatcccagc actttgggag gctgaggcgg gtggatcatg aggtcaggag atcaagacca 300 

<210> 172 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 172 

ggcggaggag cagaagctca agctggagcg gctcatgaag aacccggaca aagcagttcc 60 
aattccagag aaaatgagtg aatgggcacc tcgacctccc ccagaatttg tccgagatgt 120 
catgggttca agtgctgggg ccggcagtgg agagttccac gtgtacagac atctgcgccg 180 
gagagaatat cagcgacagg actacatgga tgccatggct gagaagcaaa aattggatgc 240 
agagtttcag aaaagactgg aaaagaataa aattgctgca gaggagcaga ccgcaaagcg 300 

<210> 173 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 173 

gtctttccca ttcacttctc tagaaagctg ccaagacaga ggcagaaaga aatggatgat 60 
agttctgtca agcacacttc tgttctctta gaacttagaa gtgtttctaa gagaacagaa 120 
gtaataagag aaacagttac gtgtggaatt caacatcttt ggttggaacg cattggcttt 180 
ttttttcttg ttttgataga aatggaatta agcaaaagta gtttttgtct tttctgttgt 240 
cttcaaattt caggccatct atttttaatt taatcccgtt caagtacttg attgttatac ° 300 

<210> 174 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 174 

attatttcca aagcagccta cagtagaaaa tagtcattat ggcagcagct tctgatgttt 60 
ttgtttggta ggttttctga tttcaatata tagaatcata ttcatagagt atcttctttt 12 0 

aacgaattgc acaaagtacc catttaaaat ttacatgcac agttcattgc cacctttctt 180 
aggcctatgc atagttaata aggttataat ctactcaaca tggaaaatgg agcctatttg 24 0 
caaacacaca agtaattaaa gtaccaattc tctcttagtt tcttttttta tagttggttt 300 



<210> 175 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 175 

tgganactct ttantatgga aggtgaattt cctgtcaaca tagtccagga caaagcagtt 60 
ccaattccag agaaaatgag tgaatgggca cctcgacctc ccccagaatt tgtccgagat 120 
gtcatgggtt caagtgctgg ggccggcagt ggagagttcc acgtgtacag acatctgcgc 180 
cggagagaat atcagcgaca ggactacatg gatgccatgg ctgagaagca aaaattggat 240 
gcagagtttc agaaaagact ggaaaagaat aaaattgctg cagaggagca gaccgcaaag 300 
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<210> 176 
<2X1> 300 
<212> DNA 
<213> Homo sapiens 

<400> 176 

tataaacttt attttattct cttctggggt agagttacat gacaagaaat tgaattaatt 60 

caataaaatt ttagttcggg ttgcttaggt ttttactgct cccattcttg cttttactaa 12 0 

tttatccaag attagatgtg attactattt aataataatt tagtcctcac acttacaaac 18 0 

cacttacaat accagcatgc ttctatcact gtaattctat tcaattctca ggcccatgag 240 

gcatgccagc cagacgacca gacagcattt atagagaggg cactcaatac cagccacaaa 300 

<210> 177 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 177 

gactggagaa gtcagaagta gaaaagcaga ttgctaggag agacaggatg acagattttg 60 

gtcagaaaat gggatattgg agtttaaagt atcaaataca gaatagttcc agatgttcag 120 

agatccagca tgggattagg tactgaaatg gattagaact aaaagtcact agaatttaga 180 

aattgagaac catgagagtg gatgcaatga cttgttgctt gattgaaaaa taaattaata 240 

ataataaagg accatgagac tagcctgtta tagggggtat ctccatgann nttgtttttc 300 

<210> 178 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 178 

tcctggtgtc aaacactata aacctttgac cagctgagct gtgactgctg tcacatatct 60 

gagtcctgtg tgcacagtaa tatcctgggt caggtaaaat ccaggtcttc aagttttaag 120 

gattttttga agaattcggg cttctttaag acgatccatg cccaaatcca caagcttgtt 180 

gacagtggat tacagtttgt gtggcaaagt ccaagttgtt acactgtgct ttaaaaaaaa 240 

tcttatctgc atgtattgtt aacttagaga ccatgagatc tatttatcag gaccaggaag 300 

<210> 179 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 179 

ctcatgcctg taatcccagc actttgggaa gcagaggtgg caggatcatt ccagcccagg 60 
agttcaagac cagcctgggc aacacagtga gtgagaccct gtctctattt aagaaaaaat 120 
aattaagaaa ttttattaaa aaagaagaat caggaaacca agtccaaccc aactaaacct 180 
caaatgaacc agcccctaac acagatgagg ggatttggga ctgataagct ctgtgctgtg 24 0 
tccatggccc gtcatttatc aaggctgcag ctttgtaaat gtggctattt ttatgttgtg 300 

<210> 180 
<211> 300 
<212> DNA 
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<213> Homo sapiens 
<400> 180 

gtgatctgcc tgccttggtc tcccaaagtg ctgggaatac aggcatgagc caccgcactc 60 

ggccaggagc tagttttatc agcatcctgc tccactgcct tcctctagtg cagcctggaa 12 0 

gacatggcag cgggtagctc ctggggctga gccagaagca tcactgcagt gaaagtctct 180 

gcttacctgt ctggctcagc ttgggcaagg gctgggccat atgtgctcag ggacgtgctt 240 

ctcttgtaag gcaggaggat agaagaggac caagaaggga gggagctgcc ctgtggtgca 300 

<210> 181 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 181 

cccatgccgg gatcttccca cacccgtcct cacagatcca gccccagccc cttgcttccc 60 

aggccatctc tcagcagcac ctgcaggatg cgggcacccg ggagtggagc cctcagaacg 120 

catccatgtc ggagtctctc tccatcccag cttccctgaa cgacgcggct ttggctcaga 180 

tgaacagtga ggtgcagctc ctgactgaaa aggccctgat ggagcttggg ggtgggaagc 240 

cgcttccgca cccccgggcg tggttcgtct ccttggatgg caggtccaac gctcacgtta 3 00 

<210> 182 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 182 

tttgcagtgt tgtcagaaac aaataataaa gccccaaaag attaactagt tgaaaaaact 60 
ggcaaaatct gtatacgtgg aaatttacca ggacagagac tgaagaataa agaaaatgag 120 
tttcattgcc agatcatgaa atccaaagaa actttaaaga agatgagttg tgtaaatgga 180 
actgaaggga gggaagagct gccttcgcct gggacaaaga aaacatgtgt atacacatgg 240 
gtcaagcagt gctggtctgt ggctgcctgt ccagaggaat ggaaatatcc cttgtcttta 300 

<210> 183 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 183 

cggacccatc ggagcgtaac ctggatctcc gcaggcctgg 
gggcattgct tggttcgcgt ggtagcagag gagcttgaga 
acagttcttt acatggctga ttcagaaact ttcattagtc 
aagagagcaa ggaaaagaac tagtatggaa acagcacttg 
aaacaatgcc aagtccttgg gattgtgacc ccaggaattg 

<210> 184 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 184 
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ctgttttgca gatgaggaaa ctgaggtaca gaattcttag ggaacttacc caaaatggct 60 

tttctgcact ctgccctttg gtattgtccc atgtgaattg tttaaaactt atgtgtatag 120 

tggcatgagt aggtgatttc agaaacagaa ctcacttttg ttgtttggtc ttaaaattag 180 

gaacttttct tcatctgggc ttcatttccc tgcaccttcc cagctttcta gtcatgcaag 240 

ccacatgtct ccacgtgagg ggttcattgg aaagcagcca cagagccacc ccctggctgg 300 

<210> 185 
<211> 260 
<212> DNA 

<213> Homo sapiens 



<400> 185 

attatagaga ttaatctcct ttgctcgaag tctatttaaa tattagtcac atctaaaaca 60 

tacttttaca gcaacatcta gactggtgtt tgaccaaaca actgggcatc atagctgaca 120 

cataaaatta accatcacaa ccatgttcta ggcactgttc ctcactgcct gagaagacac 180 

cgttatgttt attagggttt ttgagtttta tccacagctt ttggttatct gcaaccatgt 240 

ctcccacctt taacatagtt 260 

<210> 186 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 186 

gataaactct tcagtgacga atattagaaa aagttagtta tacatttgag gaaaactata 60 

aaagtaccaa taatgagtag gaaatcactt ctgcagtatt tttggagcat tttccttaag 120 

catgacataa aagccaaagg tcacaaggga aaaaactgat agatttgtct gtgatattga 180 

gagatgtatg cacatataca tacaacagtc atagtaagac accgttagac aaaaggtgat 24 0 

gtatgaaaaa gaggcaaaac aacaagaaga aaagattgaa aaaatgagag ctgaagacgg 300 



<210> 187 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 187 

aaaaagtaaa gcttttcatg agcacaaatc ccttgcattg tttgatgtta ctgatattcg 60 

taaaatgaat attttttgtt ttgttttgtt ttattttttt gagacaagtc ttgctttgtt 120 

gcccaggctg gagtgcaatg gcatgatctt ggctcactgc aacccctgcc ttgcgagttc 180 

aagtgattct tctgcctcag cctcctgagt agctgggatt acaggcgctc accaccacac 240 

ccagctaatt tctgtatttt tagtagacac agggttttac catgttggcc angctggtct 300 

<210> 188 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 188 

gagcattcct cctttgttaa cgaagcaaca tttacacaag atggacatta cattattagt 60 

gcatgctctg atggcactgt aaagatctgg aatatgaaga ccacagaatg ttcaaatacc 120 

tttaaatccc tgggcagcac cgcagggaca gatattaccg tcaacagtgt gattctactt 180 
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cctaaaaacc ctgagcactt tgtggtgtgc aacagatcaa acacggtggt catcatgaac 240 
atgcaggggc agattgtcag aagcttcagt tctggtaaaa gagaaggtgg ggactttgtt 300 

<210> 189 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 189 

ctaatatcca gaatctacaa agaactcaac aagaaaaaaa ccaaccccac aagcgggcaa 60 

aggacatgaa cagacatttc ccaaaagaag acatacaagc aacctaaaat aatctaaaat 120 

aatttttaaa aagaaaaaat gcttgacaga gttttgatag tacttagtaa aaagttatat 180 

ctagtggctt tttgtttgtt tgtttttgtt ttgtttttaa gaaatagtct ctgtttccca 240 

agctggagta cagtggcgca atcttggctc actgcaacct cgaactcctg ggctcaagcg 300 

<210> 190 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 190 

aaccactatg gaggcatgat tggtggccac tacactgcct gtgcacgcct gcccaatgat 60 
cgtagcagtc agcgcagtga cgtgggctgg cgcttgtttg atgacagcac agtgacaacg 120 
gtagacgaga gccaggttgt gacgcgttat gcctatgtac tcttctaccg ccggcggaac 180 
tctcctgtgg agaggccccc cagggcaggt cactctgagc accacccaga cctaggccct 240 
gcagctgagg ctgctgccag ccagggacta ggccctggcc aggcccccga ggtggcccca 300 

<210> 191 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 191 

gcggcgctga cccggccggc cccacacccg ctcttcctct tctttgccgc ggactccctt 60 

tcctgcctcc aagacctggt gtctcccact gtgagcccag ctgtcccaca ggcagtcccc 120 

atggacctag actcaccttc cccttgcctc tatgaacctc tgctgggccc agcccctgtc 180 

ccagctcccg acctgcactt cctgctggac tcaggcctcc agctccctgc ccagcgagcg 240 

gcctcagcca ccgcctcccc tttcttccgg gccctgctgt caggcagctt tgcagaagcc 300 

<210> 192 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 192 

gacagaccgt tgagaggacg tggaggcccg agagggggta tgcgcggcag aggcagaggt 60 

ggccctggga acagagtttt tgacgctttt gaccagagag gaaagcgaga atttgaaaga 120 

tatggtggga atgacaaaat agcagtcaga actgaagaca acatgggtgg atgtggagtt 180 

cgaacctggg gatcgggtaa agataccagt gatgtggagc caactgcacc gatggaggaa 240 

cccacagtgg tggaggagtc ccagggcacc ccggaagagg agtctccagc caaagttcct 3 00 
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<210> 193 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 193 

ctcaagaaag gagaagtttt tttgtatgaa attggaggaa atattgggga acgctgcctt 60 

gatgatgaca cttacatgaa ggatttatat cagcttaacc caaatgctga gtgggttata 120 

aagtcaaagc cattgtagaa gacttaacaa gctgcagata accatgtgga cttctgtcat 180 

aattcttgct gagtcaagag tgtaaataaa agaaatggca ggactcatat tattcagttg 240 

tacccaagta tttaaaaatg actctcttaa gccttaaaaa gtcatagatt tgtgctgctg 300 

<210> 194 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 194 

cagaagctta gtcatatttc aaaatgatca aatatcaaga aaaattctga gctgcataac 60 
ttgtataaag taattttcag tgattttttt catggttatg ataaaagaac tggattagca 120 
gaaactttta ccctgaatca agatttaatt tttctttgag ctcatcttaa ggatatcgga 180 
acatagggag caaacgatgg tgtggctgcc tcagtgcttg atttttaacg gttttgaaga 24 0 

gaatagttac atttcttctc ctagtaagaa ctaataaata cattaacaga aatgaattcc 3 00 

<210> 195 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 195 

ctctactaaa aatacaaaaa ttagctgggc gtggtggcac acacctgtaa tcccagttac 60 

ttgggaggct gaggcacaag aatcgcttga acccgggagg cggaggttgc agttagccaa 120 

gatcgccctg ctgcactcca gcctgggcaa cagagggaga ctctgtctcc aaaaacaaaa 180 

acaaaaactg ttagtgaagg ttccctggga cttttgatat tttaaaaatt gatcttatga 240 

ctaagtagat aaattcattg ccataatgag gctagctccc agataaacag cgtattttct 300 

<210> 196 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 196 

tggatactga caatggtggc aggcatttca agccttttaa attagtactt 
tgcttattaa aattttgtta attttagcaa agaccaattg ttgtgataaa 
ttggatgctt caagcacacg ttaaccaatt ttttaattcc ccttttggtt 
ttctaaaata ggactttcat attattaaaa cctcaaaaga tgatccaccc 
aagatcacca aggggaaaga aaacattttt tatctttaca gaaaacatgt 

<210> 197 
<211> 300 
<212> DNA 
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<213> Homo sapiens 
<400> 197 

atccagatgg gatacctcta aacacgaaaa gaaagaagat tccattagtg aatttttaag 60 

tttggctaga tcaaaagccg agccacctaa acaacagtcc agccccttag taaacaaaga 120 

ggaagagcat gcaccagaat catccgcaaa tcagacagtc aacaaagatg tggacgcaca 180 

ggctgaagga gaagggagcc gcccatccat ggacttattc agggccatct ttgccagttc 240 

ctcagatgaa aagtcctcat cctccgagga tgagcaaggt gacagtgaag atgatcaggc 300 

<210> 198 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 198 

gcaacatttg tctacaactc tactgtaaaa ttggaaatgc ttttccacag aaaaacctct 60 
caaaatgctg aatgcaaaag ttgggatcac agaaacattg tgcctatttt tggtctgctg 120 
gaaactgtat ttttacaagg taatccctgt tttcaatata gttcctgtct tgccactggc 180 
ggttttcttg tagcattttt ctagttctga gattgctact acccaaagta ttcatttctt 240 
tcttactggg gtgtcctctg tcttcacagc ctgcttctgg attgtaggtt ttttcctttc 300 

<210> 199 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 199 

gcaacatttg tctacaactc tactgtaaaa ttggaaatgc ttttccacag aaaaacctct 60 

caaaatgctg aatgcaaaag ttgggatcac agaaacattg tgcctatttt tggtctgctg 120 

gaaactgtat ttttacaagg taatccctgt tttcaatata gttcctgtct tgccactggc 180 

ggttttcttg tagcattttt ctagttctga gattgctact acccaaagta ttcatttctt 240 

tcttactggg gtgtcctctg tcttcacagc ctgcttctgg attgtaggtt ttttcctttc 300 

<210> 200 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 200 

agtagaaaaa tacaaagact gtgatccgca agttgtggaa gaaatacgcc aagcaaataa 60 

agtagccaaa gaagctgcta acagatggac tgataacata ttcgcaataa aatcttgggc 120 

caaaagaaaa tttgggtttg aagaaaataa aattgataga acttttggaa ttccagaaga 180 

ctttgactac atagactaaa atattccatg gtggtgaagg atgtacaagc ttgtgaatat 240 

gtaaatttta aactattatc taactaagtg tactgaattg tcgtttgccc tgtaactgtg 300 

<210> 201 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 201 
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ttctactttg ggtccgcgcg aagcccactc acgtgtgatc tgtgttgccc ctctcggtgg 60 

tcccaggcga tccagccatg ccccctgccc ctctgcccag atgcttcagg ggcccggctt 120 

ttcaggcttg ccctcaccag cggccgtcag ccgacactca gggatgtagc taacaccact 180 

ccgccagtgc tttcagtagg aagagctgag gctgcctggg aggcccgggg cgaccggaaa 240 

agggctctct caagttctga aaagagaatc tgccaccaga tcgaatttcg acccctgagc 300 

<210> 202 

<211> 281 

<212> DNA 

<213> Homo sapiens 

<400> 202 

ggccatggga cagttgcaac agcagttaaa tggactgtca gtcagtgaag gtcatgattc 60 

tgaagatatt ttgagcaaaa gtaacctgaa cccagatgcc aaggagttta ttccaggaga 120 

gaagtactga gccgagaaag ctttgaggaa gacttgtctg tccccacatc tggggatagt 180 

aatgcccaaa atggtggagc tgaagagggg gatggggcgg gcgaggggtg cacagcggga 240 

aggggagtgg tggtctcacg atactgtgac tctgagtaac t 281 

<210> 203 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 203 

gccctcagcc acccccatcc ctgccccttc tgagactcac agcacccctt tccttcctct 60 
cctcccacct cctccctcag cccctcattc tccttgggaa tctgcagagg gctctgggac 120 
tcactgccgg atgtgaaatc caggcgtcag ctgtttccta ggcaagggca ggaaagtggt 180 
ctccagccct tgctccactc atgcctgggg gcctggggct gagtggtatc cctacctggc 240 
ctccccctgg cctctgggcc tccagcgctg ggtttgtcga gtgagagaga gagaggagct 300 

<210> 204 
<211> 269 
<212> DNA 
<213> Homo sapiens 

<400> 204 

gcggactctc aggacgaaaa gagccaaacc tttttgggaa aatcagagga agtaactgga 60 

aagcaagaag atcatggtat aaaggagaaa ggggtcccag tcagcgggca ggaggcgaaa 120 

gagccagaga gttgggatgg gggcaggctg ggggcattgg gaagagcgag gagcagggaa 180 

gaggagaatg agcatcatgg gccttcaatg cccgctctga tagcccctga ggactctcct 240 

cactgtgacc tgtttcagga gcctcatat 269 

<210> 205 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 205 

ttctactttg ggtccgcgcg aagcccactc acgtgtgatc tgtgttgccc ctctcggtgg 60 
tcccaggcga tccagccatg ccccctgccc ctctgcccag atgcttcagg ggcccggctt 120 
ttcaggcttg ccctcaccag cggccgtcag ccgacactca gggatgtagc taacaccact 180 
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ccgccagtgc tttcagtagg aagagctgag gctgcctggg aggcccgggg cgaccggaaa 240 
agggctctct caagttctga aaagagaatc tgccaccaga tcgaatttcg acccctgagc 300 

<210> 206 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 206 

gggattacag gcatgaccca ccgcgcccag cctgtaattt cttatacttg gtattttgta 60 

cttggattat gcttctgata cgctataatt atttatgtac atgttatttt tcttcaatag 120 

actgtgaact cttcgaatgt aggactccta gagctagata ctcaattatt ttttattaaa 180 

ttgaatgact tgaaactaca gatcctttat ttaaacttcc caaatttctg ctttatctag 240 

gcaactcttt aaattctttg atctcatgta gattccaaag gctgaaataa ttgagatttt 300 

<210> 207 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 207 

tcctgaagct cggggggctg caggtcctga ggaccctggt gcaggagaag ggcacggagg 60 
tgctcgccgt gcgcgtggtc acactgctct acgacctggt cacggagaag atgttcgccg 120 
aggaggaggc tgagctgacc caggagatgt ccccagagaa gctgcagcag tatcgccagg 180 
tacacctcct gccaggcctg tgggaacagg gctggtgcga gatcacggcc cacctcctgg 240 
cgctgcccga gcatgatgcc cgtgagaagg tgctgcagac actgggcgtc ctcctgacca 300 

<210> 208 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 208 

attccaaagg tttcaaagaa cttggtcata aatatgataa tgagaagaca aagtatttat 60 

attaaaacag tttagtagcc ttcagttttg tgaaaatagt tttcagcaca gaaactgact 120 

tctttagaca aagttttaac caatgatggt gtttgcttct aggatataca ctttaaaaga 180 

actcactgtc ccagtggtgg tcattgatgg cctttagtaa attggagctg cttaatcata 240 

ttgatatcta atttctttta accacaatga attgtcctta attaccaaca gtgaagcact 300 

<210> 209 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 209 

gagacagcag cccccaggga atgaagctga tgccagagtc agacccgagg aggaagagga 60 

gccactgatg gagatgcggc tccgggatgc gcctcagcac ttctatgcag cactgctgca 120 

gctgggcctc aagtacctct ttatccttgg tattcagatt ctggcctgtg ccttggcagc 180 

ctccatcctt cgcaggcatc tcatggtctg gaaagtgttt gcccctaagt tcatatttga 24 0 

ggctgtgggc ttcattgtga gcagcgtggg acttctcctg ggcatagctt tggtgatgag 300 
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<210> 210 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 210 

gtaacgtgac acgtatttta cttcttttag taggcggaca cactttctta aagtggtaat 60 
acgtcatggc cctgctataa ggtagtagtt ctagaagact gtttatctaa taattcagac 120 
taaagctatt tatattgctg tgacaccacg tggaaaactt ttataattcc atcttatttc 180 
tgatgtatat gttttatttt ctctgccttc ataagaacta aaaaccaaag ttatttacgt 240 
gaaaacaaga tttttgtttg agttcattta cttgagatat gtttaaaaaa tccaccttct 300 

<210> 211 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 211 

gtccgtcagc tggtagcttt cattcgtaaa agagataaaa gagtgcaggc gcatcgaaaa 60 
cttgtggaag aacagaatgc agagaaggcg aggaaagccg aagagatgag gcggcagcag 120 
aagctaaagc aggccaaact ggtggagcag tacagagaac agagctggat gactatggcc 180 
aatttggaga aagagctcca ggagatggag gcacggtacg agaaggagtt tggagatgga 240 
tcggatgaaa atgaaatgga agaacatgaa ctcaaagatg aggaggatgg taaagacagt 300 

<210> 212 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 212 

gcctgctgct tcatgccgcc ggcgtcctgc tccacgtctc tgtgctgctg ggccctgcac 60 

tgtcggccct gctgcgagcc cacacgcccc tccacatggc tgccctcctc ctgcttccct 120 

ggctcatgtt gctcacaggc agagtgtctc tggcacagtt tgccttggcc ttcgtgacgg 180 

acacgtgcgt ggcgggtgcg ctgctgtgcg gggctgggct gctcttccat gggatgctgc 240 

tgctgcgggg ccagaccaca tgggagtggg ctcggggcca gcactcctat gacctgggtc 300 

<210> 213 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 213 

ggtatggttg gagtgtagga atgaatattc atgaaatgtt 
taattcatac aatgaatgta tttggaatac ttacatatta 
caagaggtat cctgttctgt aagatcagat gtttttattg 
agagacagaa aataccccct tatcagtccc ttagtgcctc 
agaaaaccca tgctgaaaag attgtacttt gtgatcccaa 

<210> 214 
<211> 300 
<212> DNA 
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<213> Homo sapiens 
<400> 214 

ggaaagggcc ctaaaagaga tgaacaatac ccgtatcatg tggaatgaat tagaaaccct 60 

tgtcagagcc catatcaaca actcagagaa acatcaaaga gtcttggaat gtctgatggc 120 

atgcaggagc aaacccccag aagaggaaga acgaaagaaa cgaggaagaa agagggaaga 180 

caaagaggac aagtcagaga aagcagtgaa agattatgaa caggaaaagt cttggcaaga 240 

ctcagagaga ttaaaaggaa tcttagaacg tggaaaagaa gaattggctg aagctgagat 300 

<210> 215 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 215 

atacttttta aacctttttt ggcagctcag atggtgtaaa ttttaaaatt ttgtataggt 60 
atttcataac aaaaatatgt atttcttttt tgttatttta tcttgaaaac ggtacatatt 120 
ttagtatttg tgcagaaaaa caagtcctaa agtatttgtt tttatttgta ccatccactt 18 0 

gtgccttact gtatcctgtg tcatgtccaa tcagttgtaa acaatggcat ctttgaacag 24 0 

tgtgatgaga ataggaatgt ggtgttttaa agcagtgttg cattttaatc agtaatctac 300 

<210> 216 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 216 

gcagatattt actgaaggaa tctaggttgt attttcagtg gacaatggga ataaagcatt 6 0 

tctaaagcac cgactggaga ggaaggcaac agagacaagg agagaagccg agagacatgt 120 
ctgcgtgctg ccacgcatct gagcgattgc tctgtgaaga gttgtacact gaacattttc 180 
aggggaggct gtttacccag gcaatgtcct caaacaagcc tgtgccgggg agtcctggaa 240 
tctgtgccag gactgtgttt ttagcccttc acctctcagc tttagcagga catgaaccag 300 

<210> 217 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 217 

cccccatctt cactggttat tccacttatt taaaatgtcc agaataagca aatctccata 6 0 

tagaggaagt agattagtgg ttgcttcggg atgggaggaa tgggaagatt gaggtctttc 120 
ttttgcagtg ataaaaatgt cctaaaattg actgtagcga tggtcacaca actctgaata 180 
tgcttaagac cattgaatta cacactttac gttggtgaat tgtatggtat gtaaattata 240 
gttcaataac atagttacaa aagataatca aaagcatgaa agcactgttg atgtggtttg 300 

<210> 218 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 218 
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acggcctggt ggagcagctg tacgacctca ccctggagta cctgcacagc caggcacact 60 

gcatcggctt cccggagctg gtgctgcctg tggtcctgca gctgaagtcg ttcctccggg 120 

agtgcaaggt ggccaactac tgccggcagg tgcagcagct gcttgggaag gttcaggaga 180 

actcggcata catctgcagc cgccgccaga gggtttcctt cggcgtctct gagcagcagg 240 

cagtggaagc ctgggagaag ctgacccggg aagaggggac acccttgacc ttgtactaca 300 

<210> 219 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 219 

caactagaga agattggaca gcaggtcgac agagaacctg gagatgtagc tactccacca 60 

cggaagagaa agaagatagt ggttgaagcc ccagcaaagg aaatggagaa ggtagaggag 12 0 

atgccacata aaccacagaa agatgaagat ctgacacagg attatgaaga atggaaaaga 180 

aaaattttgg aaaatgctgc cagtgctcaa aaggctacag cagagtgatt tcagcttcca 240 

aactggtata cattccaaac tgatagtaca ttgccatctc caggaagact tgacggcttt 300 

<210> 220 
<211> 260 
<212> DNA 
<213> Homo sapiens 



<400> 220 

ggtaagtcag gtgattgaat cccggaaagg ttcattgtct tcaagctcac aatactattt 60 

tgggacaaac agttgtctag tgtttggact catgaaccct gattcttgag ggtggtattt 120 

tactgctttt gtgatttggt ttcaacatat atagtctttt ctccggagtt accttaggtc 180 

agtggccagt gtttcagccc ctggaaaggg catgggctgc cactgaggtt ggtcacaggc 240 

ctctcagctc atggtgggag 260 

<210> 221 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 221 

gggttccatc ccttccaccc aggaaatgga ggcacgactt gcagcgttgc agggcagagt 60 

tctaccttct caaacccccc agccggcaca tcacacaccg gacaccagga cccaagccca 120 

gcagacacag gatctgctaa cgcagctggc agctgaggtg gctatcgatg aaagctggaa 180 

aggaggaggc ccagtgaccc tccaggacta tcgcctccca gacagtgatg acgacgagga 240 

tgaggagaca gccatccaaa gagtcctgca gcagctcact gaagaagctg ccctgatgag 300 

<210> 222 

<211> 300 

<212> DNA 

<213 > Homo sapiens 



<400> 222 

gcggtgaccc acgtgtcctg catgattgcc ctactgctgt ggagacctcg tgctgaccat 60 

ctggcagtgt tcttcgtatt ctctggcctg tggggcgtgg cagatgccgt ctggcagaca 120 

caaaaeaatg ctctctacgg cgttctgttt gagaagagca aggaagctgc cttcgccaat 180 
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taccgcctgt gggaggccct gggcttcgtc attgccttcg ggtacagcac gtttttgtgc 240 
gtgcacgtca agctctacat tctgctgggg gtcctgagcc tgaccatggt ggccgtatgg 300 

<210> 223 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 223 

gccccctctg gatcctgagc tccctgctct agacagtgat ggtgattcag atgatggcga 60 

agatggtcga ggtgatgaga aacggaaaaa taaaggcact tcggacagct cctctggcaa 12 0 

tgtatctgaa gggggaaagc cctcctgaca gccaggagga ctctttccag ggaagacaga 180 

aatcaaaaga caaagctgcc actccaagaa aagatggtcc caaacgttct gtactgtcca 240 

agtcagttcc tgggtacaag ccaaaggtca ttccaaatgc tatatgtgga atttgtctga 300 

<210> 224 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 224 

ctgcggccgc aggagctgtg gcggttttcc taatcctgcg aatatgggta gtgcttcgtt 60 

ccatggacgt tacgccccgg gagtctctca gtatcttggt agtggctgag tccggtgggc 120 

ataccactga gatcctgagg ctgcttggga gcttgaccaa tgcctactca cctagacatt 180 

atgtcattgc tgacactgat gaaatgagtg ccaataaaat aaattctttt gaactatgat 240 

cgagctgata gagaccctag taacatgtat accaaatact acattcaccg aattccaaga 300 

<210> 225 
<2ll> 300 
<212> DNA 

<213> Homo sapiens 



<400> 225 

gccccgctcc atgagcagtg actccccagc tcctcctggc accagtcccc agggctctcc 60 

tgttggtagt tcctgctttt cttcttggaa attcctcgtg gacctcgaga tctttaccct 120 

aaaatagttc tgttgaattt caccctggca atgtaaattg atagcttatc ttcacagatg 180 

ccagacaatg gacaactcac catcagtcct ctgctcacct gagacaaatg catgtctgat 240 

tgcttcctct gccctattgt ttatgtgaaa atgcagattc actgagccag actaaggcat 300 

<210> 226 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 226 

tatataacaa cttttgcttt caaagttggg tgggactaga acacacaatg gaaggatgga 60 

gtcaggagac ctggattctt gtgcccgctc tggcttttac agtctgccta actctatgca 120 

gtcacttcct gccagcctgt ttccttacct acaagaggga gagacactcc ctggccagcc 180 

tagttctcag ggtgaacgaa aggtcattat cactgcatcc tctagtcatt tgcttcttcg 24 0 

ctaattaaca catcttgagc acctgcgatg ttccaggaac aggagatggc agcgtgcaag 300 
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<210> 227 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 227 

ttgctgaaat gggcacttct gctgtgatgc tagtgttgat tttgctctca gatgaacaca 60 
atgtctcata ctaaccaaga agcaagaaaa gccccatgca ttcatttttc acttggagtg 120 
acaatgggag aggtcaggaa tcaagttcac tttcaagatc taagggagtc cactatctgt 180 
gcaattgtat ttggcttttt tttgcactgt ttcaatgctg gtaattgaaa ccattttaat 240 
atatttggtt gtattcactt tatatgtcct tccaaaaatg ttgttgtgta cataccatgc 300 

<210> 228 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 228 

gctatgtatt gtgtcctacc atgaattcac tccatgctag ccacattggc ctgtatggct 60 
attccttgga cacacctagg atgttcttgc ctcttagctt gcctaccttt ctctcatcat 120 
ttgggcctca gcgaggatat catctcctca gagaagcctt ctgtgaccat gctatctaaa 180 
atactccagc acttcagtca ccctttatcc cattactctg ctttttcaga aacattggtg 240 
ctccctgaaa catatttgtt tacttgctta gtgtcttttc tcccgcacta ccatgtaagc 300 

<210> 229 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 229 

gattttcgaa actcttcagc tacttgccct tttttatctg aaaccatcat accttctgaa 60 

agaaaaaagc atatcttcat tgacataaca gaagtgagat ggcccagtct tgatacagat 120 

ggtaccatga tatatatgga gagtggcatt gtgaagataa catctttaga tggtcatgca 180 

tacctctgcc tgcccagatc tcagcatgaa tttacagtac attttttgtg taaagttagc 240 

cagaagtcag actcatctgc agtgttgtca gaaacaaata ataaagcccc aaaagataaa 300 

<210> 230 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 230 

acttcttgtt tgcctttttt ataaggaaat gttggagagt tacatcattg ctaatgtaga 60 

aatgttaagt ggaaaaatat acagtttggt aaaataaact agattctaca tttatttgtg 120 

ggtttttttc ccctcctttc tttccacagc acttttgata tcaagcaagt ggcttccttt 180 

ttgagatatt aaaaaaaaaa agaaaaggaa aaaagtaaat gannnnnnnn nnnnnaaccc 240 

tttctnattn gnattngttt nagnattgng aagttgngtt aaanagtnct agntagaaat 300 

<210> 231 
<211> 300 
<212> DNA 
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<213> Homo sapiens 
<400> 231 

tgattctttt tgttnttttt tttgatattg acaaaagctt anncnttncn attaaaaang 60 

ccactaatta gactttttan ntaaaaaang tagggggttt taaaactact ttcctactac 120 

caaaaaatca naaagtatct agctttctaa atngggaaag caagcaatgt tataaaaacn 180 

ctgaaggaat ctctttcttc gggacctttt gttaaactcg gttnaagctg taaaccttat 240 

ttaaaataaa atttaccaca naacaggaaa tanaacctgg ggaanactcn aaatacncct 300 

<210> 232 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 232 

ggaagccaag gcctggagct gcaggtcccc cggcatctct ctctgtcccg 
atggcctggt gcccccacct gctgcagcag gagccccaag gagtgctagc 
tgctggggtg gtcctcatgg acagtgaggt gtgcaagggt gcactgaggg 
ggatcacctg ggttccaggc catccttgct gagcatcttt gagcctgcct 
gcagaaaagg ccagaccctg ctgagttaga ggctgctggg atccactgtt 

<210> 233 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 233 

gaggaagagg cctgctccac ttgtctggga acctgggcag gaggcacaga ggaagccaag 60 

gcctggagct gcaggtcccc cggcatctct ctctgtcccg gcagcccagg atggcctggt 12 0 

gcccccacct gctgcagcag gagccccaag gagtgctagc tgagggtggt tgctggggtg 180 

gtcctcatgg acagtgaggt gtgcaagggt gcactgaggg tggtgggagg ggatcacctg 24 0 

ggttccaggc catccttgct gagcatcttt gagcctgcct tccggtggga gcagaaaagg 3 00 

<210> 234 
<211> 300 
<212> DNA 

<213> Homo sapiens 

<400> 234 
ggaacataat tagcttactg atttgatggt 
gtttgccttt ctttaactct ggctccttct 
attgagtgag gaggacaggc agaactgtgt 
gctcttggtt ttttgctcac ttttatatgt 
agtagaggat cagatcattg ggacttgttc 

<210> 235 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 235 
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gcagcccagg 6 0 

t gaggg tgg t 120 

tggtgggagg 180 

tccggtggga 24 0 

tccacacagc 3 00 



tctgtgtagt tcctgaaact 
ccttcttctg tttgtgtatc 
ctgccaagga ccggatgtac 
aaggtattag tacaaaccta 
tggtttcaag aaagaattaa 



cttggctctt 60 

tgtttaattc 120 

ttctttcctt 180 

aaggagagaa 240 

caaattgccg 3 00 
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gttggctcaa gggccaccag aagcatttct ttattattat tattttttaa cctggacatg 6 0 

cattaaaggg tctattagct ttctttccgt ctgtctcaac agctgagatg gggccgccaa 120 

ggagtgcctt ccttttgctc cctcctagct gggagtgacg ggtgggagtg tgtgtgccca 180 

Sfgtgggggtg tctcctggct gggaaggagg gaaagggagg gagagttttg cgggggttgg 240 

cagtggagag caggctggag aggagatggc taatagctgt ttaatggaaa cctgctgggc 300 

<210> 236 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 236 

gaatcatcga aggttgagac cgtgtctagt tacatagtta taaataccca tctatgtact 60 

gatgccttct aaatgtctat ctccagtatg gtcttttcct ttaagctcta gatccattga 120 

caccctcacc atctctaaaa ggcatttcaa actgaacaca tctgatacag aacttttcat 180 

ttccttccca actttgccca cgccagcctg ctcctccttc acgctttcca cttagtatat 240 

gatcccacta ttcactcagt ctctgaagct taaaacctag gattcatcct tgactactgt 300 

<210> 237 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 237 

caggacatgg agcagtacct gtccactggc tacctgcaga ttgcagagcg gcgagagccc 60 

ataggcagca tgtcatccat ggaagtgaac gtggacatgc tggagcagat ggacctgatg 120 

gacatatcgg accaggaggc cctggacgtc ttcctgaact ctggaggaga agagaacact 180 

gtgctgtccc ccgccttagg gcctgaatcc agtacctgtc agaatgagat taccctccag 240 

gttccaaatc cctcagaatt aagagccaag ccaccttctt cttcctccac ctgcaccgac 300 

<210> 238 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 238 

cactggctac ctgcagattg cagagcggcg agagcccata ggcagcatgt catccatgga 60 

agtgaacgtg gacatgctgg agcagatgga cctgatggac atatcggacc aggaggccct 120 

ggacgtcttc ctgaactctg gaggagaaga gaacactgtg ctgtcccccg ccttagggcc 180 

tgaatccagt acctgtcaga atgagattac cctccaggtt ccaaatccct cagaattaag 240 

agccaagcca ccttcttctt cctccacctg caccgactcg gccacccggg acatcagtga 300 

<210> 239 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 239 

atttcctcca gtcctgggcc ccatccttga gggccttccc agccagccag caggagaggc 60 

aagaactggg ggaacacagg aacctagggg aggaggggag cgctgggcat cctcaggctg 120 

gcggccaagg cctgcccctg gaggcactag aggagggcat ctgtctgtgg gagcccagag 180 
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gctgcaggga ggaggaggag ggaggtatct ggtgtgagcg ttgcccctgc gacatttggg 240 
accacacagg tgggcttcct tattccctga caaagcctct gtttccagct cttccgccct 300 

<210> 240 
<211> 274 
<212> DNA 

<213> Homo sapiens 



<400> 240 

catgagtgat attttggtct gggtttcctc ttaagatttt agtttgtctg aattaaggaa 60 

aaatgttttt aatatacatt cttattttgt cccacccctc cagaaataag ctggaaatct 120 

taactttttg gggggtcttt tttggtgttt taatgggccc agaactgtgg tttaaatttt 18 0 

tatgtatgta ttttcttttt tgtggagtat aaatttaaaa actggatttg ggacctaaaa 240 

tactcctcag gttgatgtat tcatgaaagt ttta 274 



<210> 241 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 241 

ctgttgcctg ccaagctcag ggcccattta tcatgcatct tcccatcctt gtctccccca 60 
actgtccctt acctgagtca caatttcgcc aaagccaaag ggattgtcct aagccaatgt 120 
tgatttatca ctcttcctgc tcaaaagccc ccaagatcac ctatcaatca cctacttgag 180 
tgcaagcttt gactctgtca cctgacattc aagtccccct ctgcccccat gccagtctta 24 0 

tcccctcccc tacatatgcc ctatgcctca gtttgccttc cctccacttt aaaaagcctc 300 



<210> 242 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 242 

ccgctggcta tgtggacgct ggggcagagc caggccggag tcgaatgatc agccaggaag 60 

agtttgccag gcagctacag ctctctgatc ctcagacggt ggctggtgcc tttggctact 12 0 

tccagcagga taccaagggt ttggtggact tccgagatgt ggcccttgca ctagcagctc 18 0 

tggatggggg caggagcctg gaagagctaa ctcgtctggc ctttgaggta atggggggtg 24 0 

gcggtggtgg ggggtgctta gtggctatgc tcaccccgct ccaggaggcc tattttggta 3 00 

<210> 243 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 243 

caagatctgg aggaatgcag agaggaactt gatacagatg aatatgaaga aaccaaaaag 60 
gaaactctgg agcaactaag tgaatttaat gattcactaa agaaaattat gtctggaaat 120 
atgactttgg tagatgaact aagtggaatg cagctggcta ttcaggcagc tatcagccag 180 
gcctttaaaa ccccagaggt catcagattg tttgcaaaga aacaaccagg tcagcttcgg 240 
acaaggttag cagagatgga tagagatctg atggtaggaa agctggaaag agacctgtac 300 
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<210> 244 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 244 

agtaaatttt ttatgcatat tttattgcaa taaaaaatga 
gattttggat gcatctatgc cttgagaaat gaatggtttg 
gaataaataa ttatgttaat tcatataata tgttatatat 
cactgtcttc ctatgggtag ggctataatg tccagttctt 
gtctgaagtt aatccttgtt tgtcgtaatg ttattaattt 

<210> 245 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 245 

tagacataga aaacatacag taagaatatg gtattataat cttacgggac cactgtcaaa 60 

tacgcggtct gtctttgaaa agttgtaatg cggcgcatga ctataaatac ctagctggtt 120 

agcatttaca ttccttgcca gggagtttga aatttatact atagaaataa ctttaggttt 180 

taggtagagt taaagaggta aagcacatgt tgccacaacc caggaaagta tttttaagaa 240 

agattggatt ttcctacctt tagagatcta aaaaaaattt aatataaaaa atcattttgt 300 

<210> 246 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 246 

tggaatattt gctgtgaagg gagaaaggga gagaaaactc ttctgaggat catttgtctt 60 

ggtagtatag taaaaccaac cagctgaacc tttcaggcta caagagaacc cgggtcggta 120 

atgtcttttt aagaataatt tttaattgct tataacaagc atattttgtg gcatttgaac 180 

tatatttact gctccaatat ccgttatttt ccaaaggatt ttgtatcttt ttgaaaatgt 240 

ttacatcatc agatgatcca cagaattcac tttatgtgag atctcccgag agtttccatc 300 

<210> 247 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 247 

gtgttgctca gtgagcagac ccgactccag aaggacatca gtgaatgggc aaataggttt 60 

gaagactgtc agaaagaaga ggagacaaaa caacaacaac ttcaagtgct tcagaatgag 12 0 

attgaagaaa acaagctcaa actagtccaa caagaaatga tgtttcagag actccagaaa 180 

gagagagaaa gtgaagaaag caaattagaa accagtaaag tgacactgaa ggagcaacag 24 0 

caccagctgg aaaaggaatt aacagaccag aaaagcaaac tggaccaagt gctctcaaag 300 

<210> 248 
<211> 300 
<212> DNA 



aaacagtttc aatctaggag 60 

atgtaaatgc atggtagcaa 120 

agttttaaag aaaattctat 18 0 

tcagggatta agagggtagg 24 0 

attcaaccaa gacttaattg 300 
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<213> Homo sapiens 
<400> 248 

gagaggatca cttgagctta ggagttcaaa tccagcctga gccaacataa caagactttg 60 

tctctaaaca aaacagttat tgtttaaaga atctgaaatc ttcatcttta attcaggtag 120 

caatgaatcg agcccaagtt tgtttgatat ccagttccaa gtctggagag aggcatcttt 180 

atcttattaa agtatcgaga gacaaaatat cagacagcaa tgaccaagag tcagcaaatt 240 

gtgatgcaaa agggctatca aagggaggct ttttacagag aactaaggaa gagaaggagg 300 

<210> 249 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 249 

ctagcctggg caatatagta cgaccctgtc tttactaaaa atgcaaaaat taaccacgta 60 
tggtggctca cacctgtagt cctggctact gaggaggctg atgcaggaga atcatttgaa 120 
cccaggaggt caaggctgca gtgagctatg attgcaccac tgcaatccag cctggacaac 180 
acagtgagac cctgcctcac aaaaattata ttctgatttt ctgagtccat gaacacattg 24 0 

tccaaatgga tttttctagc tcctccaagt tacagatagt tccacgcaca cacagaactc 300 

<210> 250 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 250 

aggaaggtgg aggggcagga acaggacgga caggccccgg gctctggcac atcctgggga 60 

acaagggacc acaaggacgg gggcagtctc cagacttccc ctgggcgctt gaccccaggc 120 

cttgcagggg agagagccag ggcctccctc aggtctttgt tcatgctgtt ttccctgccg 180 

tggacaccct ttcccgctct ccgattctct aaatcctgcc ccatctccca gatcttgttc 240 

atgtccaagc ttttccagga agtcttagca gctcccacac cgcagagctc gagatgtctc 300 

<210> 251 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 251 

gaaggcagaa gtgtaaatga acatacagaa gaaggagaaa gcctgctgtg tttggcttgt 60 

tcagcagggt attatgaatt agcacaagta ttgcttgcta tgcatgctaa tgttgaagat 12 0 

cgagggaata aaggagacat aactcccctg atggcagctt ccagtggagg ttacttagat 180 

attgtgaaat tattacttct tcatgatgct gatgtcaact cccagtctgc aacaggaaac 24 0 

actgcgctaa cttatgcatg tgctggagga tttgttgaca ttgttaaagt gctccttaat 300 

<210> 252 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 252 
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gcacttctct ctcactggaa agagaactgt tctcctttct ctttcttctg cctattaagc 6 0 

ctctgctcct aaactcctca tgtgtgtctg tgtcctaaat tttcctggca tggcaggaca 12 0 

aaccccgggt atttaccaca gacaacaaaa ccgcttcact atgatgtatg catgctgcaa 18 0 

aggaagagac agaatcttgc tctatcaccc agctggagtg cagtggcacc attgcagctt 24 0 

actgcagcct caaactcctg gctcaaggga tccttcagct tcagcctcct ggttaactag 300 

<210> 253 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 253 

gtctgatgca ggagaattgc taaaacccag gagggagagg ttacattgag ccgagattgc 60 
gccactgcac tctagcctgg gcgacagagc aagactccgt ctcgaaagaa agaaagagaa 120 
aggaaattcc ccagggaagt acctcggctt atttcataaa caggtactga aggaagcaga 180 
ggcatgtgga ggacttcccc acctcgtgca gctatttggg ccgtggcatc tgaaatttct 24 0 

tatttcagag tcaccccttt gatgaccttg gcagtgaact gcagtcatct gtttaggcct 300 

<210> 254 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 254 

atgttacaga catgaaatat gaacagaatg ctaaaagaac ataaaagaat aagagctcct 60 
taaagattat aaataaatgg tgatgttaaa gtaatagcac cattggacga agctagggaa 120 
tcaacacttg acagaaagat acatattttt tttatacaaa ctacatatat ttgagcaatc 180 
aagtagtaga catagagaat tttcttttta tggaagtact ctaataagta aagggctgat 240 
agaattatat cagcattttc tagctcctgg ggaattatgc attgggcatc catggctgct 300 

<210> 255 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 255 

gctgcctgtg gcatagccac tgctgtacgt ttttggttgt tnttaagaaa ctcgatgaag 60 
aggggtgtca ttctgggctc ggggtggttg ccaatttttc accagaaagg gagccacccc 120 
ttgcaaccac ttctgtctcc gttagccccc cctctgccct cctccaagcc aaagcgtggc 180 
ctggcttttg tcttcccatt tagttttcct cttttaccct tccttttgtg cttaatttat 24 0 
taaaatagtt gctgtataat ttattttcat aaactataaa aaaatactaa atggttaaaa 300 

<210> 256 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 256 

acagtctcgg gtttcatatt ttgctgtttt tgatggacat ggaggaattc gagcctcaaa 60 
atttgctgca cagaatttgc atcaaaactt aatcagaaaa tttcctaaag gagatgtaat 120 
cagtgtagag aaaaccgtga agagatgcct tttggacact ttcaagcata ctgatgaaga 180 
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gttccttaaa caagcttcca gccagaagcc tgcctggaaa gatgggtcca ctgccacgtg 240 
tgttctggct gtagacaaca ttctttatat tgccaacctc ggagatagtc gggcaatctt 300 

<210> 257 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 257 

atagaactag gcactgattt gtttatattt atcctgctcg agacacatga tgtttcatgt 60 

atctgtggct ttttatagtt taaaataatt tctggaaaag tcatagtcat tatctcttta 120 

accgctccct ctcttccatt ctctttgttc tctcttcctc gaactcctgt tagtcatttg 180 

atcctccata tctctgaata tttttgtatt tcttttatta tttatttctt gtctctgcta 240 

cattttacat tgagtaaaag tgggatgtga cagtgggaaa tcattagtga cttagaaatt 300 

<210> 258 
<211> 285 
<212> DNA 

<213> Homo sapiens 
<400> 258 

tactctatta tattgtgcat gctcctgatt tagctgctct tggcatcatt ggtcgcagtg 60 
gaaccttgaa atgcatctgg ctagatttat gctcaaatca ttctcagtta gccttttagt 12 0 

gcctcttcaa aggttttttt ttgtatgttt tctattctta ataaaagctt aggattaatt 180 
agaaagaatc tgatatggtt atgtttcccc ttgtgtacgc tgacctcatt catacgtttt 240 
tcatagtcca gtggtctaaa cgctttcaag agcccagctc cttgg 285 

<210> 259 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 259 

gccttctctg gcctcaccaa ttaggtcaaa tgttccttag aatgtgttgt ggggcatggt 60 
ctctccctgt gaggacctgt ccagctggac ctccgccttc ctgcgactgt attggtgtct 12 0 

ttccctctca agcctatgag ctctgcaagg gcagggaccc tgtatgattt tgcctatcgt 180 
atgtcctcca gcccccagca cagcgcctgg tgtccagtga gagctcagca aatactttgt 240 
gagttaagga caggcggctg ggtagatgga tcgtctgcct agacagggca gttattcgct 300 

<210> 260 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 260 

gaaaagggag ccgcgcagcg cctacgggag tccggcggca gcagccggta ccggcaacca 60 
cgggcagctc tcagggaatc tccgtcgtga ggccagaggc tccagtcccc gcgagtccag 120 
atgcctgtcc agcctccaag caaagacaca gaagagatgg aagcagaggg tgattctgct 180 
gctgagatga atggggagga ggaagagagt gaggaggagc ggagcggcag ccagacagag 240 
tcagaagagg agagctccga gatggatgat gaggactatg agcgacgccg cagcgagtgt 300 
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<210> 261 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 261 

tttgctttca gtggttggct ttcactgaaa gaaagtgtaa aaaaagtcag aatttatagc 60 

tttcactatg tccaagacta ggactgggtt ataaagattt tcttttgtga aggaaaataa 120 

aagaaaattt gccactactg catttacttt actattgtaa acttaagatt cattccttag 180 

tctttggaat tttgatgtct caaaaccaga tgagtggaag tgctgaattt gcaaaataaa 240 

gctaagaatg cttaactctg cactttaagt tctactctga ccaaattgaa gatgagcaga 300 

<210> 262 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 262 

ttttttaaga gataaggtct tgctatgtta tctaggctgg cctaaacttc tgggctgaag 60 

tgatcctcct gtgtagctgg gactacaagc atgtgccacc aatgcctggc ttctcacact 12 0 

gttttgtaac atagatatgt gaagatgtgt attatagaat tgtttgtaat actgtagtgt 180 

tgtaggcaat gtgactgtct atagggaagt ggacaggtta tttgtggtaa atactcatgg 240 

aaaacggtca agcagttaaa agcaatcaat tatggtcacc cagcaatgca gataaatctt 300 

<210> 263 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 263 

agaacaggga gaagagagga agagggagct gcaggtgcca gaagagaaca gggcggactc 60 

tcaggacgaa aagagtcaaa cctttttggg aaaatcagag gaagtaactg gaaagcaaga 120 

agatcatggt ataaaggaga aaggggtccc agtcagcggg caggaggcga aagagccaga 180 

gagttgggat gggggcaggc tgggggcagt gggaagagcg aggagcaggg aagaggagaa 240 

tgagcatcat gggccttcaa tgcccgctct gatagcccct gaggactctc ctcactgtga 300 

<210> 264 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 264 

ttaaaggtag ttttagaagg aagtacaaat tggctttcat cttgcaaaca atcgtttttt 60 

acttcattat cttaatttgc tttgtcactc ataaaaagga aaccatacct gagttgtaga 120 

caatgaggaa acacttgagg cttctgctgt gtgttctttt gttattgttg ttattgttgt 180 

tactcagtaa cttgaatatt gtttaatgtg ttgtaagacg tagagtttat ctcaagctgt 240 

taaaaatggt aatgtacaaa tgtgaataga cacttatcta tataatatgg gtaagttttg 300 



<210> 265 
<211> 300 
<212> DNA 
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<213> Homo sapiens 
<400> 265 

caggaaagtc ttcctagagg taatttttaa gctgattgtt ttagaattag tagaagcttg 60 

ccagatggaa aagtccaggc aaagtgtaac atgaatggga aaggccacag tctagaaatg 120 

gcagagtgtg ttcctagttt gtttgtttgt ttgtttgtac ctgccttgtt ccaggaagga 180 

tttaatgtgg tttatattcc agtcctttaa tgctggaagg gctgagatga gactgaaaga 240 

tgggcaggaa gtatatcatc acaagctttg tgtttgatgt taatgtgtat gatttttata 300 

<210> 266 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 266 

tgtgccacca cacccagctc attattatta ttattattat tattattttg agacgaagtt 60 
tcactcttat cccccaggct ggagtgcaat ggtgcgatac tggctcactg caacctctgc 120 
ctcctgggtt caagcggttc tcctgccttg gcaggcacct gtagtgtcag ctactcgaag 180 
gctgaggtgg gagaatcgct tgaacctggg gggcggagat tgcaatggtg tggtctcggc 240 
tcactgcact cgagcctggc gacagagcaa gactctgtct caaaaaaaaa aaaaaaaaan 300 

<210> 267 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 267 

atataactct ggaggtcagg acataggaga tattgattca ggacttgcca gagtatggtc 60 
ttggggtgtg ccctgatatt acaaacaggg atcttagtgg ctaggtgatg aggccatggc 120 
aaatgtagat ggaccaagat caatttgcct ttctagatga ggttttctag gtgaaatgtt 180 
tttgaaacta ttttgtagcc tagtataatt tataaaagta gagagaaact ataaatataa 240 
atttggaagg ggttagctaa aaggagaaaa cagcagaatc ttcatatata tagaaatgga 300 

<210> 268 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 268 

cctacttatt ggatgttggc tctttggtgt catggagatg gctttactgt aggtttgtgt 60 
gtgttgcatt acttttcatt gggattgaac tgagaaataa caaacaagct ttaagtggga 120 
aattaaaaaa aagaagtaac ctatgtagat ccaaacttaa aatgtgagaa attattgaaa 180 
tttcattttc tacaaacttg aaattagcct gctaattgta aagttgtttt aataatgctg 240 
acaaatgtca gttacgtttg caaaggagtg tatggttcta ggtatttgcc tactgttacc 300 

<210> 269 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 269 
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cctacttatt ggatgttggc tctttggtgt catggagatg gctttactgt aggtttgttg 6 0 

tgttgcatta cttttcattg ggattgaact gagaaataac aaacaagctt taagtgggaa 120 

attaaaaaaa agaagtaacc tatgtagatc caaacttaaa atgtgagaaa ttattgaaat 180 

ttcattttct acaaacttga aattagcctg ctaattgtaa agttgtttta ataatgctga 24 0 

caaatgtcag ttacgtttgc aaaggagtgt atggttctag gtatttgcct actgttaacc 300 

<210> 270 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 270 

cctacttatt ggatgttggc tctttggtgt catggagatg gctttactgt aggtttgttg 60 

tgttgcatta cttttcattg ggattgaact gagaaataac aaacaagctt taagtgggaa 120 

attaaaaaaa agaagtaacc tatgtagatc caaacttaaa atgtgagaaa ttattgaaat 18 0 

ttcattttct acaaacttga aattagcctg ctaattgtaa agttgtttta ataatgctga 24 0 

caaatgtcag ttacgtttgc aaaggagtgt atggttctag gtatttgcct actgttaacc 3 00 

<210> 271 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 271 

ccacatttaa gtgagatatg ggaaggagga gcagattgtt tttgaaggga ggaagagcag 60 

ttacttaggg tcaaattaag ttgtaaaatc ccccccggga ttttgtatgt aagtcaaagt 120 

gaattgtatt tggaagaaga actggggagc ccacctctgg tatttttttt atgtccctca 180 

tatggacaaa taaacctctg gtattaaatg aattttcttt tgggggattc tatatattcg 24 0 

ggatttcaac caccaaccta tctggttttt cccgctgaaa tgttgggtga tggaatcagg 300 

<210> 272 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 272 

gaacgcttcc attttatacc tgtgtctagt tagtttctgc ctatctatcc aagaagcttt 60 

tatcaagggt ccaccatgtg ccagccactg aagtagatat aaatacaagg atgtgtaagg 120 

tatggatgat ggtatacgaa ctgtcatctt actggatttg tccgctctgt taaagatacg 180 

gttccgaaaa ctttttaaag ccctagagag ggctttaagg caatgtagca tcatatatag 240 

aggcatcaac ctgttcatat ctttctattt aacagaactg tgcacctggg cacaagggtg 300 

<210> 273 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 273 

gaatggcgtg aacccgggag gcagatggtc ttaaagtggg gagacccggg ttacaggcct 60 

gactgcatca ctaactcgct gtgtgtccct gggcaagtca gtgcagtgca gtagcctctc 120 

cgtctccgac tgaggagcaa agccctcggc tcaagatcct cacctacttc acagggattt 180 
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gaaatagtgc agtcaacagg aaaagaaaag cgctatagaa atgctcgacg ctatcacttg 240 
gggcccacgt ggaagtatca acgtataaat tggcccaggc agacagaagg atgcagggga 300 

<210> 274 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 274 

ggaaccaggg gctgcagaac cagcccctcc ccaatgagga ccccctctgg acgcccctcc 60 

ccatggagaa caccaggagc cacagacccc agaccacagg agcacacagg ggagggcacg 120 

gggcggccgg ggcagggtgt ctgctgcctc gtttatggga tttgctccgc gtctagcaca 180 

ctgctgcctg cagtgctcct gtcccctgca gtggctactc tgggcctacg ggcctaatcc 240 

tggttggcat gaaaatgtcc tgaggctact gtgacaaatt tccacaagct gagtggctta 300 

<210> 275 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 275 

ctttgggaag cagaggtggc aggatcattc cagcccagga gttcaagacc agcctgggca 60 

acacagtgag tgagaccctg tctctattta agaaaaaata attaagaaat tttattaaaa 12 0 

aagaagaatc aggaaaccaa gtccaaccca actaaacctc aaatgaacca gcccctaaca 180 

cagatgaggg gatttgggac tgataagctc tgtgctgtgt ccatggcccg tcatttatca 240 

aggctgcagc tttgtaaatg tggctatttt tatgttgtgt atagtttcta teat t tat tt 300 

<210> 276 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 276 

tttgtatttt tagtagagac agggtttctt catgttggtc aggctggtct caaactccta 60 

acctegtgat ccgcctgcct cgacctccca aagtgctggg attacaggca tgagccacca 120 

tgcccagcca aagatcattt ttttatatag acttcagccc tttgtaaata ttgtaactgg 180 

ggagtataga gtagaaaaaa agtatagtta aaacatttgt tctacaaatt aacctttaaa 24 0 

aatataatta ctgctaaaaa tagagtgctg ttacacttaa ggaaaattag tgccattttg 300 

<210> 277 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 277 

ctcacacagc atgtgtcaga tccatggggt aggagtegge cagagacttg gtaacagaca 60 

gattgctgga tcccacccct agactctctg attcagttag tttggggtaa ggegcaagae 120 

tgaatttttc acaagtttcc cagtggtgct gatacttctg gtccaggaac ttagtgggag 180 

agaacgacta atctagacca tttcacttca cattctgagc ttcttgtaca ctgtcacact 24 0 

gcatcctttt aacaatgeat tccctatcct attgeaatae tgacatctca tcaatatttt 300 
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<210> 278 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 278 

ctgacaactt gattgggttc tccttcaggt ttgaagcgcc ctcgagaagt gtctaaagga 60 

gacagttgat agccaaacaa cagttttgga ttcactgact gattatgaaa gaagcagtag 120 

actggtatca agaatcagtc agcaaggagg ccctcaccag acgccagtgc catgttcttg 180 

gacttctcag cctccatatt catgaactaa gtttttggaa tccttaggct tccacgtgtg 240 

gaaagcctga gctaacctac tggaggatga gccatcacct ggagcagatt caggccatcc 3 00 

<210> 279 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 279 

ggtaaaccta tttatataat agaaggatga ttataaacat ttaataaatt atatcaaata 60 
gatattatat attaaatggg cagataatag aaatctgtcc aagcaaaact ctggataatt 120 
tttatgttgc cttatttttt gttttctgtg aactccaaga aaaatgagat accagtttgg 180 
aacagatgta atattgctga tttaacagtt tagggatact ccccaagttc aataattttg 24 0 

ccaagataca aatttaaatg gaacctttta tgaagcttca tagtgtgtga agaacttacc 300 

<210> 280 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 280 

ataactgctt gcgaagatgt agtttctctt tggaaagcta cacacacgag attatacaca 60 
tcaggcactg gaactatctg taatactgga acctctgcga agtgccaggt ataaagtttt 120 
tcccactgcc aagcatccag agctttggga aatttggaaa tcagagagat cagggcattg 180 
ttttgttcct ctgatgatga aagtgaaaag caagtactac tgaagtctgg aaatataaaa 240 
gctgtgcttg gcctgacaaa gaggaggcta gttagtagca gtgggaccct ttctgatcaa 300 

<210> 281 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 281 

caccatcgaa tatttttatt tattttgaga gacagactct gtcacccagg ctagtcttaa 60 

actgttggtg aatcttaagt gattctccca cctcagcctc ccaaagtgct gggattacag 120 

gcatgagcca ctacccttgg ctgtgatcaa gtatttagtc tgttgttaaa tgtttactaa 180 

atagtctgaa gtagagaaaa tagcacccaa tctaaaataa ggtgaggtct agtcacttat 240 

ttaaatctac attttaagct atagtttact attagtttaa actttaagac aggtaatgtt 300 

<210> 282 
<211> 300 
<212> DNA 
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<213> Homo sapiens 



<400> 282 

gcaaccttcg cctcctgggt tcaagtgatt ctcctccctc agcatcccaa gtagctggga 60 

ctacaggcac gtgccaccac acccagctaa tttttgcatt tttagtagag gcagggtttc 120 

atcatgttgg ccaggctggt ctcaaactcc tgatctcaag taatctgccc actttggcct 180 

cccaaagtgc tggcattaca ggaatggagc caccgcgccc agcctgattt ctttttttag 240 

gtcttgtcag gaaagatatt gattcttttg attcgtgaac atggtttttg gtcgtcttta 300 

<210> 283 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 283 

cccaggtagc tgagactacc cacaccttgg tcccagctac ttgggaggct gaggtgggaa 60 

aatcactttg cccaggaatt caaggccgca gtgagctatg attgcaccac tgcactccag 12 0 

gcaacagagt gagaccctgt cttaaaaaaa gaagggagaa agtgtcagat ggtgatgagg 18 0 

tctggggggg aaatagagaa tggggatcag gagtgtggat ggtggtattc cctcaccaag 24 0 

aggtgacatg tgagcaggga gctgggaggt gagggtgtga cccgtgtgga aatcagggaa 300 



<210> 284 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 284 

ggtgtcctcc ccagtgcgcc gcgatttttg tgtccaagcc ccagagtccc tctgagacca 60 

acccccagcc agcacagact tcctgccttc ccagctcgga agcgccctcg agaagtgtct 120 

aaaggagaca gttgatagcc aaacaacagt tttggattca ctgactgatt atgaaagaag 180 

cagtagactg gtatcaagaa tcagtcaggt ttttggaatc cttaggcttc cacgtgtgga 240 

aagcctgagc taacctactg gaggatgagc catcacctgg agcagattca ggccatccta 300 

<210> 285 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 285 

aattccgttg ctgtcgggcc gccatgtcat tctggagaga gacagagtaa aacaaagaag 60 

gtgatgggta aagcgcagtc gcctgctata tattgtctat ttttggtttt tcacttacct 120 

tttatattta tgtcttttat gtacaacagg attataagta gcttgagtcc agtgaatata 180 

ccatttcatt ttgctatcct tcactgcact tagcttagag gaaataatca cagcttatta 24 0 

ttgattaatt aattaattaa tagatgaatg gtgaacacat gactatcatc ccaagaaatg 300 



<210> 286 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 286 
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agccaatgag gcttttgcct gccagcagtg gacccaagcc attcagcttt acagcaaggc 60 

tgtgcagagg gcccctcaca atgccatgct ttatggaaac cgagcagcag cctacatgaa 120 

gcgcaagtgg gatggtgacc actatgatgc cctgagggac tgcctcaagg ccatctccct 180 

aaacccatgc cacctgaagg cacactttcg cctggcccgc tgcctctttg agctcaagta 24 0 

tgtggctgaa gccctggagt gcctggacga cttcaaaggg aaatttccgg agcaggccca 300 

<210> 287 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 287 

gggtgacaga gtgaaactcg tatctccaaa caaacaaaca aaaagtcctt aaacatatgt 60 

gaacaaaaat tttgtgatgg aaggattcta gttaatgagt attgcatcaa gatttacatc 120 

tttcttacta aggaaaagag ttaataaaaa ttgttcttta ttttacaggc agttactgag 180 

gctcttccca gatctcagta aacagccact cagccttgaa aatggagtgt tgttgtttct 240 

aaacatatat ttatgtcatt tattaagtac agttcactta aataacataa gtagattttc 300 

<210> 288 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 288 

accactaaca gcatctactt gactactgat actttgatca tggagttagg gcatgccact 60 

tgatagaaat ttgaagagca attatatttt tcaaaaagag ttttgaataa tgttaagata 120 

gattgcaaca tgactatcaa ttcttccctt cccatcaaag gagagagtcc gtttatccag 180 

cctttgaatc ttgattattc aagtgacttg cttcacccaa tgtaacatta ataagcacaa 24 0 

tacaagcaga ggcttgccaa gaacttggtt tgtttctaat gcttagaaga agaatggtgt 300 

<210> 289 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 289 

tgtccttatc tgaaattcag cgatcttcat gaataagcat ttctctgatt gtggnatatg 60 
cctttaattt tatttctaga gtgacaaatt tttggttttg acagtttttt tctagcttta 120 
tagtttcttc ttggggagag aatatgtcaa cctcactcca tcatgctgaa gtaaatcttc 180 
atctcttaat tttatctctc aaaaatatcc taaggattcc ctctggagcc tgataagtaa 240 
ttgcagtatc tggtttctat ggttggatga ttcaggattc caggaataat agttactttt 300 

<210> 290 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 290 

ggaaccatga gaaccgaagc tagaattgct attgaattac tttattttct cttcccttat 60 
tgggtagaga tacatcatta ctggcctcag gggtttaccc aaagaaaggg tatttttgag 120 
caaataatgt gatttcctgg ctattttgtt gggggcttaa gatttttttt tttcaaatgc 180 
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atttttagtc actaaaaatt aactgtcgta ccatctagaa ctatactgtc cagtaccata 240 
gcctctagcc gtatgtagct atttgtatta agattaatgg aaattttaaa tccagttcct 300 

<210> 291 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 291 

tatgatttta tttttggcct aatataggaa tgtttaaaaa aggcttttct atgaaaatta 60 

gaaatttata cttgaaatta aaagtctaca agggggagga ccttaaagct • aagctaccag 120 

taagacaatg aataattcag aagagaacac tattctttta ctgactgagt gcccaagatg 18 0 

ccaatttcca tgaagtcttg atttatatat atgtacacat gttatgcaca tacatgtttg 240 

ttttctaaca gttattcttt aagcttttga gataatttta gacttacaga agagttggaa 3 00 

<210> 292 
<211> 278 
<212> DNA 
<213> Homo sapiens 

<400> 292 

cccagaccta tggagtcaga cagtaggttt gaggcccagc aatctatggt ttaacaagcc 60 

atccaggtgt ttctgatgca cagtgaaatt ggggtaccac tggtattagg tttggtatgg 120 

caactttttc atcacttgtt ttatgtagtt gtctgatcaa ttgtgaaaac ataatgaatg 180 

ttggaaatgg aacagtaaaa taacgaaagc caactttttt tttttttttn nnnnnnnnnn 240 

nntgntttnn cccccaggnt gnanngcagg gncccaat 278 

<210> 293 
<211> 297 
<212> DNA 
<213> Homo sapiens 

<400> 293 

ggaaggcagt gggaggagag gaccaagtct caaactccag aagccccacc tccctgagct 60 

cagctcctct gccaagcccc ctcagcgcga agtcctcgtc cagagaaggc aacggcgaga 120 

aacaaatcca acatcctggg ctgctttttc cttcccccac tttttaaaag tttggtgtcc 18 0 

aagtcacttg acaaacccag accctaacaa tgatattttg tgtagaattc tgggatcaaa 24 0 

atataatttc aaaaataata tattttctga catcccccaa aaaaaaaaaa aaaaaaa 297 

<210> 294 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 294 

ggaacagttt gagcaaaggc tctcaagtaa tagggtgtct gacttgttca tttttgaaag 60 

tagaactaat aggatttctt attggaacgt agggtgtaag agaaaagagg agtcaaaaag 120 

agccacaaga tttttggtct cagcaattag aaggatagaa ttgacattta ctgagatttt 180 

tgtttttgtt tttgagacgg agtttcgcta ttgttgccca agctggcgtg caatggcgtg 240 

atctcggctc agtgcaacct ccacctccca gattcaagcg attctcctgc ctcagcctcc 300 
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<210> 295 
<211> 299 
<212> DNA 

<213> Homo sapiens 
<400> 295 

gtaatattga tgtgattgtt gtcgcttgag aaaaaaaggc aacagctgat tctttcaaca 60 
actgtcacag aatggctggg ctgagaacgc tgcccagggc cctgcagctg gcgggagnnn 120 
nnnnnnnnnn nnnnngtgcn tgctgcaaca tntggttana tngtatcctt ccctanagnt 180 
gctacnnctt nnatcccctt gtnaatatgt tgagntnnct tngcnttcnn gntnntccng 240 
ntnnttgaca cntatgnaan ttntntngtc tngctctgct ngatnncttn nangctgcc 299 

<210> 296 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 296 

gcagaacctt ttcccctcta ctcttgtcta aaagttctgt gtggcacaca 
ctactcaatc tgacttagta aaaccatgct gaaaaatttt ggtctaaaaa 
ccagcaccca tgaaataaaa gattcatctg taattgggat tcaaagggat 
tggtcatact cataaatagc actaaagtgt tataacattt tcatttacct 
ccttcatttt aacttaataa aaatcttgga ttgatattct tttttttttt 

<210> 297 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 297 

gctaggatta caggtgtgag ccaccatgcc cagccactta tctttaaagg attaagttta 60 

tgtttcctac tatgggaaac catcccaccc caaacttgat gaccgcatta tgtgctttta 120 

tagaacatgg cacttctcca ggatagcatt tattctgttt tgtaagtgtg aatgtaatta 180 

ccctacacac agcatacaca taatcttcat attctttgcc ttgtcttgtg aaggcaaggg 240 

ccatgtctat cttattcgtc attagattcc cacatccaac atagtcctgg ggacagcacc 300 

<210> 298 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 298 

ccaaatctgc ctagagattg agttcacagt gtatgttctg ggggcgctgg 
ggtccagtct ccagcctgca ggcgtgcaca ctggggtgga cgatgggtgg 
gtacacattt gggtggcccc ggcccctata ccccagtgtt ctctttgatc 
acagagggag ccttgtgtac acgcctccaa agtggagctg ggaggtagaa 
ctggtggttc tactgaccca actgggggca aaggtttgaa gacacagcct 

<210> 299 
<211> 300 
<212> DNA 
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gagatgcgac 60 

ggacccatac 120 

taaattcctt 180 

atttttagtt 240 

ttttgggacg 300 



tgcagtcagc 60 

ccccgcaggt 120 

cagtcccgaa 180 

ggggaggaca 240 

cccccgccag 300 
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<213> Homo sapiens 
<400> 299 

ctccattgtg aagatccagg catttttccg agccaggaaa gcccaagatg actacaggat 60 

attagtgcat gcaccccacc ctcctctcag tgtggtacgc agatttgccc atctcttgaa 120 

tcaaagccag caagacttct ctgctgctgt gatctgcaca ccctccaacc tgggcaggga 180 

ctggggggat gcagtgtgtg ttagtgccca tgtggcattg tggcactgtt gccccccatg 240 

gcggcatggg caagatgacc ttccattagc ttcaagtctt gttctcttgt ctgtggtctg 300 

<210> 300 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 300 

agcaattcca ctcctagctc cacccacagg aattgaaagc aaagacgcaa acagatgcct 60 

gtgcaccaaa gttcacggca gcatccttcg ccatagtggc agcatccgtc gtcacagcgg 120 

catcatcctt catcatagcg gcagcatccg tcgtcacagc ggcagcatcc ttcgccacag 180 

cggcagcatc tgtcgtcaca gcggcagcat ccttcgccaa agcggcagca tccttcgtca 240 

tagcggcagc atcctttgcc atagcggcaa ggtggaaacc ctgtccatcc actgaggcgt 3 00 

<210> 301 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 301 

tcacagatat gaaagttcag tcagaggggc tgggccgaca tctgtgcttt tccctgcagg 60 

atttttagga tcagtgagac ggtgtgtatt tggaagcatt tcaaatgtgt taccatcgtg 120 

ttacttccgt gggcacctgg tgttattggt tggactagtc aggattctcc agagcagcag 180 

aagcaatggg atgtgtgtgc atgtgtttgt gcagagacag aaagagagat tttaaggaac 240 

tggcttatgc agttgtgggg gctagcaagt ctgaaatttg cagggcgggc cagcaagctg 300 

<210> 302 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 302 

tcaccaggaa tacagtgaca ttaaaagtgt gatatggttt agctgtgccc ccacccacat 60 

ttcaacttga actgtatcta tctcccagaa ttcccacatg ttgtgggagg gacccagggg 12 0 

gaggtaactg aatcatgggg gctggtcttt cccgtgctat tctcgtgatg gtgaagtctc 180 

acgagatctg atgggtttat caggggtttc cacttttgtt tcttcatttt ctcttgccac 24 0 

cagcatgtaa gaagtgcctt tggtctccta ccatgattct gaggcctccc tagccatggg 300 

<210> 303 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 303 
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gccctctcca ttttctgagg aggtgatatt tgggcagatt acaaactgag gaagcatact 60 

ggatagacat caggatgaag agaataggca gttgaaaagt cccagaaagg ggagtgtgct 120 

tagagtgttt gaggaacagc aaggaagcaa gcccttgttg aaacagattg agcaaggtag 180 

aaagtggtaa aagatgaagt taaagaggta gctgagagcc agatcatgta aagccttggt 240 

aaggactgac ttttatttta agagggttag gaagacattg gtaggttttg actctggctt 3 00 

<210> 304 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 304 

aacaggaata tggaaagaaa ctcagagccg agttagtgga aaagtggaaa gcagagagag 60 

aggctcggct ggcaagagga gaaaaggaag aggaggagga agaggaggaa gagatcaaca 120 

tctatgcagt caccgaggag gagtcggacg aggaaggcag ccaggagaaa ggaggggacg 180 

acagccagca gaagttcatt gctcacgtcc ctgttccctc gcagcaagag attgaggagg 240 

cactggtgcg aaggaagaaa atggaactcc tccagaagta tgcaagcgag accctgcagg 300 

<210> 305 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 305 

aatagtagaa agggtcccca ttcctgctca gcaccgcacc tctctacccc cccacagaca 60 
cacatgcaga cacacacatg cagacaacac gcagacacac acatgcaggc actcacatgc 120 
aggcccatgc acacacacgt gcacacacat gcagagacat gcagacacgc aggcacacat 180 
gcacacatgc aaagacacgc atgcaggcac acgcagacgc acacagagac acacatgcag 240 
atacacatgc acacacacat acacacactg gcccctgttt ttctgtggtg tcactgggtg 300 

<210> 306 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 306 

cagcaaagac tttatttttg tacagaagat ggtgaagtcc aagacggtgg ctcagtgcgt 60 

ggagtactac tacacgtgga aaaagatcat gcggctgggg cggaaacacc ggacacgcct 120 

ggcagaaatc atcgacgatt gtgtgacaag tgaagaagaa gaagagttag aggaggagga 180 

ggaggaggac ccggaagaag ataggaaatc cacaaaagaa gaagggagtg aggtgccgaa 24 0 

gtccccggag ccaccacccg tccccgtcct ggctcccacg gaggggccgc ccctgcaggc 300 

<210> 307 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 307 

gctgcttctg gctggggggt ccttggcctt catcctgctg agggtgagga ggaggaggaa 60 
gagccctgga ggagcaggag gaggagccag tggcgacggg ggattctacg atccgaaagc 120 
tcaggtgttg ggaaatgggg accccgtctt ctggacacca gtagtccctg gtcccatgga 180 
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accagatggc aaggatgagg aggaggagga ggaggannnn rmnnnnnnna ntggccttnt 24 0 

gtggcctcca ccagcagctn tnnannatga catggagtcc caactgnacg nctccctcat 300 

<210> 308 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 308 

agttaagagt gtgaacccta gatttgccat ctgaaagtca tgtgtccttc agtgatgcat 60 

ttaacctctc tgtgcctcaa atttctccct ctggggtatg ttaggagtat acaaattaac 120 

acatgtaaag tgcttagaat agattggtac tgttaaatat gagctaacgt cacatttgat 180 

atttttttaa aaagaaaaaa tcattatgga gtctcagtcc tagagattct gattcattaa 24 0 

ttctgcttct cggcaaggag cgatttgctg gtgtagacat tccgggtccg tgtaaagggt 300 

<210> 309 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 309 

ccaacaccca gttctcactc tgtcatccag gctggtgtgc agtggtgcaa tgtgggctta 60 
ctgcagcctt gacctccagg acaagtgatc tcccacctca gcctccggaa tagctgggac 120 
tacagctcaa caacgcccct ctgaaagtag gactcttgga aatgaacctt gttgggagta 180 
aagctgaacc ttcacctctc ctttccagga ttctactcca ttcatacggc ctcacactga 24 0 

attaatggtt ctagcagcca catcactttg ttacccaatt gatctagtag taaagtcttc 300 

<210> 310 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 310 

aggaaacacc cccttataaa accatcatat caggctgggt gatctgacag agctagacac 60 
tgtcaaacaa acaaacaaac aaacaaaaaa accccatcac atctcatgag acttatttac 120 
tatcatgaga gcagctcagg aaacacccac tcccgtgatt cagttacatc ccactgggtc 180 
tgtcccacaa attgtgggag ctacaattca agatgaggtt tgggtgggga cacagccaaa 240 
ccctatcacc atgtaaaata atatctaatt tgtagagatt aaagaacaag ataacttaaa 300 

<210> 311 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 311 

ttntgcagat ctccagcaca agcctctgct agttgatctc acggtagaag aaggtcaaag 60 

attaaaggtt atgtttggtt cacacactgg tttccatgta attgatgttg attcaggaaa 120 

ctcttatgat atctacatac catctcatat tcagggcaat atcactcctc atgctattgt 180 

catcttgcct aaaacagatg gaatggaaat gcntgtttgc tatgaggatg anggggtgna 240 

tgtaaacacc tatggccgga taacnaagga tgtggtgctc caatggggag aaatgcccac 300 
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<210> 312 
<211> 275 
<212> DNA 

<213> Homo sapiens 
<400> 312 

cctccctgga tgtgcagaca tggaggagga cagaaggccc agctcagtgg cccccgctcc 60 
ccacccccca cgcccgaaca gcaggggcag aggcagnnnn nnnnnntaag ngtgtnnaan 120 
tntnnatttn ttcctntttt ttttnnnttn aaatatnntg nnnttttttn ntantantta 180 
ttatnntntn nttattannn tntttttcnt ntnttacttt gttnttgatt ttanncnttt 24 0 

natntttttt ttgttcttct nttntattnn atctt 275 

<210> 313 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 313 

tcctgtcttc ttgcccaaat gttgcatttt ccaagaccac tctggcctgc catgccaccc 60 
attctgtgcc tataaaaacc ctgagacccc agcgggcaca cacacaagcg gctggacgtc 120 
aagaggaaca cactggcaga agaacacatc gaaagacgct ggcaggccat tgatggtgga 180 
acgattcgga cgccaaggga aattcggcca aggacagtag gagatcccgg ctgctgagca 240 
gccagactcc agaggaagac taccttccca tgctatcccc cttctggctc cccagccatc 300 

<210> 314 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 314 

ataagggtgg ggccttaatt cagtagaatt ggtggcctcc taagcagagg aagagagatt 60 
tttctttctc tctctgccat gtgaagacag tgaggagtcg gccgtctgca agccaagaag 120 
agcccttatc aggaacagac ttggctagca ccttcatcgt ggacctccag cctccagaat 180 
tgcaagaaaa tacatttccg tcgttgaaac cacccagtct gtggtatttt gttatggcag 240 
cccaggcaga ctaatacgtg aagcctgctc taaatagata aaataagaaa ttactacaga 300 

<210> 315 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 315 

gtctcagtgt ggcctgtggg gctggtgggc ggcctgcgat tcgagggccc tcaggtacag 60 

gacggccggg tagtgggctt ccacacagca tgggagccca gcaggccctt ccctgtggat 120 

atggctggat ttgccgtggc cctgcccttg ctgttagata agcccaatgc ccaatttgat 180 

tccaccgctc cccggggcca cctggagagc agtcttctga gccaccttgt ggatcccaag 240 

gacctggagc cacgggctgc caactgcact cgggtactgg tgtggcatac tcggacagag 300 

<210> 316 
<211> 300 
<212> DNA 
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<213> Homo sapiens 



<400> 316 



gaaatgcctc tatgtaggtg aagtgttctc 
attttcccca caaaagaaac acttaacaga 
aggggaatca tgattataag tccttcagcc 
aatttaaaat aattttgaac gaatttattt 
cattaaatca agacaaatca tgtgcttgag 



tctgcatgca acaggaaaaa ttaatataat 
ggcaagtgca atttataaat ttatatctaa 
cttggactct aaattgaggg gattaaaaag 
tcccctcagt ttttgagggc attaaaaagg 
aaaaataaaa ttaatgaaaa cacagcactt 



60 
120 
180 
240 
300 



<210> 317 
<211> 295 
<212> DNA 
<213> Homo sapiens 

<400> 317 

acactgtccc actccatcac ccaggctgga gtccagtggt gtgatcatag ctcgctgcat 60 

cctccagttc ctgggttcaa gccatccctc ctgcctcagc ctccccagta gctggaacta 120 

caggtgtgtg ccatcacacc tggctttaca tttttctgtg gggacttact atgttgccca 180 

ggccggcctc aaactcctga gctcaagtga tcctctgcct cagcctccag agtatctggg 240 

attacatatg tcggctaccg tgtctggccg ttcacatctt tggccactat ttgct 2 95 

<210> 318 
<211> 261 
<212> DNA 
<213> Homo sapiens 

<400> 318 

cctgaatata aagaggagga ggaagaccaa gacatacagg gagaaatcag tcatcctgat 60 

ggaaaggtgg aaaaggttta taagaatggg tgccgtgtta tactgtttcc caatggaact 120 

cgaaaggaag tgagtgcaga tgggaagacc atcactgtca ctttctttaa tggtgacgtg 180 

aagcaggtca tgccgaccca agaannnnnn nnnnnnnnnn nnntngccnn aacnnttcac 240 

caaatncccc gggggggctt g 261 

<210> 319 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 319 

gggacctctg cccaagaaag cctgggtatt gaccaaggtt tcccccccac tgagacagcc 60 

tgagatatgg cctcatggga agggaaagac ctgactgtcc cccagcccga cacctgtaaa 120 

gggtcggtgc tgaggaggaa tagtgaagga gggaggcctc tttgcagttg agataagagg 180 

aaggcttctg tctcctgctt gtccctggta atggaatgtc tcggtgtaaa gctgaccatt 240 

cccattcgtt ctattctgag ataggagaaa accgccctgt ggctggaggt gagatatgct .300 

<210> 320 
<211> 289 
<212> DNA 
<213> Homo sapiens 

<400> 320 
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caccttgcct ggccaagggg ctagacctcc caggctaagc ctcagattca gtgcaggaca 60 

caagctcatg cccccgtctt gccagtgaca cttgaagcct cccgacttcc acagagtgct 120 

tcaggacaca ttttgagtgg tattttcttt tctttttttc ttcttttttt ttttnnnnnn 180 

nnnnnntngt tntgtnnccc aggctgnann gcaggggcct gatntnggnt aantgnaacc 240 

ttngcctccn aggttaaagc nattttttng cctaancctc naaagtacc 289 

<210> 321 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 321 

gaaagaccga gatagagaga gagacagaga cagagagcga gaccgtgatc gggacagaga 60 
aagagaacgc accagagaga gagagaggga gcgtgatcac agtcctacac caagtgtttt 120 
caacagcgat gaagaacgat acagatacag ggaatatgca gaaagaggtt atgagcgtca 180 
cagagcaagt cgagaaaaag aagaacgaca tagagaaaga cgacacaggg agaaagagga 24 0 

aaccagacat aagtcttctc gaagtaatag tagacgtcgc catgaaagtg aagaaggaga 300 

<210> 322 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 322 

cgccctttaa ctgcagttct gctctatttt cttttctctc tctggagctg agagtcagag 6 0 

ggcccttctc ctcctccttt cagcccccaa cactaagctg atggattgat aaatacctca 120 

gcccctcgcc ttcctcaacc cacctggcaa gtcttcttag gatctgatcc cagttttctg 180 

gaagcaatcc taccccagcc caagcttccc aagagtcgag ccttaatcct tctcacttct 240 

cagtgtcaga gcagaaatga atcctggggt tgactgtgtc cattcgggtt attagcagct 300 

<210> 323 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 323 

agattatgag catgtagaag atgaaacttt tcctcctttc ccacctccag cctctccaga 60 

gagacaagat ggtgaaggaa ctgagcctga tgaagagtca ggaaatggag cacctgttcc 120 

tgtacctcca aagagaacag ttaaaagaaa tatacccaag ctggatgctc agagattaat 180 

ttcagagaga ggacttccag ccttaaggca tgtatttgat aaggcaaaat tcaaaggtaa 240 

aggtcatgag gctgaagact tgaagatgct aatcagacac atggagcact gggcacatag 300 

<210> 324 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 324 

gtctgagaag tcaaggatcg gggtgctggc ctattcagtt cctggtaagg gctgtcttcc 60 

tggcttgcag ttgaactact tcttgctgtg tcttcacaag catgccccca tcctgtgccg 120 

ataagaactc cagaccccaa actcagctca tacacacacg gaagagagaa gcatctgaac 18 0 
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atcaagaaga gaagaagctg ctggacatca gaaactgtga aaggagagga gtttggctga 
gctccagggg aagactgcct gcacattcta tccccttttc agttccccat cctgctgtca 



240 
300 



<210> 325 
<211> 283 
<212> DNA 
<213> Homo sapiens 

<400> 325 

gtccgaagaa aaagactgtg gtggcggaga tgctctctcc aatggcatca agaaacacag 60 

aacaagtttg ccttctccta tgttttccag aaatgacttc agtatctgga gcatcctcag 120 

aaaatgtatt ggaatggaac tatccaagat cacgatgcca gttatattta atgagcctct 180 

gagcttccta cagcgcctaa ctgaatacat ggagcatact tacctcgtcc acaaggccag 240 

ttcactctct gatcctgtgg aaaggatgcn ngtgtgtagc tgc 283 

<210> 326 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 326 

atgacatcct cattatccac actgcaaagc caaccatccc tatgatgggt tcattgtgga 60 

tcatgactta gtgggtcaag agtttggaag tggctcagct gggcggttct tctgctccat 12 0 

gtggctgcca gatggtaccc tgctggtggg cagtctggtc tagagggtcc atgatggctt 18 0 

tactcacatg cctggcatct tgacagggac agctggaagg caaggttcag ctgggactgt 240 

ccacagagct cctccctgtg gcctttccag catggtggtc tcagggtagc tggacttcct 300 

<210> 327 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 327 

ggtagactgg ctagggatcc tggacccagg gttccacgta gcaacacctg ctgagttctc 60 

tgggttttct tcctgcctca tgtagcccag acttggagct gaagaagctg gaaacatgga 120 

aacaccaaca gctacagacc aaaaaaagtc ccaacaaagg cctgtcagtc tgccagcctg 180 

ttctgtggat ttccaactca agattgcagc atcaactcac acctgaagtt ctggcttccc 240 

tacaaacttt gaacttgcca gtccccacaa tggcataagc caattcctta aaatgaatgt 300 

<210> 328 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 328 

gtcacaggca ggtttaatgg ccagtttaaa acttatgcta tctgcggggc cattcgtagg 60 

atggtgagtg tttccctggg ctttgctcat cacttcggga catcgtggac tttaccgtgc 12 0 

gcattggagt gtgtgatggt gcctgagtag atctgctggc agagtagttt gagccagctg 180 

gactgggctg gccgcctgcc gcttcttgag ggtggaagag gggtgctctg agaagacact 24 0 

caggcagcag actctgcctc tcacttaagg tgcccccccg accccgctcc accatagtca 300 
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<210> 329 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 329 
ttttggtcgt ctttaatttg tctcatcagt 
ggtattttta aaatttcaat ttctaattgt 
attggcattg tattttgcaa ctttcctaaa 
tagattctta aggattttct gtgtaaatag 
tttccttttc aaattttgtg ccttttattt 

<210> 330 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 330 
tcaaggatcg gggtgctggc ctattcagtt 
ttgaactact tcttgctgtg tcttcacaag 
cagaccccaa actcagctca tacacacacg 
gaagaagctg ctggacatca gaaactgtga 
aagactgcct gcacattcta tccccttfctc 

<210> 331 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 331 

accgccctgt ggctggaggt gagatatgct ggcagcaata ctgctctgtt actccttgct 60 

acactgagat gtttgggtaa agagaaacat aaatctagcc tacgtgcaca tctgggcaca 120 

gtacctttcc ttgaacttat tcgtgataca gattcctttg ctcacatgtt tccctgctga 180 

ccttcttccc acctgttgcc ctgctacact cccctcgcta agacagtaaa aataatgatc 24 0 

aataaatact gagggaactc agaggccagc gccggtgcgg gtccccccca tgctgagcgc 300 

<210> 332 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 332 
ggaaaaacaa caggtttgag tcctataaag 
acaagcttgt cctatgtcct atttgagtgg 
ggttgtcaag gttgtcccca gaccttgctt 
tgggccctct ggccagaggg aagcgggcag 
ggcttgttta ggattggagg gccaggtcac 

<210> 333 
<211> 300 
<212> DNA 
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gcctccatgt gtttttgatg 
tcattataga aacacaattg 
ctcactagta attctagtag 
tcatgtcatt tgtgaataaa 
cttattctta ccatatcaca 



cctttgaact 60 

ggttttatat 120 

ctttttttgg 180 

gccatttttt 240 

ttggcaaaga 300 



cctggtaagg gctgtcttcc tggcttgcag 60 

catgccccca tcctgtgccg ataagaactc 120 

gaagagagaa gcatctgaac atcaagaaga 180 

aaggagagga gtttggctga gctccagggg 240 

agttccccat cctgctgtca gccacattta 3 00 



ccataattta actccagtag 
cagcagcgcc agcccagcaa 
gcagtggttg gagaacccag 
ctctagccct ggagattgtg 
ctccccagcc accctccctt 



ctgatgtcag 60 

gaaggctggg .120 

ggggctgcct 180 

gtcacattgg 240 

ctctcctctg 300 
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<213> Homo sapiens 



<400> 333 

cctcctactc ccaaacaaat ctttggggaa aaaaaaacta ccaactgtca gccatgggcc 60 

tgacggcgct aagctctggg gctccgtgca ctgacgtggg gccagccaca gggaggcggg 120 

gatcaagtag cggaggccag gattttggcc acctcccggg caagttgcag ggcagtggcg 180 

ccgggagcaa aagcagcatg atgcagctca tgcacctgga gtccttttat gaaaaaacct 240 

cctcctgggc ttatcaagga agatgacact aagccagaag actgcatacc agatgtacca 300 



<210> 334 
<211> 262 
<212> DNA 
<213> Homo sapiens 



<400> 334 

gccatgccca tttgtttact cattgtctat ggttgctttc atgccctcac agcaaaggcg 60 

agtagttgtg atggatcaaa tggcccacaa agcctgaaat atttactctt tgacccttta 120 

cagaaaaaaa ccttgttgac ccctgcttta gagaatgaga agccatgcag ggatcagtga 180 

tgccagagga agggaaggaa ctgcttccag ctattgtgac aataataata ataataatat 240 

tgggtctttg actagaacgt gt 262 



<210> 335 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 335 

tcntntctcn ntattnttgn gtagtncctc ntttccttgt 
tttgcggacc ctcgattcta tctcatatga gtgagaacgc 
ggaaagcctt ccgagggcac tcggactttt ctaggcatca 
ggccttatat gtgtaatgaa tgtggaaaag ccttcagcca 
accaaaagtc tcacatgagt gagaagccct atgaatgcaa 

<210> 336 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 336 

gaggacccac tcccccagga ctcctttgaa ggcgtggacg aggacgagtg ggactagcct 60 

gcgcccccgt cacctccacc tcacctgtgc tgccacttcc tagtgcacac ctcacggctc 120 

atcctcaagc tggaagatac ctctctggcc ccggcacatg tcacccctgc actcctgcct 180 

tcccgtgggc acttccacat cctctgggcc tctggcagtt cccagggact gttttcacct 240 

ctgctgtctc tggggtcagc tgctgctcat cagctgcccg ctagcatgtg gccaggggtg 300 

<210> 337 
<211> 300 
<212> DNA 
<213> Homo sapiens 



ncnntnntcn nctnttgnct 60 

ttaccagtgc agcgaatgtg 120 

gagtcaccac agcagtgaga 180 

gaactcgagc cttaaaaagc 240 

tgaatgtggg aaggctttta 300 



<400> 337 
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agacaaccca gaaacaaatt catacatcta tggtgaccac ttttgacaaa ggaatgaaga 60 

acatacactg gggaaaagat aatgtcttta ataaatggtg ctgggaaaac tggatatcca 120 

tatgcagaag aatgaaacta gacccccatc tcttagcata tacaaaaatc aaaattaatt 180 

aaaaagttaa atctaagacc tcaaactatg aaacagctaa aagaaaacat cggggaatct 240 

ctccaggaca ttggagtggg caaagatttc ttgtgtaata cctgacaaac aggcaaccaa 300 



<210> 338 
<211> 292 
<212> DNA 
<213> Homo sapiens 

<400> 338 

tcaataacca tgaagatgca tcctaccacc gtcagggcaa tcattagata gctgatcttc 60 

actcgcatct tcatggttat tgagggcaag aaggctgccc aaagacacga gactttaaca 120 

agcttgaact tagaaaagaa agctcgtctg aaagaggaag cagctatgaa ggccaaaaca 180 

gagtagcaga ggtatccgtg ttggctggat tttgaaaatc caggaattat gttataacgt 24 0 

gcttgtatta aaaaggatgt ggtacgagga tccatttcat aaagtatgat tt 292 



<210> 339 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 339 

gaaatttgca ctgatggctc agaaggctta cgtcatggag agtatgacct acctcagagc 60 

agggggggct ggaccaacct ggctttcccg actgctccat cgaggcagcc atggtgaagg 120 

tgttcagctc cgaggccgcc tggcagtgtg tgagtgaggc gctgcagatc ctcgggggct 180 

tgggctacac aagggactat ccgtacgagc gcatactgcg tgacacccgc atcctcctca 24 0 

tcttcgaggg aaccaatgag attctccgga tgtacatcgc cctgacgggt ctgcagcatg 300 

<210> 340 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 340 

ctcagngcan cgatcatggc tcagtgcagc ctcaaactct tgggctcaan canagcgggn 60 

acctcaacct cctgagtagc taggactata ggcacacagc accatgcccc ggctattttt 120 

ttattttgta gagatggggt ctcactatgt tgcccaggct agtcttgaac tcctggcctc 180 

aagcaatcct cccacctcgg cctcccaaag tgctgggatt aaaggcgtga gccaccgtac 240 

ctggcccttg gtggaatctt tagggttttc tattcataca tataaaatca tatcattggc 3 00 

<210> 341 

<211> 296 

<212> DNA 

<213> Homo sapiens 

<400> 341 

atccaggtgt ttctgatgca cagtgaaatt ggggtaccac tggtattagg ttgggtatgg 60 

caactttttc atcacttgtt ttatgtagtt gtctgatcaa ttgtgaaaac ataatgaatg 120 

ttggaaatgg aacagtaaaa taacgaaagc caactttttt tttttttttt ttnnnnnnnn 180 
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nnnnnnnnnt tnncccccng ncngnanngc aggggcccaa nntnggntnn ntgnanccnc 240 
cnccnccggg ntnnncccct ttntcnngcc taacccnccc nagnacnngg aactac 296 

<210> 342 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 342 

ggcacgatca tggctcattg cagcctctaa ctccggggct caagcaatcc tcccacctca 60 

gcctaccaag tagctgtgac cacagctgcc cctcaccatg ctaagctaat ttttttaatt 12 0 

agatagtaca taaacgtccc aaaattagaa gataaaaaga catgagggat ccattctaat 180 

ttgtgtttgg agtgtaatgg tccagctcca ttcttctgca catggatatc cagttttaca 240 

caacactgtg aatgtaatga atgccactga atcatacact caaaaatagc taaaatggca 300 

<210> 343 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 343 

gttttcatca ctacatattc tacacacact gggaagctct gacaacttat tccctgctat 60 

tatcaactaa agatcaccct ttctactgct gtctctggag caggagctgg caaactatgg 120 

cctgctgtct gtttttgtac agttttactg aaacacagcc atgcccattt gtttactcat 180 

tgtctatggt tgctttcatg ccctcacagc aaaggcgagt agttgtgatg gatcaaatgg 240 

cccacaaagc ctgaaatatt tactctttga ccctttacag aaaaaaacct tgttgacccc 300 

<210> 344 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 344 

ccccaacctg cactctaccc acccccatca cctactccag ctcccaactt ttgtggactg 60 

agcggccgca gagactgggt cgccttggat tccctctgcc tccgaggacc ccaaaagaca 120 

cccccaaccc caggccagcc ggccctgctc tggcgcgtcc aaaatactac ctagcacagg 180 

cctctgctcg aggcaccccc aaactaccta tgtatccagc cccagagggc ctccattccc 240 

aggaagtccc tatgtatccc aacactggca gacacccagc accaccctcc cagacccgca 300 

<210> 345 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 345 

cccccatcac ctactccagc tcccaacttt tgtggactga gcggccgcag agactgggtc 60 

gccttggatt ccctctgcct ccgaggaccc caaaagacac ccccaacccc aggccagccg 120 

gccctgctct ggcgcgtcca aaatactacc tagcacaggc ctctgctcga ggcaccccca 180 

aactacctat gtatccagcc ccagagggcc tccattccca ggaagtccct atgtatccca 240 

acactggcag acacccagca ccaccctccc agacccgcaa gaaagtgaat ctcactacta 300 
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<210> 346 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 346 

gtccacggtg ctgaacatca tcatctttga agactgtagg aaccagtggt ctatgtcccg 60 
accactactt ggcttgatat tgcttaatga aaagtatttt tctgacctaa gaaacagtat 120 
tgtgaacagc cagccaccgg agaagcagca ggccatgcac ctgtgttttg agaacctgat 180 
ggaaggcatc gagcgaaatc ttcttacgaa aaacagagac aggttcaccc agaacctgtc 24 0 

agcattccgt cgagaagtca acgactcaat gaagaattcc acttatggcg tgaatagcaa 300 

<210> 347 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 347 

gctctgagcc caggcgaggc cagggacatg gccatggacc tgtgtcggca ggaccccgag 60 
tgtgagttct acttcagcct ggacgccgac gctgtcctca ccaacctgca gaccctgcgt 120 
atcctcattg aggagaacag gaaggtgatc agaccccatg ctgtcccgcc acggcaagct 180 
gtggtccaac ttctggggcg ccctgagccc cgatgagtac tacgcccgct ccgaggacta 240 
cgtggagctg gtgcagcgga agcgagtggg tgtgtggaat gtaccataca tctcccaggc 300 

<210> 348 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 348 

gttctgtggc tggcatggtc tgcctgctac tggagagatc tcctgagaat tcaggtttgg 60 
attggtgctg tcatcttcct gggaatgctt gagaaagctg tcttctatgc ggaatttcag 120 
aatatccgat acaaaggaga atctgtccag ggtgctttga tccttgcaga gctgctttca 180 
gcagtgaaac gctcactggc tcgaaccctg gtcatcatag tcagtctggg atatggcatc 240 
gtcaagccac gccttggagt cactcttcat aa^gttgtag tagcaggagc cctctatctt 300 

<210> 349 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 349 

gtcagctttt gatgaagcta tgtcatactg tcgatatcat ccttccaaag ggnattggtg 60 
gcacttcaaa gatcatgaag agcaagataa agtcagacct aaagccaaaa ggaaagaaga 120 
accaagctct atttttcaga gacaacgtgt ggatgcttta cttttagacc tcagacaaaa 180 
atttccaccc aaatttgtgc agctaaagcc tggagaaaag cctgttccag tggatcaaac 240 
aaagaaagag gcagaaccta taccagaaac tgtaaaacct gaggagaagg agaccacaaa 300 

<210> 350 
<211> 270 
<212> DNA 
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<210> 346 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 346 

gtccacggtg ctgaacatca tcatctttga agactgtagg aaccagtggt ctatgtcccg 60 

accactactt ggcttgatat tgcttaatga aaagtatttt tctgacctaa gaaacagtat 120 

tgtgaacagc cagccaccgg agaagcagca ggccatgcac ctgtgttttg agaacctgat 180 

ggaaggcatc gagcgaaatc ttcttacgaa aaacagagac aggttcaccc agaacctgtc 240 

agcattccgt cgagaagtca acgactcaat gaagaattcc acttatggcg tgaatagcaa 300 

<210> 347 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 347 

gctctgagcc caggcgaggc cagggacatg gccatggacc tgtgtcggca ggaccccgag 60 
tgtgagttct acttcagcct ggacgccgac gctgtcctca ccaacctgca gaccctgcgt 120 
atcctcattg aggagaacag gaaggtgatc agaccccatg ctgtcccgcc acggcaagct 180 
gtggtccaac ttctggggcg ccctgagccc cgatgagtac tacgcccgct ccgaggacta 240 
cgtggagctg gtgcagcgga agcgagtggg tgtgtggaat gtaccataca tctcccaggc 300 

<210> 348 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 348 

gttctgtggc tggcatggtc tgcctgctac tggagagatc tcctgagaat tcaggtttgg 60 
attggtgctg teat ct tec t gggaatgett gagaaagctg tettctatge ggaatttcag 120 
aatatccgat acaaaggaga atctgtccag ggtgctttga tecttgeaga getgetttea 180 
gcagtgaaac gctcactggc tcgaaccctg gtcatcatag tcagtctggg atatggcatc 240 
gtcaagccac gccttggagt cactcttcat aa^gttgtag tagcaggagc cctctatctt 300 

<210> 349 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 349 

gtcagctttt gatgaagcta tgtcatactg tcgatatcat ccttccaaag ggnattggtg 60 
gcacttcaaa gatcatgaag agcaagataa agtcagacct aaagccaaaa ggaaagaaga 120 
accaagctct atttttcaga gacaacgtgt ggatgettta cttttagacc tcagacaaaa 180 
atttccaccc aaatttgtgc agetaaagee tggagaaaag cctgttccag tggatcaaac 240 
aaagaaagag gcagaaccta taccagaaac tgtaaaacct gaggagaagg agaccacaaa 300 

<210> 350 
<211> 270 
<212> DNA 
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<213> Homo sapiens 
<400> 350 

ccatgctgnt aacgggtttc aaggggactc ttgaggaant gccccctaaa atagaacaca 60 

gcaatanggn gggcttcctg tccccaggnc caccccacag tgctntntgg cactggnaac 120 

tctgctangg agngantgna nnnnaccant aannnnnnan nnatcnacan nnnnnnnncn 180 

nnnnnncntn tnnccnannn ntannctncc ntannnnanc cnnccannan cactcncnat 240 

naacgnnnnn ttantgagan nttctcaact 270 



<210> 351 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 351 

aaatgactcc ctgcaaaacc caacccatgc tgctggctgt gggatttttg gtgtaagcct 60 

atctatgcac tctatcagcc agaatttggc atttagctct tagttaaatc tagtaaagga 120 

cagtctattg tttaaagaga aggtgcattt gttcctcaat caagcaagag cacctgtgtt 180 

gtactgcttt atatctcatg tatatttata gtaatgaaaa gactttttaa attgtacacg 240 

tttcagtgcc tttcttgtgt tatgaaaggc aggtagatat tatagccata ggtaaaaatc 300 

<210> 352 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 352 

aagaaatgcc tctatgtagg tgaagtgttc tctctgcatg caacagtaaa aattaatata 60 

atattttccc cacaaaagaa acacttaaca gaggcaagtg caatttataa atttatatct 120 

aaaggggaat catgattata agtccttcag cccttggact ctaaattgag gggattaaaa 18 0 

agaatttaaa ataattttga acgaatttat tttcccctca gtttttgagg gcattaaaaa 240 

ggcattaaat caagacaaat catgtgcttg agaaaaataa aattaatgaa aacacagcac 300 

<210> 353 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 353 

cccacactcg gacactgtgg aattctacca gcgcctgtcg accgagacac tcttcttcat 60 

cttctactat ctggagggca ctaaggcaca gtatctggca gccaaggccc taaagaagca 120 

gtcatggcga ttccacacca agtacatgat gtggttccag aggcacgagg agcccaagac 180 

catcactgac gagtttgagc agggcaccta catctacttt gactacgaga agtggggcca 240 

gcggaagaag gaaggcttca cctttgagta ccgctacctg gaggaccggg acctccagtg 300 

<210> 354 
<211> 299 
<212> DNA 

<213> Homo sapiens 



<400> 354 
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gaaggaggac ctaggcacac acatatggtg gccacaccca ggagggtagt ggggagttag 60 

atttcagagt ccaggcccta ggttgggacc cactccaaat aatctcctcg gtgtgggtgg 120 

tggttctata gagggataaa tgaataataa acattgttaa aatatacgaa aaaaaaaaaa 180 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 240 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaacnncn ncnananaaa aaaaaaaaa 299 



<210> 355 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 355 

actgttcatc ctaagttcca ctataaacag gctcatgact cgggcacaga cacttcttgc 60 

gtgacttttt cctatgatgg taatgtcctt gcctctcgtg gaggtgacga ttcattaaaa 120 

ttatgggaca tccgacaatt taataaacca cttttttcag cctcgggtct tcccaccatg 180 

ttcccaatga ctgactgctg tttcagtcca gatgataagc tcatagtcac tggtacatct 240 

attcaaagag gatgtggcag cggcaaactt gttttctttg agcgtaggac tttccaaagg 300 

<210> 356 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 356 

ttcagaaaga aacatttaat agggacttac aaacaaatta atgtctgagt ctcaggtggc 60 

agcaagacaa gatggtggat ccccatgcca ttacctgcta gactcagggt ttatatactg 120 

tagtggagag gtgattccga aggaatgttg taagacaatt gaagagcagt aacatcaaag 180 

ttatttgacc taagggcagg agttacagta agtatccact tttatacaag aaacaataga 240 

taaactggaa atcttggagc ccttcctgga actggggtta atgagaagtc aacatggtgg 300 

<210> 357 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 357 

acaaaaccta cagatggaga taaaaattac tactgttatt caacatgtgt tccagaacct 60 

tattttgggg agtaaagtca attgggcaga ggatcctgcc cttaaggaaa ttgttctgca 120 

gcttgagaag aatgttgaca tgatgtaata agaattcatt tctgacatat tttacatttc 180 

tggcaatctc aactcttatt tggaatactt ctgtgcattt gtctgtccac cgtaatttta 240 

gaaaagcata tccataacgt ttacagttgt agtacagttg tggttagtta tttgtagtgg 300 



<210> 358 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 358 

ggtgattaca gaagcccaga aggttgatac cagagccaag aacgctgggg ttacaatcca 60 

agacacactc aacacattag acggcctcct gcatctgatg gaccctgcac ttgatggacc 120 

agctggcacc acccagatca ataaactggc ttatttgaat ttgcggcccc ccacccagga 180 
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actgactcag tgcaagaaga cagcttcgac tccctgtgat ttcatctctg accaatccgc 24 0 

actcctggct cactggcttc cccaacccat gaagttttcc ttaaaaactc tgctcccgaa 300 

<210> 359 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 359 

atcaggtgtt cctcccatgg caggagggaa gaaacccagc aaacggccag cctgggactt 60 

aaagggtcag ttatgtgacc taaatgcaga actaaaacgg tgccgtgaga ggactcaaac 120 

gttggaccaa gagaaccagc agcttcagga ccagctcaga gatgcccagc agcaggtcaa 180 

ggccctgggg acagagcgca caacactgga ggggcattta gccaaggtac aggcccaggc 240 

tgagcagggc caacaggagc tgaagaactt gcgtgcttgn gtcctggagc tggaagagcg 300 

<210> 360 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 360 

tctgtctggt gatttttatt ttaagtgaac ctttggatct atctttaact ctctttattg 60 

tgagtggtaa attccaattc tgcagcagat cagtaaactc acagtatttt tcctgtggaa 120 

atctattcaa taaggaaacc aagacaggat aataaaattt aaaaaaaaac aactttgaat 180 

tcccctgcct aggtcttcca gttgttttcc agcgcatacc tcaggtatga ctttgctagc 240 

cggggacaaa attagcacct tccgattctc tagtccaaat gaactttgtg ctaaataaaa 300 

<210> 361 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 361 

gtagaacaga aaatgagcat ccgatttctt cactaaagga gaccaaactg ttccttgcgg 60 

tctagtattg aagaactgga acttgaaagt cctccttcta ccaactccac ctccaccccc 12 0 

tcattcccct tctcccaaag tactactgct gttgcatgac aaccccaaat atgttctgtc 180 

aacacaaacc tgcctttggt gtataaacag ggcattacag aatggtacac cctatatatt 240 

tctgttcagt atccattcac tagttcttca tttataaata tcatcttccc cattctgctg 300 

<210> 362 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 362 

actaccccgg ctacggttcc cccatgcctg gcagcttggc catgggcccg gtcacgaaca 60 

aaacgggcct ggacgcctcg cccctggccg cagatacctc ctactaccag ggggtgtact - 120 

cccggcccat tatgaactcc tcttaagaag acgacggctt caggcccggc taactctggc 180 

accccggatc gaggacaagt gagagagcaa gtgggggtcg agactttggg gagacggtgt 24 0 

tgcagagacg caagggagaa gaaatccata acacccccac cccaacaccc ccaagacagc 300 
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<210> 363 
<211> 271 
<212> DNA 
<213> Homo sapiens 

<400> 363 

ggcaattagc ctcgcttaag ttgccttttt tacacaccaa aactttttac atgaagggct 60 

ggtttcacat gaatactata ctgaaatctg tgctctcaag atctagcagt gaccagggct 120 

gcccggcggg ggctctcctg gcaagtcagg aaggtnnnnn nnnnnnnnnn nnnnnnnnnn 180 

canattantn nctgatcntc tntnangaan nnngantngc tctnttggnc nttgtnnnnn 240 

gncntnnnnt naantntttn ntnatgtngc t 271 

<210> 364 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 364 

agaggaccct gcagttaggg ggtgttactt tgtcgcccag gatggcctgg acccccaggt 60 

tcagggattc tcccgccgct gcttcctgag tagctgggac ctcaggcttc cgcctcgtgc 120 

ccgcatccct gctgtgttta ggcagcaggt ggtgacctca ctcctccctg gcctgagctc 180 

tccgtcccgc atcccaggcg gaggccctag ggaacacttt gaagctgagc acggggtgga 240 

ccctccctcc tgagtgaatg gagaatagaa agggagagga tttctgttct gttctgtggg 300 

<210> 365 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 365 

gttcttcaaa gccaaccaag acaggcttag cagttttaga gcttcagaac aaattgccaa 60 

aagccagagt tgtttatgct agtgcaactg gtgcttctga accacgcaac atggcctata 120 

tgaaccgtct tggcatatgg ggtgagggta ctccatttag agaattcagt gattttattc 180 

aagcagtaga acggagagga gttggtgcca tggaaatagt tgctatggat atgaagctta 240 

gaggaatgta cattgctcga caactgagct ttactggagt gaccttcaaa attgaggaag 300 

<210> 366 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 366 

gccagtcctc accttcccta gtcctcgtgt gtattttagg agatgcgtgg gtgtggaaca 60 

gcctcctgcc tccggtccag gtgtactggg gtctgtgtgt tgtgtttctg cgtgttctcg 120 

gcagaaagtg gcatgctgtc ccgcctgggt gatttgctct tttacactat tgctgaagga 180 

caggaacgaa tccctatcca caagttcacc actgcactaa aggccactgg actgcagaca 240 

tcagatcctc ggctccgaga ctgcatgagc gagatgcacc gcgtggtcca agagtccagt 300 

<210> 367 
<211> 300 
<212> DNA 
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<213> Homo sapiens 
<400> 367 

cattgccaga gagcggtttc agcaagctgc agatctgatt gatgctgagc aacgaatgaa 60 

gaagtccatg tggggtcagt tctggtctgc tcaccagagg ttcttcaaat acttatgcat 120 

agcatccaaa gttaaaaggg ttgtgcaact agctcgagag gaaatcaaga atggaaaatg 180 

tgttgtaatt ggtctgcagt ctacaggaga agctagaaca ttagaagctt tggaagaggg 240 

cgggggagaa ttgaatgatt ttgtttcaac tgccaaaggt gtgttgcagt cactcattga 300 

<210> 368 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 368 

gcccggcccg gcgacgctgg cgacgctttc gcccctgagg tagtttggcg accgcgaaga 60 
aggaaaaagg gcgggcgggc ggctgtcctc tcaccgtcct caccccgcga ggcccggccc 12 0 

gctcctccgt cgtggatttc gcggcgatcc ccccggcagc tctttgcaaa gctgcttgaa 180 
acttctccca aactcggcat ggatacgact gcggcggcgg cgctgcctgc ttttgtggcg 24 0 

ctcttgctcc tctctccttg gcctctcctg ggatcggccc aaggccagtt ctccgcaggt 300 

<210> 369 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 369 

gtggggtgtg cctcgtgtgc gtggattcgt gtgtgtgtgt gtgtcttgta tatgtgtgcg 60 
cagagtgcat cattttcaga ctctactatt tccgtcaagt attctgtttg atttggatca 120 
tctcaggatc ggattctgtt ttagagtgtt tctgggccag gatccgggcc cctgccctcc 180 
tctgcacctg accacactcc ctactcaggg ctagtctgtt cttcccggac atcttctggt 240 
agccgtgcag gagagggctg ggtggggcag aggccacaga ggggacctgg tgtgtcacct 300 

<210> 370 
<211> 273 
<212> DNA 
<213> Homo sapiens 

<400> 370 

cagaggctgg ttcagaaaag gaggaagagg cccggctggc agccctggaa gagcagagga 60 

tggaggggaa gaagcccagg gtgatggcag gcaccttgaa gctggaggat aagcagcggc 120 

tggcccagga tgaggagagt gaggcctagc gcctggccat tatgatgatg aagaagctnn 180. 

nnnnnnnnnn nnnnnnnnnc atcatgtccn ntgcatggct acctatccca tatttnatnt 240 

ccctnncgtt gnttcnaatt ncacattntc ttt 273 

<210> 371 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 371 



88 



WO 99/33982 



PCT/US98/27610 



gatgaggagt gtttaatcat tgatacagaa tgtaaaaata atagtgatgg aaagacagct 60 

gttgtgggtt ctaacttaag ttccagacca gctagtccaa attcttcctc aggacaggct 120 

tctgtaggaa accagactaa tactgcttgt agtcctgaag agtcatgtgt tttaaaaaaa 180 

cctatcaaac gagtatataa aaaatttgat ccagttggag agattttaaa aatgcaggat 240 

gagctcttaa agccaatttc cagaaaagta ccagaattgc ccttaatgaa tttagaaaat 300 

<210> 372 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 372 

gggccccaat gcagctgccc tctccagata cctggcagcc tcatatatca gccaaagcct 60 

ggctcggcgg caggggcctg ggggaggggc ccccgcagcc tcccggggct cctggtcctc 120 

tgctcccacg tcacgggcat cttcgccgcc cccccagccc cagccaccac ctcccgcagc 180 

caggcggctc agctatgcca cgacggttaa catccacgtg ggcgggggtg ggcggctgcg 240 

gccagccaag gcccaggtcc ggttgaacca ccctgctctc ttggcctcca cacaggaatc 300 

<210> 373 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 373 

accctttctg ccttctgttt gggacccagc tggtgttctt tggtttgctt tcttcaggct 60 

ctagggctgt gctatccaat acagtaacca catgcggctg tttaaagtta agccaattaa 120 

aatcacataa gattaaaaat tccttcctca gttgcactaa ccacgtttct agaggcgtca 180 

ctgtatgtag ttcatggcta ctgtactgac agcgagagca tgtccatctg ttggacagca 240 

ctattctaga gaactaaact ggcttaacga gtcacagcct cagctgtgct gggacgaccc 300 

<210> 374 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 374 

tcaaggccta cgaacaggtg atgcactacc ccggctacgg ttcccccatg cctggcagct 60 

tggccatggg cccggtcacg aacaaaacgg gcctggacgc ctcgcccctg gccgcagata 120 

cctcctacta ccagggggtg tactcccggc ccattatgaa ctcctcttaa gaagacgacg 180 

gcttcaggcc cggctaactc tggcaccccg gatcgaggac aagtgagaga gcaagtgggg 240 

gtcgagactt tggggagacg gtgttgcaga gacgcaaggg agaagaaatc cataacaccc 300 

<210> 375 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 375 

cttcagtgca cacaacagga gagaggagaa agaagaaacg ctagtaattc caagcactgg 60 

aattaagttg ccttcatcag tgtttgcttc agagtttgag gaagatgttg tgattgttaa 120 

ataaagcagc tccagtttca ggacctcgac tggattttga tcctgacatt gttgcagctc 180 
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ttgatgatga ttttgacttt gatgatccag ataatctgct tgaggatgac tttattcttc 240 
aggccaataa ggcaacagga gaggaagagg gaatggatat acagaaatct gagaatgaag 300 

<210> 376 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 376 

gggagactgg ggtctatttc acccctgcag tctcgaccat aagagatggc tacacccagg 60 
ggggccagtt cagagaccca ctcccaggtg tgcattctct ttctcaagga tgttccttgc 120 
tgagaaaaag aattcagtga tatttctccc atttgcttgt gaaagaagag aaatgtggct 180 
ttgttccacc tggctcaccg gcggtcagaa tttaaggtta tctctcttgt ttcctaaaca 240 
ttgctgttat cctgttcttt tttcaaggtg cccagatttc atattgctca aacacacatg 300 

<210> 377 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 377 

gatcagccca cctcggcctc acaaagtgct gggattacag gcgtgagcca ccttgcccag 60 

cccacatcat acagtttgaa atgaaacttt gccacaacca gcctttgctg tagcacacac 120 

atatatcact gaacctgttt gaaataaagt tttttttctt tttcctctgg tattctgggt 180 

tctgaagtct ggtattctgg tattctgggt tcaaaagtat gacttgagag tgttgctctg 240 

gtattctgag agttgctctg tattctgggt tctgaagatt atttgaaaaa taactcctac 300 

<210> 378 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 378 

tcgctgtgat ccaaggataa aaaagttcaa ggaagaagaa aaagccaaga aagaagcaga 60 

aaagaaagca aaagcagaag ctaaacggaa ggagcaagaa gctaaagaaa aacaaagaca 120 

agctgaatta gaagctgctc ggttagctaa ggagaaagaa gaggaggaag tcagacagca 180 

agcattgctg gcaaagaagg aaaaagatat ccagaaaaaa gccattaaga aggaaaggca 240 

aaaacttcga aactcatgca agacctggaa tcatttttct gataatgagg cagagcgggt 300 

<210> 379 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 379 

acactataga atacaagcta. cttgttcttt ttgcaggatc ccatcgattc gaattcggca 60 

cgaggcagct tcgagccaat ggtgagctcc ttctggatca gctccttcag ctccttcttg 120 

ctcaggatgc tgaaattgca aggctgatgg aagacttgga ccggaacaag gaccaggagg 180 

tgaacttcca ggagtatgtc accttcctgg gggccttggc tttgatctac aatgaagccc 240 

tcaagggctg aaaataaata gggaagatgg agacaccctc tgggggtcct ctctgagtca 300 
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<210> 380 
<211> 296 
<212> DNA 

<213> Homo sapiens 

<400> 380 
acctggacag ggccagctgc tgggggagcg 
gtgtgtgtcc cctgtttact tttagctgag 
tgagccctgc ggcccacctg atgtttacgt 
nnnnnnnnnn ngtnatangc ttaacanatg 
atttgtgcat tnnaactatg cttttncgat 

<210> 381 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 381 

cagaaaagag tatagtaggg atgaccaagg tcaaagtggg taaagaagac tcatcatcca 60 
ctgagtttgt agaaaaacgg agagcagctc ttgaaaggta tcttcaaaga acagtaaaac 12 0 

atccaacttt actacaggat cctgatttaa ggcagttctt ggaaagttca gagctgccta 180 
gagcagttaa tacacaggct ctgagtggag caggaatatt gaggatggtg aacaaggctg 240 
ccgacgctgt caacaaaatg acaatcaaga tgaatgaatc ggatgcatgg tttgaagaaa 300 

<210> 382 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 382 

gccaccggtc tcttcctaat ctgcacagac tattttgggt atttctgggc gggcagttcc 60 
tttgcatgtt tcgggagagg tttgctgatt tggggcttat atgtcaggcc tttggtttgc 120 
gtcttatttt aggggttgtt tgggggcctg ggtggtcggc ctcacatggg aaggggatgg 180 
gtagtggatg gggtttctgt tgtatcttgt gggcgggtga ttttgctttt gtttttgttt 240 
cacattcttc cccctccaca agccaaagtc gtttcatttg gtttccactg tgtggactgt 300 



<210> 383 
<211> 273 
<212> DNA 

<213> Homo sapiens 
<400> 383 

gagatttgat attcgagtgc tgggcttagg tctgttgata aatctagtgg agtatagtgc 60 
tcggaatcgg cactgtcttg tcaacatgga aacatcgtgc tcttttgatt cttccatctg 120 
tagtggagaa ggggatgata gtttaaggat aggtggnnnn nnnnnnnngc cngcnttnac 180 
ttnatngcnn ctttttcttg atcnacgncn gnnatncnna nnngtntata ntaatncnga 240 
ananttnttt gnnntgcttt atcaantntt cnt 273 

<210> 384 
<211> 259 



gcactgggga ctggaggctg gaagcgggtg 60 

c^9999^9S gtgtacgggt tctgttcctc 120 

gtgtgtgtga gggggggcnn nnnnnnnnnn 180 

nanagncnac tnactnctga ttntttatnc 240 

cttnctgntg nnatnacngg catgat 296 
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<212> DNA 

<213> Homo sapiens 



<400> 384 

aagagaagga cctagagatt gagaggctta agacgaagca aaaagaactg gaggccaaga 60 

tgttggccca gaaggctgag gaaaaggaga accattgtcc cacaatgctc cggccccttt 120 

cacatcgcac agtcacaggg gcaaagcccc tgaaaaaggc tgtggtgatg cccctacagc 180 

taattcagga gcaggcagca tccccaaatg ccgagatcca catcctgaag aataaaggcc 240 

cgaagagaaa gctggagtc 259 



<210> 385 

<211> 296 

<212> DNA 

<213> Homo sapiens 



<400> 385 

agagcctgca agtgacaaag gaagtgaggc agaggcccac atgcccccac cgttcacacc 60 

ctacgtgcct cggattctga acggcttggc ctcggagagg acagcactgt ctccgcagca 120 

gcagcagcag cagacctatg gtgccatcca caacatcagc gggactatcc ctggacagtg 180 

cttggcgcat agcgccacgg gcagtgtggc ttgctgcccc ccaggaggcc tgaggctggg 240 

tctcactgct ctgaaaaaga cccnnccaaa atgggccttg gggctnnagg cccttg 296 



<210> 386 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 386 

gaagaggagg ctgtgtatga ggaacctcca gagcaggaga ccttctacga gcagccccca 60 

ctggtgcagc agcaaggtgc tggctctgag cacattgacc accacattca gggccagggg 120 

ctcagtgggc aagggctctg tgcccgtgcc ctgtacgact accaggcagc cgacgacaca 180 

gagatctcct ttgaccccga gaacctcatc acgggcatcg aggtgatcga cgaaggctgg 24 0 

tggcgtggct atgggccgga tggccatttt ggcatgttcc ctgccaacta cgtggagctc 300 



<210> 387 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 387 

ccgcagaggg cctggaagag gtgct caeca cgccagagac tgtgctcaca ggccacacgg 60 

agaagatctg ctccctgcgc ttccacccac tggcagccaa tgtgctggcc tcgtcctcct 120 

atgacctcac tgttcgcatc tgggaccttc aggctggagc tgatcggctg aagctgeagg 180 

gccaccaaga ccagatcttc agcctggcct ggagtcctga tgggcagcag ctggccactg 240 

tetgeaagga tgggcgtgtg egggtctaca ggccccggag tggecctgag cccctgcagg 300 



<210> 388 
<211> 300 
<212> DNA 

<213> Homo sapiens 
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<400> 388 



tggaggtctc ctttcgcccc agcccaggtg 
gagcacattt tgtagggtgg caccttttta 
aagagaaaaa agtgaccttt catttttttt 
tactgatttt ttttttctta aaactaaatg 
tttcatactg tggggcaaag tatttgtgct 



gccaagccca 
tccaagttac 
tcttgaaact 
catgactgca 
gctttttgga 



tcctggcctc agaacatgct 
tagctacaca tcagtgttta 
tgaggaaaca agatacatac 
gagcggtaga ggtgtatatt 
gatggactgg aacgtctggt 



60 
120 
180 
240 
300 



<210> 389 
<211> 293 
<212> DNA 

<213> Homo sapiens 
<400> 389 

gtcaagctgg ccctggatgt ggagatcgcc acctaccgca agctgctgga gggcgaggag 60 
tgcaggctga atggcgaagg cgttggacaa gtcaacatct ctgtagtgca gtccaccgtc 12 0 

tccagtggct atggcggtgc cagcggtgtc ggcannnnnn nnnnnnnnnn nnnatgaanc 180 
agntactcct atggnnttag cnttntanct atnacctgcn cnaactannc tnangtgcta 24 0 

gnncttgccc caacccctac ttttgtattt atattgtgtg tgcgtgtgtg cgt 293 

<210> 390 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 390 

ctcacacctg ctttggatgc ttcaagcacc tcagccctct gaactacaaa acagangagc 60 

ctgcaagtga caaaggaagt gaggcagagg cccacatgcc cccaccgttc acaccctacg 120 

tgcctcggat tctgaacggc ttggcctcgg agaggacagc actgtctccg cagcagcagc 180 

agcagcagac ctatggtgcc atccacaaca tcagcgggac tatccctgga cagtgcttgg 240 

cgcagagcgc cacgggcagt gtggctgctg ccccccagga ggcctgaggc tgggtctcac 300 

<210> 391 
<211> 257 
<212> DNA 
<213> Homo sapiens 

<400> 391 

acccgtccgg ggccggccaa tttgcatatt tggaatgcgc cgctataaac ccggctgggg 60 

ttttgcagcg atttcttaga tgtaaaaatg agatctcaat agcagcgggc tgggcacatc 120 

ctctcctctc tccttctctc tctgcccgga gctggtttcc gtctctcggc tcggggctgg 18 0 

aactccggcc caacctaggc gcgcagccgc cacgagatgg cgcacttccg atcaatgtca 240 

aagccgccgg ggagccc 257 

<210> 392 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 392 

gcgcgagcgt cggctccgcc tgggcccttg cggtgcgctg cgggcaggcg gtgaggctca 60 
cgcatgtgct tacgggcaag aacctgcaca cgcaccactt cccgtcgccg ctgtccaaca 120 
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accaggaggt gagtgccttt ggggaagacg gcgagggcga cgacctggac ctatggacag 180 
tgcgctgctc tggacagcac tgggagcgtg aggctgctgt gcgcttccag catgtgggca 240 
cctctgtgtt cctgtcagtc acgggtgagc agtatggaag ccccatccgt gggcagcatg 300 

<210> 393 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 393 

gcgcgagcgt cggctccgcc tgggcccttg cggtgcgctg cgggcaggcg gtgaggctca 60 

cgcatgtgct tacgggcaag aacctgcaca cgcaccactt cccgtcgccg ctgtccaaca 120 

accaggaggt gagtgccttt ggggaagacg gcgagggcga cgacctggac ctatggacag 180 

tgcgctgctc tggacagcac tgggagcgtg aggctgctgt gcgcttccag catgtgggca 240 

cctctgtgtt cctgtcagtc acgggtgagc agtatggaag ccccatccgt gggcagcatg 300 

<210> 394 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 394 

gtccatacat ggagctccct ggagcccgtg tgctctcgtg tgactgaacg ttttgtgatg 60 

aaaggaggag aggctgtctg cctttatgag gagccagtgt ctgaattgct gaggagatgt 120 

gggaattgca cacgggaaag ctgtgtggtt tccttttacc tttcagctga ccatgaactc 180 

ctgagcccga ccaactacca cttcctgtcc tcaccgaagg aggccgtggg gctctgcaag 240 

gcgcagatca ctgccatcat ctctcacnag gngaccatat tggtttttga cctggagacc 300 

<210> 395 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 395 

gcaaaatcaa tgtggactga acataaatca cctgatggaa ggacttacta ctacaacact 60 

gaaaccaaac agtctacctg ggagaaacca gatgatctta aaacacctgc tgagcaactc 120 

ttatctaaat gcccctggaa ggaatacaaa tcagattctg gaaagcctta ctattataat 180 

tctcaaacaa aagaatctcg ctgggccaaa cctaaagaac ttgaggatct tgaagcaatg 240 

atcaaagctg aagaaagcag taagcaagaa gagtgcacca caacatcaac agccccagtc 300 



<210> 396 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 396 

aagagcacaa gaggaagaga gagaccctca ctgctgggga gtccctgcca cactcagtcc 60 
cccaccacac tgaatctccc ctcctcacag ttgccatgta gaccccttga agaggggagg 120 
ggcctaggga gccgcacctt gtcatgtacc atcaataaag taccctgtgc tcaaccaaaa 180 
aaaaaaaaaa aaaaacnnnn nnnnnnnnnn nntntngggn gnctnntnnc nnaaanccan 240 
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ncttnataaa anccttngnt natttggaac aacccncann taaanngcag ggaaaaaaag 300 

<210> 397 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 397 

gataaatacc tcagcccctc gccttcctca acccacctgg caagtcttct taggatctga 60 

tcccagtttt ctggaagcaa tcctacccca gcccaagctt cccagagtcg agccttaatc 120 

cttctcactt ctcagtgtca gagcagaaat gaatcctggg gttgactgtg tccattcggg 180 

ttattagcag ctaagaagcc cagacgagta gtgtgagctg ccttgggagc ctcagtgagg 240 

gcactgggac tggcctcact ctcttgcccc cagcctagtg ggctttctcc tctgtctctc 300 

<210> 398 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 398 

ctgaacccta aaggaaagcc agcaaaccag ctgcttgctc tcaggacttt ttgcaattgt 60 

tttgttggcc aggcaggaca aaaactcatg atgtcccaga gggaatcact gatgtcccat 120 

gcaatagaac tgaaatcagg gagcaataag aacattcaca ttgctctggc tacattggcc 180 

ctgaactatt ctgtttgttt tcataaagac cataacattg aagggaaagc ccaatgtttg 240 

tcactaatta gcacaatctt ggaagtagta caagacctag aagccacttt tagacttctt 300 

<210> 399 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 399 

gctgacctac agcagaagct gctggatgca gaaagtgaag acagaccaaa acaacgctgg 60 

gagaatattg ccaccattct ggaagccaag tgtgccctga aatatttgat tggagagctg 120 

gtctcctcca aaatacaggt cagcaaactt gaaagcagcc tgaaacagag caagaccagc 180 

tgtgctgaca tgcataagat gctgtttgag gaacgaaatc attttgccga gatagagaca 240 

gagttacaag ctgagctggt cacaatggag caacagcacc aagagaaggt gctgtacctt 300 

<210> 400 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 400 

ggctagcgat ttctacctgc gctactacgt agggcacaag ggcaagtttg ggcaccgagt 60 

ttctggagtt cgaatttcgg ccggacggaa agcttagata tgccaacaac agcaattaca 120 

aaaatgatgt gatgatcaga aaagaggctt atgtgcacaa gagtgtaatg gaagaactga 180 

agagaattat tgatgacagt gaaattacaa aagaagatga tgctttgtgg cctccccctg 240 

atagggttgg ccgacaggag cttgaaattg taattggaga tgagcacata tcttttacca 300 

<210> 401 
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<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 401 

acccccttca tggacagatc ccccacagcc tggggcagaa gaggcgtcga gggcgccaga 60 

agtggcggca gcagcagccg cagcagccaa agagaggcaa gagaaagaga aagcgggcgg 120 

tggaggggtc ccggaagagc tggtccccgt ggttgagctg gtccccgtgg ttgaattgga 180 

agaggccata gccccaggct cagaggccca gggcgctggg tctggtgggg acgcgggggt 240 

gcccccaatg gtgcagctgc agcagtcacc actagggggt gatggagagg aagggggcca 300 

<210> 402 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 402 

ggatcctttc cagacagaag accccttcaa atctgaccca tttaaaggag ctgacccctt 60 

caaaggcgac ccgttccaga atgacccctt tgcagaacag cagacaactt caacagatcc 120 

atttggaggg gaccctttca aagaaagtga cccattccgt ggctctgcca ctgacgactt 180 

cttcaagaaa cagacaaaga atgacccatt tacctcggat ccattcacga aaaacccttc 240 

cttaccttcg aagctcgacc cctttgaatc cagtgatccc ttttcatcct ccagtgtctc 300 

<210> 403 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 403 

aattccgttg ctgtcggaca gattgcccta gtacccaccc acctatcagg gttatgcaat 60 

ggaacatcct cgcccaagct cttggagaag gcaaagacaa ctttgtacag tgccctgttg 120 

aagcactcaa atgggaagaa aggaaatgtc tcatcctgga agaaatcctg gcctaccagc 180 

ctgatatatt gtgcctccaa gaggtggacc actattttga caccttccag ccactcctca 24 0 

gtagactagg ctatcaaggc acgtttttcc ccaaaccctg gtcaccttgt ctagatgtag 300 

<210> 404 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 404 

agtgggataa aatgagacga gccctggaat ataccattta caatcaggaa cttaacgaga 60 

ctcgtgccaa acttgatgag ctttctgcta agcgagagac tagtggagaa aaatccagac 120 

aattaagaga tgctcagcag gatgcaagag ataaaatgga ggatatcgaa cgccaagtta 180 

gagaattgaa aacaaaaatt tcagctatga aagaagaaaa agaacagctt agtgctgaaa 24 0 

gataagagca gattaagcag aggactaagt tggagcttaa agccaaggat ttacaagatg 300 

<210> 405 
<211> 856 
<212> DNA 

<213> Homo sapiens 
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<400> 405 



nattcngggg 
nctcaccnnt 
ncnntnncng 
aaanntnttt 
ttccnangtg 
caataatttc 
atnantcccc 
ttntntcccn 
nntcnncggg 



aaaaaaaaaa 



aaaananaaa 



tggtgccngt 
gcacgaggaa 
gagttagatt 
tgggtggtgg 



tcctattccg 
ggaggaccta 
tcagagtcca 
ttctatagag 
aaaaaaaaaa 
aatnaaaaan 
ggntttttcc 
tcttttggtn 
ttttttntnn 
ntccncnncn 
nnnngngtnt 
ttnncncccc 
tttattnttn 
ntntaanttc 
gttnct 



tgctntcgtn ctncnccagg ancnangcgt ntcgaattcg 
ggcacacaca tatggtggcc acacccagga gggtagtggg 
ggccctaggt tgggacccac tccaaataat ctcctcggtg 
ggataaatga ataataaaca ttgttaaaat atacgaaaaa 
aaaaaaaaan aaaaaaaana aaaaaaaaaa anaanaaaaa 
annanaaaaa aaaaaaaaaa aanncccctn cnccctaaaa 
tccannccnn ntntttaata nnctncttnt tgnntcttng 
ggcnntaana naaaatnttn nttttttttn ggntanaaat 
ttttttttcn aaaccctcct nttntanctc ncgtntcnaa 
nttnnntnnt nctntttcta tttttnnttc ttntncaann 
nntgnggctt gtttnttttt ncnncctngc gtcatccnnc 
nannccnnat tttttntnnc ctctatntnn gnngngnnat 
atnantagtc ntntnttttn ttntccntng tnatannatt 
ctcannnnat ttnntnnncn ncgngntata tttnangnta 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
856 



<210> 406 
<211> 843 
<212> DNA 

<213> Homo sapiens 
<400> 406 

tnntnnnnnc gnangctggn nnnttctncc cntttctaat ngttnctaat actanggatn 60 

gtcacgaggn tcccangtag gcatagcgca ctgctgtacg tttttggttg tttttaagaa 120 

actcgatgaa gaggggtgtc attctgggct cggggtggtt gccaattttt caccagaaag 180 

ggagccaccc cttgcaacca cttctgtctc cgttagcccc ccctctgccc tcctccaagc 240 

caaagcgtgg cctggctttt gtcttcccat ttagttttcc tcttttaccc ttccttttgt 300 

gcttaattta ttaaaatagt tgctgtataa tttattttca taaactataa aaaaatacta 360 

aatggttaaa atagacttgc aggccaatct taaatggggt gggaggggtc tgagggtggg 420 

atggggaaag ggaaagaggt tttgatntaa acaaaacaaa tgcactttgg gtgtgtnnng 480 

gnattttnct ggggatanan ggggtggggg nnnnngnann nnnnnnnnnn nnnnnnnnnn 540 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 600 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 660 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 720 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 780 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 84 0 

ncn 843 

<210> 407 
<211> 743 
<212> DNA 
<213> Homo sapiens 

<400> 407 

tgggcnggnn ctantngngg gctctcgact tcntacganc ttccaatggt tnggnggcac 60 

gagcccccat cttcactggt tattccactt atttaaaatg tccagaataa gcaaatctcc 120 

atatagagga agtagattag tggttgcttc gggatgggag gaatgggaag attgaggtct 180 

ttcttttgca gtgataaaaa tgtcctaaaa ttgactgtag cgatggtcac acaactctga 240 

atatgcttaa gaccattgaa ttacacactt tacgttggtg aattgtatgg tatgtaaatt 300 

atagttcaat aacatagtta caaaagataa tcaaaagcat gaaagcactg ttgatgtggt 360 
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ttggatctgt gtcctcaccg agtctcatgt tgaaatgtaa 
gggattatgg ggcagagtcc tcacaaacgg tttacaccac 
gatattgagt cctcatcaca tctggttgct tcaaagtgtg 
cctnctgctc tggccatata agatgtgcct gcttctcttc 
gtttcctgag gcctncctag aacaaaactg ctgtgctttc 
ggagccaatt naaccccttt tctttataaa aaaaaannnn 
nnnnnnnnnn nnnnnnnnnn nnt 



gccccctggt gggaggcgat 
ccgctcagtg ctggtctcct 
tggtgcctcc cctctatctc 
gccttctaac atgattgnaa 
tgnncccatc tacaggaccc 
nnnnnnnnnn nnnnnnnnnn 



420 
480 
540 
600 
660 
720 
743 



<210> 408 
<211> 746 
<212> DNA 

<213> Homo sapiens 
<400> 408 

tgtccgnttc ttangntgng ctctngcttc ctaggagtnt cnaatcgctt ggtgcagacc 60 

tccagcacaa gcctctgcta gtngatctca cggtagaaga aggtcaaaga ttaaaggtna 120 

tttttggttc acacactggt ttccatgtaa tngatgttga ttcaggaaac tcttatgata 180 

tctacatacc atctcatatt cagggcaata tcactcctca tgctattgtc atcttgccta 24 0 

aaacagatgg aatggaaatg cttgtttgct atgaggatga gggggtgtat gtaaacacct 300 

atggccggat aactaaggat gtggtgctcc aatggggaga aatgcccacg tctgtggcct 3 60 

acattcattc caatcagata atggctgggg cgagaaagct attgagatcc ggcagtggaa 420 

caggacattt ggatggagta ttnatgcata agcgagctca aaggttaaag tttctatgtg 480 

aaagaaatga taaggnattt tttgcattcc gtgcgatctg gaggaagtag cccaagtgtt 540 

tttcatgacc ctcaacagaa attccatgat gaacctggta accagaagaa ccccttggca 600 

cttatcttca tggcgttatt ctaatttaaa aagaacataa ctcatgngga cttatgccca 660 

gtctagaggc agaatcagaa ggcttgggtg gaacatatcg ntttcctttt tcctttcctt 720 

cggccctncc agnccagtcc atnttt 746 

<210> 409 
<211> 761 
<212> DNA 

<213> Homo sapiens 
<400> 409 

ggatccggtt tccaatgctc gggcnctcga gctncctaag annttgctaa tgcttggngg 60 

ngtgcctcgt gtgcgtggat tcgtgtgtgt gtgtgtgtct ngtatatgtg tgcgcagagt 120 

gcatcatttt cagactctac tatttccgtc aagtttctgt ttgatttgga tcatctcagg 180 

atcggattct gttttagagt gtttctgggc caggatccgg gcccctgccc tcctctgcac 240 

ctgaccacac tccctactca gggctagtct gttcttcccg gacatcttct ggtagccgtg 300 

caggagaggg ctgggtgggg cagagccagg aggggacctg gtgtgtcacc tgcccaccac 360 

ctggctcatc cctcangccc accctgaccc tacattacat aggttacgtc agcctactgt 420 

ggctgttgag caaagcattt ctcctttctn gggcctcatt gcactagatg ggcctgtggt 480 

cccaaagtag gtcagtaggt tggggttgct gacacccctt gggtgcaact ttgggacaag 540 

atgaantggc tctgtcctgt cactggcctc tccttgcctg ggggctatgt gcacttcaaa 6 00 

accctggcca agctcaagcc catgaagnat tggagaacac cctgggcccc caagaactgg 660 

angcacccgg ccanttcccc tgggattcca nctttgccan ggtgaaccct tcttttaccc 720 

naaacttntg tccccctgnt tccacttcca aaaanaactg g 761 



<210> 410 
<211> 748 
<212> DNA 
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<213> Homo sapiens 



<400> 410 



gatgccggtt cctatgatgn 
aggncttnta aatatatctn 
gaaaaancag ctttgtcctg 
anaggagtcc gggagatgtg 
tgccaaaaat gtggatttgt 
tctagagata aagaactata 
aaacntttaa tgccaaccca 
atgcacactc ttagggaaaa 
atttacaagt tggaaatttt 
atcacagtat aaaattctct 
tctgagaaaa atggtggcag 
aacttcctcc agaatnccag 
gcccnaagng ggaaaaaaaa 



gctctcggct tcctaggagt tccaanactn ggctngcncg 
ggntttanta ggtgataagt nctgtcantt agtancatct 
ggtgaaaaag gatgccaaaa ttgcctggaa aagagcagtg 
tgatgcntgt gaagcancat tgtttancat tcactgggtc 
ggtctgctta gattgttnca aggcaaagga aaggaagagt 
tgcttggatg aagtgtgtga agggacagcc tcatgatcac 
aattatacct ggttctgttt tgacagatct tctagatgcc 
atatggtatt aaatcccatt gncattgtct aacaaacaga 
cctncatgaa tggtgtatct caagtttaca gaatgtctta 
gngcatgcct gagtctcagc gccaaaatcc tcctccgaag 
cnnccccana aagtgatgtt nggcnccaga ttaccaggtt 
tcaccactgn actggntagc anatcttgcc gagccaaaaa 
aaaaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
748 



<210> 411 
<211> 773 
<212> DNA 

<213> Homo sapiens 
<400> 411 

gnangnnngn ttcntagtgc ccgtgggagt cttagatncc ctaaaaaatt gntaatgctn 60 

ggtcggcacg agtcaaggcc tacgaaacag gtgatgcact accccggcta cggttccccc 120 

atgcctggca gctnggccat gggcccggtc acgaacaaaa cgggcctgga cgcctcgccc 180 

ntggccgcag atacctccta ctaccagggg gtgtactccc ggcccattat gaactcctct 240 

taagaagacg acggcttcag gcccggctaa ctttggcacc ccggatcgag gacaagtgag 300 

agagcaagtg ggggtcgaga ctttggggag acggtgttgc agagacgcaa gggagaagaa 360 

atccataaca cccccacccc aacaccccca agacagcaat cttcttcacc cgcttgcaac 420 

ccgttccgtc ccaaacagag ggccacacag ataccccacg ttctatataa ggaggaaacc 480 

gggaaaagaa tataaagtta aaaaaaaagc ctccggtttc cactactgng tagacttcct 54 0 

gcttcttcaa cacctgcaga ttctgatttt tttgttgttg gttgttctct ccattgctgn 600 

tggtgcangg aagtcttact taaaaaaaaa aaaattttgn gagtgactcg gtgtaaaacc 660 

atgttanttt taacagaacc nanaagggtt gncctattgg ttaaaaaaaa aaaaaaaaaa 720 

aaacttngng cctttagaac tattanngag nccnatttac nttaatccan net 773 

<210> 412 
<211> 774 
<212> DNA 

<213> Homo sapiens 
<400> 412 

gnanncegga ttcntagcgn tcgtggaagt geateggctg ntaacaattt getaatgett 60 

ggagttccaa ttccagagaa aatgagtgaa tgggcacctc gacctccccc agaaatttgt 120 

ccgagatgtc atgggttcaa gtgctggggc cggcagtgga gagttccacg tgtacagaca 180 

tctgcgccgg agagaatatc agegacagga ctacatggat gccatggctg agaagcaaaa 240 

attggatgea gagtttcaga aaagactgga aaagaataaa attgetgeag aggagcagac 300 

cgcaaagcgc eggaagaage gecagaagtt aaaagagaag aaattactgg caaagaagat 360 

gaaacttgaa cagaagaaac aagaaggacc cggtcagccc aaggagcagg ggtccagcag 42 0 

etctgeggag gcatctggaa cagaggagga ggaggaagtg cccagtttca ccatggggcg 480 
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atgacaatgt ttgccacagc ctctgcctgg aacctggctc gtgctgtgac cagaagggaa 540 

aggcggctgt ttggctcttt cttccccgca aggaccccgc ttacccgctg gatggagagc 600 

aaaggagacc cccttccgag cccgntcaca gtcctgtatt tggcaaggtt tgggaacctg 66 0 

aaggggccaa tntnccttga cacttanang cacttgcctt tcagacacca ttccgngcnt 720 

ctggtaaaag gggacaagaa aagccttaac cttggcnnca tattttgaca gggg 774 

<210> 413 
<211> 773 
<212> DNA 
<213> Homo sapiens 

<400> 413 

gnngnnnnnn tttctaatgc ttgggnnnnn ngtcnatgcn taagagccan gcggntcgaa 60 

ttcggcacga ggcgggcccg gccagcggaa gcccctgcgc ccgcgccatg tcaaagaaaa 120 

aaaggactga gtgcagaaga aaagagaact cgcntgatgg aaatattttc tgaaacaaaa 18 0 

gatgtatttc anttaaaaga cttggagaag attgctccca aagagaaagg ctttactgct 240 

atgtcagtaa aagaagtcct tcaaagctta gttgatgatg gtatggttga ctgtgagagg 300 

atcggaactt ctaattatta ttgggctttt ccaagtaaag ctcttcatgc aaggaaacat 360 

aagttggagg ttctggaatc tcagttgtct gagggaagtc aaaagcatgc aagcctacag 42 0 

aaaagcattg agaaagctaa aattggccga tgttgaaacg gaagagcgac caggcttagc 480 

aaaagacttt cttcacttcg agaccaaang ggaacagcta aaggcagaag tagaaaaaat 540 

ncaaagactg tgatcccgca agttgtngga agaaatcgcc aagcaaatna agtagcccaa 600 

ggaactgctt acagatggac tgattacata ttcgcaataa aatcttnggc ccaaagaaaa 660 

atttngggtt tgaaggaaaa ttaaattggt tngaaccttt tggaatttcc cgaaagactt 720 

ttgcctncnt ngacttaaaa tatttccatg gnggtgaaag gttgtccaan ctt 773 

<210> 414 
<211> 755 
<212> DNA 
<213> Homo sapiens 

<400> 414 

gnagnnnnnn nttctaatgc ttggggnnnn nngtcaatnc ctnngancna ggcggntcgc 60 

tcatccagaa angtcagatc ancaaagaag tccangaaaa antgcgaccc agctngaccn 120 

tttgatccca ggcttagcac acgattgcat ggcntcccct ttagccactt naaccactgc 180 

agacntccag gaagctggac tctctcctca gtccntccag acttctggcc accacagant 24 0 

gaaaacccca ttttcaactg agctatcttt gctccagcct gatactccag actgtgctgg 300 

agatagtcat accccactgg ctttttcctt caccgaggac ttggaaagtt cttgtttgct 360 

agaccgaaag gaagaaaaag gggattctgc caggaaatgg gaatggcttc atgagtctaa 420 

gaagactatc agagtatgga gaaacacacc aaactacctg gggacaaatg ctgtcagccc 480 

ttaggcaaga ctaaattgga aagaaaggtg tctgccaaag aaaacaggca ggcccctgtc 54 0 

ctccttcaaa catacaggga atcctggaat ggagaaaaca tagaatcagt gaaacaaacc 600 

cgtagtccag ttctgngttt tcctgggata tgaaaagaat gaccanggac tnctggagtc 660 

aacttttcac ttgaagaatc tcaaggccac cggtcattgg ccacacactn gaactccttt 720 

ttaagatgta cccattactg gaattgggct taggg 755 



<210> 415 

<211> 852 

<212> DNA 

<213> Homo sapiens 
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<400> 415 

gnagnaannn ttctaatgct tgggnnnnnn ngtcaaacct tannaacctg gcntgncgaa 60 

ttcggcacga ggtcacaggc aggtttantg gccagtttaa aacttatgct ntctgcgggg 120 

ccnttcgtag gatggtgagt gtttccctgg gctttgctca tcacttcggg acatcgtgga 180 

ctttaccgtg cgcnttggag tgtgtgatgg tgcctgagta gatctgctgg cagagtagtt 240 

tgagccagct ggactgggct ggccgcctgc cgcttcttga gggtggaaga ggggtgctct 300 

gagaagacac tcaggcagca gactctgcct ctcactagga ggtgcccccc cgaccccgtt 360 

ccaccatagt caaggctgca ggctgccccg ggagaagtgg ctccccttct tgcgcctgtc 42 0 

ttccattcgc ttcaccgggg gganaagacn ttgggcttgg ttggcacagc ntgacccttc 480 

tgcccatctt naaggcagnc ccggaantgg gaaaaatatt tctttaaatg gtggcctttn 540 

nttttttttt nctttnaaag gggttgaagt tccannaatg natttcccaa tttccttccc 600 

gaattgggnc ccaaagggcc ccaatggggc antcggtcct ttaaaaagna acctttttgg 660 

acctgggaag aagaaaatca cccagattgt tgggaaatat tttggncatt aaaataaant 720 

aatggaaaac ctnaaaaaaa aaaaaaaaaa aaaactcgag cccnttaaaa acttttagtg 780 

agtcnnatta ccnttanatc canacnttga tangaanctt tggataattt tgggncaaac 840 

cnnaacttng at 852 



<210> 416 
<211> 754 
<212> DNA 
<213> Homo sapiens 

<400> 416 

ggnnnnnnng tnaaaccttc cnaannaggc tnggcgtcac tgnccccggt caacaaaccc 60 

acttttatga cagttttctt ccgcagcttg gctnttaaat tttactggca ggtgtatggt 120 

tgttggaggg ttcctagtga gttgggggac ctggcantan agctgcttgg ttggaggaag 180 

tgaanctggc ttantaccag cagctgatct cttccacgtg ctgctgcttt ttttgccact 24 0 

ctgatactaa accagagaaa gctgcaggtg gataaagaag ctgtggctgt tttttgcttt 300 

tgggtggcaa tgagaaagag tcacagtgtg ggttaaaggg atctgcagtg gggccaagga 360 

tgccacccca ccctcagctg tangcaagct tgcacataaa taacccccgt cagtggagtg 420 

ttcgggatgc agggggcant atagtgttct tggactttgt ccgtcctggg gcagttttta 480 

agttctttat atttaagtgg ggtcagtgcc aagtgctacc actttcccaa taaangaatg 540 

ggggacccan aaggctgggg tccctggcta ccttgttatg aaggttttgn tntttctctg 600 

acaaganttg ctttggaaag ancctgtttt taggggatta ttttttgnat accccgatgg 660 

gganccaggg ttctnctcaa aacccttaca acccttagga tcatagggaa aaggggcccn 720 

tnttttnctg ctggcttncc caacttaaaa acnt 754 



<210> 417 

<211> 755 

<212> DNA 

<213> Homo sapiens 



<400> 417 

ngtntatagc ttnctaatgc ttcntancga attcggancg agagaagccn tgagcagcaa 60 

agtctntcgc gacaccctgt acgaggcggt gcgggaagtc ctgcacggga nccagcgcaa 120 

gcgccgcaag ttcctggaaa cggtggagtt gcagatcagc ttgaagaact ntgatcccca 180 

naaggacaag cgcttttcgg gcaccgtcag gcttaagtcc actccccgcc ctaagttctc 240 

tgtgtgtgtc ctgggggacc agcagcactg tgacgaggct aaggccgtgg atatccccca 300 

catggacatc gaggcgctga aaaaactcaa caggaataaa aactggtcaa gaagcttggc 36 0 

caagaagtat gatgcgtttt tggcctcaga gtcttttgat caagcagatt ccacgaatcc 420 

tcggcccagg tttaaataag gcaggaaagt tccctttcct gtnacacaca acgaaacatg 48 0 
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gtggccaaag tggatgangt gaagtncaca atcaagttnc aaatgaagaa ggtgttatgt 
ctggctgtan cttgttggtc acgttgaaga tgacnngacg atgaancttg gggtataaca 
ttcacctggc tgtcaacttc ttggnggtca attgcntcaa agaaaaaact tgggcagaaa 
tgttccnggc cttatnttnt caagaaccnc catggggcna agccccaacg ccctttnttt 
aaaggcncat ttggaattaa attcntnttt ncccg 



540 
600 
660 
720 
755 



<210> 418 
<211> 757 
<212> DNA 
<213> Homo sapiens 

<400> 418 

tggggnntnn nttctaatgc tgggatgttc taaangntgg gctactcgtt ctttccgcag 60 

gancccntcg attcgaattc ggcacgagga aaggtggcgc gcttctcacg gctgagttgc 120 

tgcgcctgca gacggaagct ccccacaggc agagctgctt ggatgtgtga gtcatgaagc 180 

cagagaagcc ccgctccatg agcagtgact ccccaggccc tgtgacctcc ctcctgtctt 240 

gcagctcctc ctggcaccag tccccagggc tctcctgttg gtagttcctg cttttcttct 300 

tggaaattcc tcgtggacct cgagatcttt accctaaaat agttctgttg aatttcaccc 360 

tggcaatgta aattgatagc ttatcttcac agatgccaga caatggacaa ctcaccatca 420 

gtcctctgct cacctgagac aaatgcatgt ctgattgctt cctctgccct attgnttatg 480 

tgaaaatgca gattcactga gccagactaa ggcatcagtg actgttcctc tactgcctct 540 

cacatggaga ttgtgtattc agtgaaaggc tgatcaaaga ccccaaagga atgcaccagt 600 

ttatctctta tctacctatg acctgcgagc tgnccaccac ccccagttgt tgcgcctttc 660 

cagacagaac cagtgtcatc ttacacgtat taattggatg tcctgngnct tccttaatat 720 

gtatcaaaac aagctngcct tgaacacctt gggcacn 757 

<210> 419 
<211> 738 
<212> DNA 
<213> Homo sapiens 

<400> 419 

gnnngncgtt cnaattncgn ggnntctttc tngccnanna nnannngcgt gngngaattc 60 

ggcacgagac tgttcatcct aagttccact ataaacaggc tcatgactcg ggcacagaca 120 

cttcttgcgt gactttttcc tatgatggta atgtccttgc ctctcgtgga ggtgacgatt 180 

cattaaaatt atgggacatc cgacaattta ataaaccact tttttcagcc tcgggtcttc 240 

ccaccatgtt cccaatgact gactgctgtt tcagtccaga tgataagctc atagtcactg 300 

gtacatctat tcaaagagga tgtggcagcg gcaaacttgt tttctttgag cgtaggactt 360 

tccaaagggt gtatgaaata gacatcacag atgcgagtgt tgttcgctgc ctgtggcatc 420 

caaagctgaa ccagatcatg gttggaactg gaaatggatt ggctaaagtc tattacgacc 480 

ccaacaagag tcagagggga gcaaaattat gtgtggttaa aacccancgg aaggcaaaac 540 

aagctgagac tctactcagg actacatcat cacccctcat gccttgccta tgttcccgtg 600 

agccccgnca acggagtaca aaggaaacag ctggagaagg acagactgga tccctgaagt 660 

cgcattaacc tgaacctcct gtancangcc cangtcgtgg tggccgattt ggaacccacg 720 

ggggcactnt tttttcct 738 

<210> 420 
<211> 739 
<212> DNA 
<213> Homo sapiens 
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<400> 420 



gcgntnntat tagcgtgggc 
acgagaatca gaggaggctt 
gcagaagaag atgttgttca 
aaactgtctg tgaaattgaa 
gactttgatg tgtaactgtg 
gtcacctgga aagaacagag 
ttgatagact tagcctagta 
tagcatcaaa aacttctggg 
tcactttcaa agagcatcta 
tcagaacaga tgcttgaaat 
catgaagaaa atgaagctgc 
gatttnggtt cacccttggt 
aattttacag tccgggaan 



tcgntctcgc tcnacncanc nngngctggn cgaattcggt 
cttcatcctt caactccatg atgaactcct atatgaagtg 
ggtagctcag attgtcaaga atgaaatgga aagtgctgta 
agtgaaagtg aaaataggcg ccagctgggg agagctaaag 
ctgttgatga agtcctccca gggaagcctg tgcagatgca 
attccctttc acctacctca gcaaaacaaa ctttcaagtc 
attttatagt gagagtttca aactatatat caagtgtcta 
ggcgtggggg aaagtagaat accaagtata atagttacat 
tgaatttgcc ttttgtaact tactgtggct ttaaacatat 
atgcacttag cactttggtt ccacatctgt ctgggtaaac 
tgcctcaatc gancccagac agcagccata ggcagataaa 
ggtgggaggc atcgtgtgtg cctttttttc ctctaatatc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
739 



<210> 421 
<211> 727 
<212> DNA 

<213> Homo sapiens 
<400> 421 

gtgatctttn tgagtggggg ccntnctngc tctannanat aggttnggng ggctagcgat 60 

ttctacctgc gctactacgt agggcacaag ggcaagtttg ggcacgagtt tctggagttc • 120 

gaatttcggc ccggacggaa agcttagata tgccaacaac agcaattaca aaaatgatgt 180 

gatgatcaga aaagagctta tgtgcacaag agtgtaatgg aagaactgaa gagaattatt 240 

gatgacagtg aaattacaaa agaagatgat gctttgtggc ctcccctgat agggttggcc 300 

gacaggagct tgaaattgta attggagatg agcacatatc ttttaccaca tcaaaaatag 360 

gttctcttat tgatgtaaat caagtcaaag gatcctgaag gccttcgagt attttactat 420 

ttggtacaag acttgaaatg tttagttttc agtcttattg gattacactt caagattaaa 480 

ccaatttaaa ttgtatgttt tcaagctggt tgnatattta attaaaggga tgggaagggg 540 

ttatttgtca tttacagtat tggggtttta tgaatgtgaa gcaaccaaaa aaaattnnaa 600 

tgtaaaactg gaaaatagga aaattcatta ncagcttaat gggtatcctt acttgatncn 660 

ctgggtttgg aagtccccac acacattaaa tctgtaatga aancnctttt ggttaaaatt 720 

tctctat 727 

<210> 422 
<211> 753 
<212> DNA 
<213> Homo sapiens 

<400> 422 

gtntngnnng nngttnnatt atatggntcg nctnnctcna nnancnangc ttgngctgac 60 

aacttgattg ggttctcctt caggtttgaa gcgccctcna gaagtgtcta aaggagacag 120 

ttgatagcca aacaacagtt ttggattcac tgactgatta tgaaagaagc agtagactgg 180 

tatcaagaat cagtcagcaa ggaggccctc accagacgcc agtgccatgt tcttggactt 240 

ctcagcctcc atattcatga actaagtttt tggaatcctt aggcttccac gtgtggaaag 300 

cctgagctaa cctactggag gatgagccat cacctggagc agattcaggc catcctagtt 360 

gaagcctccc taggccaagc aaccgtccaa ctaccagaca ttgaccattc agccttgaac 420 

attcagcaca aagacaaaac agaccagacc agaagagtcc cacagaatag gggaaactat 480 

tcagagaaaa cttaagccac taagttttat ggtgttttgt tcttgtagcc agaagcatag 540 

gcatactggc caatacaaac cgaaatcctt ctaacgtant ggaccctttt caggccagca 600 
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ttttttccct tgaaaacctg ggagccttgt attccatctt attagcagaa gatcactttc 
accaatggtt tgggctcttg atttggaatt gatgatgtaa tgagcctnta ttcnanatgn 
gacttaatac ctctgcgaat tgactggatt ccn 



660 
720 
753 



<210> 423 
<211> 844 
<212> DNA 
<213> Homo sapiens 

<400> 423 

nggnnnttnn nnnnnatncc ntgatcgtgt ntcgttcttt ctncaggatn nnntcgtttc 60 

gaattcggca cgaggaaaag ggagccgcgc agngcctacg ggagtncggc ggcagcagcc 120 

ggtaccggca accacgggca gctctcaggg aatctccgtc gttgaggcca naggctccag 180 

tccccgcgag tccagatgcc tgtccagcct ccaagcaaag acacagaaga gatggaagca 240 

gagggtgatt ctgctgctga gatgaatggg gaggaggaag agagtgagga ggagcgganc 300 

ggcagccaga cagagtcaga agaggagagc tccgagatgg atgatgagga ctatgagcga 360 

cgccgcancn agtgtttcag tnagatgctg gacctggaga agcagttctc ggaagctaaa 420 

nggagaagtt gttcaaggga acgacttgan tcanctgccg gnttgcggct tggaaggaaa 480 

ntgggggggc ttgaanaaga agcccctgga atnccaccgg aagccccctt ttgggggggg 540 

gccttgcaaa ccgggaancc ctttnaaagg aatttcngcc antttcaang gttgggccaa 6 00 

ggggaatcnt accnaagggg ccttctnggc cttggnatgg tgaatccang gnaaattaag 660 

gtncccaatt gntgaancct tccaanggga ancccaaacc agcacccttg naanaagttg 720 

agaaaacttg cttgcntctt ntgacacccc tncnaggggg aacttcaagg aaccggttcc 780 

tnaggcttgg aaggaggacc cccananccc tggancctaa attnttaaat gggtnggacc 840 

accn 844 

<210> 424 
<211> 799 
<212> DNA 

<213> Homo sapiens 
<400> 424 

ggagnnnngn ntccnaattn nntgggnnnn nnngtcaaan nctngctact cgttctttcc 60 

gcaggatccc atgcgattcg aattcggcac gagcccagac ctatggagtc agacagtagg 120 

tttgaggccc agcaatctat ggtttaacaa gccatccagg tgtttctgat gcacagtgaa 180 

attggggtac cactggtatt aggtttggta tggcaacttt ttcatcactt gttttatgta 240 

gttgtctgat caattgtgaa aacataatga atgttggaaa tggaacagta aaataacgaa 300 

agccaacttt tttttttttt tttgagacgg agtcttgctc tgtcgcccag gctggagtgc 360 

agtggcgcga tctcggctca ctgcaagctc cgcctcctgg gttcacgcca ttctcctgcc 420 

tcagcctccc gagtagctgg gactacaggc gcccgncacc acgcccggct aattttttgn 480 

atttttagta gagacggggt ttcaccgtgt tagccaggat ggtctcgatc tcctgacctc 540 

gtgatccacc cgcctngggc ttccaaagtg ctgggattac aggcgtgagc caccgggccc 600 

gggccaaaag ccaactcttt atgcctagaa aatattgtgc accctatgac ccaagcccat 660 

tgaatttttn cngggaaatt tatggtaaat tattgaaatg gatggtacct ttaaaaagtt 720 

atttggcaca ttccccttgg gttacctttg gnatggtttg ccagggaatt naaaactttg 780 

ggntnaaacc ttttttann 799 

<210> 425 
<211> 750 
<212> DNA 
<213> Homo sapiens 



104 



WO 99/33982 



PCT/US98/27610 



<400> 425 



gangccggat 
ncccatgcga 
cctcagaaca 
cacatcagtg 
aacaagatac 
tagaggtgta 
ctggaacgtc 
ccacagtgag 
ggactccggc 
tagaactttt 
taccaacaag 
ctcctgaaaa 
aaaactttan 



tccaattntc 
ttcgtggagg 
tgctgagcac 
tttaaagaga 
atactactga 
tatttttcat 
tggtttctgt 
acctggagcc 
ctgtttctac 
tgatagtgtg 
aatgtnatag 
aaaaaaantt 
gggncgttta 



nggctnctct naaannctgt ntaatgcttg gtccgcanga 
tctcctttcg ccccagccca ggtggccaag cccatcctgg 
attttgtagg gtggcacctt tttatccaag ttactagcta 
aaaaagtgac ctttcatttt tttttcttga aacttgagga 
tttttttttt cttaaaacta aatgcatgac tgcagagcgg 
actgtggggc aaagtatttg tgctgctttt tggagatgga 
ccccgggccc ggcagctacg tctattttct gtagaaggtg 
accccttcct gcctggcgcc gtttanagct gggagcccgt 
cttctattca accactctga cgtggggaga caagaagaaa 
gtaaaaacat tggattttga actattttag taaaaggagt 
gtgctacttt gagctagata aataaaggct ctttgtgagc 
nnnnnnnnnn atnannnnnn annaaaaaaa ctggnccttt 
cctanaccct 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
750 



<210> 426 
<211> 819 
<212> DNA 

<213> Homo sapiens 
<400> 426 

gnagnncggn ttcttatgat cgtggctnct cntctanngg ttgtgtaatg ctnggtcnnc 60 

angannnnnt gcganncgaa ttcggcacga aggggggttc ccaatagtag aaaagggtcc 120 

ccattcctgc tcagcaccgc acctctctac ccccccacag acacacatgc agacacacac 180 

atgcagacaa cacgcagaca cacacatgca ggcactcaca tgcaggccca tgcacacaca 240 

cgtgcacaca catgcagaga catgcagaca cgcaggcaca catgcacaca tgcaaagaca 300 

cgcatgcagg cacacgcaga cgcacacaga gacacacatg cagatcacat gcacacacac 360 

atacacacac tggcccctgt ttttctgtgg tgtcactggg tgccagcaac tcggtatctn 420 

ccaccttcca ctaaaacctg ggccttaatt tctctcccgt ccccacccct aaattcctga 480 

tggatgaacc tagagctgtc ctgtccactc caggccggac tgacgtancc tatgggccca 540 

gcaggtccag ggcccacgtt ttaatttctt tttnaaaagc tttaggtctt ggccnggccg 600 

ccggtggttc acgccttggg agttcccagc atttttnggg aaggccnaag gccgggttgg 660 

attcacaaag gtcaagcaag tttcaaggaa ccaagccttg aaccaggcca ttgggtgagg 720 

aaccctgggc ttnttactng ggnaaattcc caaaaaaaaa ttggccttgg gccnaagggt 780 

gggcaagggc acccttgttg gggtccccaa antttacct 819 

<210> 427 
<211> 750 
<212> DNA 

<213> Homo sapiens 
<400> 427 

gagnnngatt cnaattnctg ggctnctctc ttnntatnta atgctgggtc cgcangancc 60 

nntgcgattc gaattcggca cgaggtccaa ggacaacttc gagacatttc tttttgccac 120 

cgtatctaac agggagcagg aagatctctg ccgaggaatt gtccagctct gcttcaatga 18 0 

gcaaagccaa cagctgctag cagaggtcca gccctctgac tctttcctca tggtagagac 240 

aactgcatac tttgaggcct acaggcacgt cctggaagga ctccaggagg tccaggagga 300 

agatgttccc ttccagagga atatcgtgga gtgtaactct catgtgaagg agccaaggta 360 

cttgctaatg gggggcagat atgactttac ccccttaata gagaatcctt cagccactgg 420 

ggaatttcta agaaatgtcg agggtttgag acatcccaga attaatgtct tagatcctgg 480 

ccagtggccc tcaaaagaag ccctgaactg gatgactcca gatggaagcc ttgcagtttg 54 0 
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ctctcacaag ggaactggct attattcaag gaccttctgg aacaggcnaa acctatgtgg 600 

gtctnaaaaa ttgttcaagc ccttctacca acgagtcttg tttggcaaaa ttaaccttca 660 

gaaattccca tcttggttgn gtgtatacta atcatgcttt ggaccanttc tggaangctt 720 

ttccattgtc agaaaaccan atttggccgg 750 

<210> 428 
<211> 943 
<212> DNA 
<213> Homo sapiens 

<400> 428 

gnngnccggt ttcctattct cnggcanctc tcttcctncn acctattanc tggactctaa 60 

anaaaagnnt gnngcggttg gctcaagggc caccanaaca tttctttatt attattattt 120 

tttaacctgn acatgcntta aagggtctat tacctttctt tccgtctgtc tcaacagctg 180 

aaatggggcc nccaaggagt gccttccttt tgctccctcc tactgggact gacggntggn 240 

antgtntgnn cccanntggg ggtgtctcct gnctgggaag ganggaaagg gaggcanagt 300 

tttgccgggg ttgcanntng acancangct gnanaggana tggctaataa ctgtttaatg 360 

gaaacctgct tgggcttgga nggaacttag nctgaatttt cccgacttcc tctgccagtt 42 0 

attgacacan tctctttnta agacangaaa taaactaaac cccaccccaa ggnantnatn 480 

ncangcngaa aacnncncat ngcccacatt ncctnatccc ntancaccnn ctcnttnttt 540 

nncccaanac tncttcccan ntntcnccnt ttacccntan ncntnnttnt atcccnctaa 600 

tncctnanim ccntnnttnt ccnnatnctt acncncncnn ntnnnncccn nntctttnnn 660 

cccaaanctn ncctcncnnt tcnnctnaac cntntnnnca nnanacaccc ttctnatnnc 720 

ccannntctn cacnntnnnt ntctccnnnt nnncnccnnn ntcntnnnna nancntntnn 780 

nanancnatc tnntncncnn cnantnnnnt tcanttcacn ctctnnnnnn tntanccnat 840 

tnnccntcnn tnnccnntta nnncntnnnn nncnaantnn nnnnanctct ncnnncnnct 900 

canntnnnnt nnnnnncnnt cnanaccntn nnnntctatn ccc 943 

<210> 429 
<211> 775 
<212> DNA 
<213> Homo sapiens 

<400> 429 

gnangnnnnn nntttctaan tncttgggnn nnnngtcann gattnngcta aaggttngga 60 

tcncncgcag naangctgtg gcgctccatt gtgaaagatc caggcatttt tccgagccag 120 

gaaaagccca agatgactac aggatattag tgcatgcacc ccaccctcct ctcagtgtgg 180 

tacgcagatt tgcccatctc ttgaatcaaa gccagcaaga cttctctgct gctgtgatct 240 

gcacaccctc caacctgggc agggactggg gggatgcagt gtgtgttagt gcccatgtgg 300 

cattgtggca ctgttgcccc ccatggcggc atgggcaaga tgaccttcca ttagcttcaa 360 

gtcttgttct cttgtctgtg gtctgtttaa tatgtgggtc actagggtat ttattctttc 420 

tcccatcctt acactctgga tcattgtgca gacttaatca gggttttaac gctttcattn 480 

nnnnnnnnnt ttttttgagc tcaaagaaag ttctcatttt ccctattcaa ctaataccca 540 

tgccgngttt tttaccttgg atttaaaggc accttangtt ggggcaacag attctcactc 600 

atgtttaana cctggnattc ancttcataa gaccaaagan ggagctttcc ctttctcttt 66 0 

acccctnagg attctcatcc tttacanntn gactttttcc aggccaattt cccatnnaat 720 

ctgcnanncc cngccttttg ncccaagctt ttntgntngn ccccccattt acccn 775 

<210> 430 
<211> 763 
<212> DNA 
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<213> Homo sapiens 
<400> 430 

ngggtgnnnn nntttctaat nctggggnnc nntnnncnnn ntttcctaat ncttaggngc 60 

tcgttctttc tccangcagn nnngcgtttc gcgacagctc tccaatactc aggttaatgc 120 

tgaaaaatca tccaagacag ttattgcaag agtttaattt ttgaaaactg gctactgctc 180 

tgtgtttaca gacgtgtgca gttgtaggca tgtagctaca ggacattttt aagggcccag 240 

gatcgttttt tcccagggca agcagaagag aaaatgttgt atatgtcttt tacccggcac 300 

attccccttg cctaaataca agggctggag tctgcacggg acctattaga gtattttcca 360 

caatgatgat gatttcagca gggatgacgt catcatcaca ttcagggcta ttttttcccc 420 

cacaaaccca agggcagggg ccactcttag ctaaatccct ccccgtgact gcaatagaac 480 

cctctgggga gctcangaag gggtgtgctg agttctataa tataagctgc catatatttt 540 

gtagacaagt atggctcctc cgtatctcct cttcctagga gaggagtgtg aacaaggagc 600 

ttagataaga caccccttaa acccattccc ttttccagga gacctaccct tcacaggcac 660 

aggtccccaa atgagaagtc tgctacctca tttctcatct ttttactaaa ctcaaangca 720 

ntgacagcag tcagggacag acattcattt cttnatacct tec 763 

<210> 431 
<211> 761 
<212> DNA 

<213> Homo sapiens 
<400> 431 

tggtgttnnn ntcctaatgc ttggnngnnn ggtnannctt ctaattactt tggggctcgt 60 

tctntctcna ennngenngg cgttncgaat teggcacgag ettgaagege tggtttttct 120 

cgaagcaatc cttattatat tgttaaacaa ggaaagatca accagatggc aacagcacca 180 

gattctcaga gattaaagct attaagagaa gtagctggta ctagagtgta tgacgaacga 240 

aaggaagaaa gcatctcctt aatgaaagaa acagagggca aacgggaaaa aatcaatgag 300 

ttgttaaaat acattgaaga gagattacat actctagagg aagaaaagga agaactagct 360 

cagtatcaga agtgggataa aatgagacga gccctggaat ataccattta caatcaggaa 420 

cttaacgaga ctcgtgccaa acttgatgag etttctgeta agegagagae tagtggagaa 480 

aaatccagac aattaagaga tgctcancag gatgeaagag ataaaatgga ggatatcgaa 540 

cgccaagtta gagaattgaa aacaaaaatt tcagctatga aagaagaaaa agaacagctt 600 

aatgctgaaa gacaagaacn gattaagcag aggactaant tggagcttaa ageccaagat 660 

ttacaagatg aactaccggc aatagtgaac aaaggaaacc gttttttaaa agaaangcen 720 

aanctgcttg aaaaaaaaaa aaaaaaactc ggectntaan t 761 

<210> 432 
<211> 748 
<212> DNA 

<213> Homo sapiens 

<400> 432 
gnngantnng tcttattatc gtggngctct 
aatteggcac gaggccaccg aagcttcagg 
tcaatgagtt aaagactgea gaacaaatca 
tctttgtcaa gattgtgggc cattatgett 
gccacttcca agaaagatcc ttctgtaagg 
tgaagaagtt tgtgaagaca cagctcttct 
agaatcctcc tgcaggctat ttccaacaga 
agaagaaacc aagggaaaaa actgtgaaat 



nactnnctct aaatanaatt 
atgacatctt agactctctt 
acgagcatgt ttcaggcccc 
cctatatcaa gegggaagea 
ctctgacctc caagaccaac 
cacttttcat ccaggaagcc 
aaatacttga atatgaggaa 
aagagctgtg gtgaataaga 



gtgttgnggg 60 

ggtcagggga 120 

tttgtgcagt 180 

aatgggcaag 240 

cgccgatttg 300 

gagaagagca 360 

cagaagaaac 420 

atgactagag 480 
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ctacacacca tttctggact tcagcccctg ccagtgtggc 
cttccaaaat ccatatcctc actctgagtc ttggtatcca 
gtctgagatt tggatccctg gnattggatt tcttaaggac 
catgcttcac agaacttggg cttcanaagc ttcanttttt 
ggaaaacagt tntncttgtt ttgtannt 



aggatcagca aaactgtcag 
ggtatttgtt tcaaactggt 
ttttggangg ctcttgacac 
tgcanaggtg ccccaggtta 



540 
600 
660 
720 
748 



<210> 433 
<211> 769 
<212> DNA 
<213> Homo sapiens 

<400> 433 

gggnaaaagt ttnnnannng ggnagnnnng ntnnaccntt cctattactt tggagctcga 60 

actcgcncca canannnagt gncntgngct gttttgcaga tgaggaaaac tgaggtacag 120 

aattcttagg gaacttaccc aaaatggctt ttctgcactc tgccctttgg tattgtccca 180 

tgtgaattgt ttaaaactta tgtgtatagt ggcatgagta ggtgatttca gaaacagaac 240 

tcacttttgt tgtttggtct taaaattagg aacttttctt catctgggct tcatttccct 300 

gcaccttccc agctttctag tcatgcaagc cacatgtctc cacgtgaggg gttcattgga 360 

aagcagccac agagccaccc cctggctggg ttcttcccca gctctgcttc ctccttcccc 420 

aagtcctgca gctgctctct ccatggcaga accacttctc cccttactgg aggggaggtc 480 

cactgaacaa atccaggaga ggaatcattg tgttttccac agaagagaaa gtacactgga 540 

ctttctgtgc aacctgttac tacattttca caganactca tatttgtgca ntgtaactca 6 00 

atttgaaacc cagcaaaatt aggctcccgt gtctccataa aaggccacca tgatggtaac 660 

cgttggactt caccttgtgt ttnggacana ngctgattgg attttaccca tcatcacanc 720 

cgtgtcttac attctcnttt cctgggcttt ggacccctgn tanaaaaan 769 

<210> 434 
<211> 764 
<212> DNA 

<213> Homo sapiens 
<400> 434 

ctanccttcc taaannctng gctactcgnt ctttctnnan ganncnnntg cgatncgaat 60 

tcggcacgag caccttgcct ggccaagggg ctagacctcc caggctaagc ctcagattca 120 

gtgcaggaca caagctcatg cccccgtctt gccagtgaca cttgaagcct cccgacttcc 180 

acagagtgct tcaggacaca ttttgagtgg tattttcttt tctttttttc ttcttttttt 240 

tttttgagat ggagtctcgt tctgttgccc aggctggagt gcagtggcct gatctcggct 300 

cactgcaacc tctgcctccc aggttcaagc gattcttctg cctcagcctc cagagtagct 360 

gggactatag acatgcacca ccacgcccgg ctaattttgt atttttggtc gagacggggt 420 

tttgccatgt tagtcangct ggtcttgaac tnctgacctc aagtgatcca ccactcggcc 480 

tccaaagtgt tgagatgaca ggcacgagcc accagcccaa cctgagtggt attttcttta 540 

gggaccangt agactttaaa acgagggtaa gagaaaaagc ccagtggtct tttctgangg 600 

taaataaatt tctgcccagg aaacnttncc aagccccaac cagcaagcca acccttaaaa 660 

aaaaaatcac ttcgtgttcc ccaangggan ctttnttaaa gctttggggg cttccaggna 720 

aaatcatttc cagtnnaant ttggaagaat tcannagnat ttnt 764 

<210> 435 
<211> 755 
<212> DNA 

<213> Homo sapiens 
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<400> 435 

gnnnnntttc taatgtggnn nngnnngnta annttctaaa ncttgggntc tcgttctttc 6 0 

tncagatncc ntcgattcga attcggcacg agggatcctt tccagacaga agaccccttc 120 
aaatctgacc catttaaagg agctgacccc ttcaaaggcg acccgttcca gaatgacccc # 180 

tttgcagaac agcagacaac ttcaacagat ccatttggag gggacccttt caaagaaagt 240 

gacccattcc gtggctctgc cactgacgac ttcttcaaga aacagacaaa gaatgaccca 300 

tttacctcgg atccattcac gaaaaaccct tccttacctt cgaagctcga cccctttgaa 360 

tccagtgatc ccttttcatc ctccagtgtc tcctcaaaag gatcagatcc ctttggaacc 420 

ttagatccct tcggaagtgg gtccttcaat agtgctgaag gctttgccga cttcagccag 480 

atgtccaagg gtgcctgggg aagagccact gcgcatgtta tctttggtgt tactccagtg 54 0 

ttgaacanag agctggtcag aggcagtgca tcgcanagag acattaataa gggaatcctt 600 

tgaatcccta ancagcanca gctttnctga nggggccnat gatgccagtg acctnttcan 660 

ggnaagtctg ggacattggg accaccctgg ggggaagaac ttgtgggatg tggcttttct 720 

tttatgaata aagtactttg agttggttgn aatcn 755 

<210> 436 
<211> 760 
<212> DNA 

<213> Homo sapiens 
<400> 436 

aaggctggnn nnngnnntgc nnnncttcnt attantctgg gggctcgtnc tctctcnann 60 

nagnnaggcg ntgngaattc ggcacgagct caagaaaagg agaaagtttt tttgtatgaa 120 

attggaggaa atattgggga acgctgcctt gatgatgaca cttacatgaa ggatttatat 180 

cagcttaacc caaatgctga gtgggttata aagtcaaagc cattgtagaa gacttaacaa 240 

gctgcagata accatgtgga cttctgtcat aattcttgct gagtcaagag tgtaaataaa 300 

agaaatggca ggactcatat tattcagttg tcccaagtat ttaaaaatga ctctcttaag 360 

ccttaaaaag tcatagattt gtgctgctgc cagaattata ttaattatta ttaatggtat 420 

tattagaaaa aaaatttctg gagtgagagt naagangctt aattagtttg tgggcagttt 480 

tcatatgctc tgtgaaatgt gtccagatgt gacataagtt ttttttttta atatggngga 540 

aatgncttct ctttcccatt cttttctcct aaaaatcata tatactggga atatatgcct 600 

ctnttacctc tattaccctc ctcacattta ccctttccca gttnggtttt gctttttnac 660 

caaaaagatt ccaatnccna ggtattggca agttntnaaa accgcccntt aaacatccct 720 

aatttcncag nattccnnnc ttgccaaatn ttngtntcnn 760 

<210> 437 
<211> 748 
<212> DNA 

<213> Homo sapiens 
<400> 437 

ggnnnnngnn ngntnncgtt ccctattant caggngctcg ntctntctcn annnancnng 60 

gcgtgtncga attcggcacg aggattttcg aaactcttca gctacttgcc cttttttatc 120 

tgaaaccatc ataccttctg aaagaaaaaa gcatatcttc attgacataa cagaagtgag 18 0 

atggcccagt cttgatacag atggtccatg atatatatgg agagtggcat tgtgaagata 240 

acatctttag atggtcatgc atacctctgc ctgcccagat ctcagcatga atttacagta 300 

cattttttgt gtaaagttag ccagaagtca gactcatctg cagtgttgtc agaaacaaat 360 

aataaagccc caaaagataa actagttgaa aaaactggca aaatctgtat acgtggaaat 420 

ttaccaggac agagactgaa gaataaagaa aatgagtttc attgccagat catgaaatcc 480 
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aaagaaactt taaagaagat gagttgtgta aatggaactg aagggagggg aagaactgcc 54 0 

ttcgcctggt acaaagcaca catgtgtata cacatgggtc aagcagtgct ggtctgtggc 600 

tgnctgtcca gangaatgga aatatccttg gctttagcac ttcattttca taataaaatc 660 

agcaattntg tctaaaaaaa aaaannnana aaaaactnga gcctntanaa ctntagtgag 720 

tcgtattacg tagatncnna catgataa 748 

<210> 438 
<211> 823 
<212> DNA 
<213> Homo sapiens 

<400> 438 

taatccttnn tattgntcgg gtactngntc tntctcnaag annntntcgt tncgcccagg 60 

tagctgagac tacccacacc ttggtcccag ctacttggga ggctgaggtg ggaaaatcac 120 

tttgcccagg aattcaaggc cgcagtgagc tatgattgca ccactgcact ccaggcaaca 180 

gagtgagacc ctgtcttaaa aaaagaaggg agaaagtgtc agatggtgat gaggtctggg 240 

ggggaaatag agaatgggga tcaggagtgt ggatggtggt attccctcac caagaggtga 300 

catgttgagc agggaacttg ggaggtgagg gtgtgacccg tgtggaaatc agggaaaagc 360 

attncagcct gagggacagc caatgcanag gccgtgaggt ggccagtgcc actgagcagt 420 

gagcttggga tagggggcan gtgangaggc tggagagcgg ggtcagacaa accaatatgc 480 

ttatttaaaa caaggttgtt ncagcaccct tgccttaaag ccttgagcct gnaanctnga 540 

aaaatttggg cacnttcaaa agcanggang gaaaccaaaa gaagattggg agggaaaagc 600 

ccttncnttc ccttancagg aaatgaagtt nccacccttn aaaacaggnc caggaccttt 660 

ttgggaccct tttggccttt tggttcctta gaatcctctt ggtngcttnn gaatnaaaag 720 

gnaaaagggg cctttaaggg gggatcccat tntttccaaa attcaaaggg ggctttccct 780 

gggcttaccc aaaatttctt ggncttaant aaaaaaattt ntt 823 

<210> 439 

<211> 767 

<212> DNA • 

<213> Homo sapiens 



<400> 439 

gnnnnngntt ctaatgctgg nnnnnnnngg taccctttcc aaaacctggg ctctcgntct 60 

ttctncangn agccnngcga ttcgtctgtc tggtgatttt tattttaagt gaacctttgg 120 

atctatcttt aactctcttt attgtgagtc taaattccaa ttctgcagca gatcagtaaa 180 

ctcacagtat ttttcctgtg gaaatctatt caataaggaa accaagacag gatantaaaa 240 

tttaaaaaaa ancaactttg aattcccctg cctaggtctt ccagttgttt tccagcgcat 3 00 

acctcaggta tgactttgct agccggggac aaaattagca ccttccgatt ctctagtcca 360 

aatgaacttt ggctaaataa aaaattatta tactacataa taaagttnca gatagcagga 420 

aatgcaagag ctaggagatt cctagattat atctggccaa gccaaatacc ttaaacatcc 480 

acctggaaat cctctacccc ctcttctgag ataatttgcc cagccctttc ttcccacaca 540 

ctcactcaat gtcaccccct tctaatcccc aaaactgttt ttgtggcctt ggtagcctat 600 

agtagtttct cacatctttt cccctanact tttctgtttt cagtttcaga ccaaaaaaac 660 

tcttcaactt ttttccagtg gggtcttcct taccagtaac tttaccactt gnaatcttat 720 

ttcattgaaa aaaccttaaa tgggntggga aaaggcttgc cnncann 767 



<210> 440 
<211> 752 
<212> DNA 
<213> Homo sapiens 
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<400> 440 



nagnnnnntt tctaatgctt 
tttctccaag atncnngcgn 
tcatcaagag gaccatcttg 
gggatttttt gtctgaaaat 
tagttcagca cttgatccat 
tgttagacat catttatatg 
gagtaaagga ccaggtgaag 
aagaaaattc ttcagagagc 
aatgcagtct ggattcgaat 
ctcctacgtg gtgtctactc 
cgcaatacac cgcttcttgg 
agaagagatc attttagatn 
aaataaaaat tcctacagtc 



ggnnnnnnnn tcnatgcttc caaaagcngg gngctcgttc 
tncgaattcg gcacgaggat ggatgagact gttgctgagt 
aaaatcccca tgaatgaact gacaacaatc ctgaaggcct 
caactgcaga ctgtaaattt ccgacagaga aaggaatctg 
ctgtgtgagg aaaagcgtgc aagtatcagt gatgctgccc 
caattcatca gcaccagaaa gtttgggatg tttttcagat 
atgttgacct ttttgatatg aaacaattta aaaatcgttc 
attaaaaaat gtgacagtca" gcttcagaga aactgaggag 
tcctggggaa cacagtacac aaagccaaac cagtacaaac 
ccagactncg tacgccttca cgtnctcctn catgctgang 
gtcangaatt agaagctact gggaaaatct accttccgac 
taccgaatga anaaagcttg cattagtgac attgaaaggg 
naaaaaaaaa at 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
752 



<210> 441 
<211> 775 
<212> DNA 
<213> Homo sapiens 

<400> 441 

gnagccngat tccaaaacct gggnnccgat ccaatgcttn ccaattactt gggagctcnn 60 

actngcncna ncaanctngc cntgcgaatt cggcacgaga agnaggcgga gcttgcagtg 120 

agctgagatc gcgccactgc actccagcct gggcaacaga gtgagactct gtctcaaaaa 180 

aaaaaaaaaa aaatggaacg cagggcaaga actcgtnttt ggaaggagat gggggaaagg 24 0 

ancggtatta tacctatgtt gnatttgcag gcaaatgaga tgganccctc tctgtaaaga 3 00 

agagtcattt gtgcaagtag acggggtctg tgggtgcang ccctggaggg gcacacaatt 360 

gcctgnangc ttctgtgana tcgggagang gaggagaagc agtctcttga caaaataaag 420 

tatttttatt cattngtatt tattaaatga aaaaacaatc ccatggtgtc ccttgtgtgt 480 

ggtggaacct aatgactgtt gaaataaagt ctgngttttc ccttcaaaaa aaaaacncnn 54 0 

anaanaaaaa ctcgagccct ntaaaacctn tngngagtcc gnattaccnt anatcccnga 600 

cnttgataag gatccattga tnaantttgn cccaacccca actnngaatg ccnngaaaaa 660 

aaattgcttt atttgggaaa tttgcnaatn ctttgcttta ntttgnaccc antttancnt 720 

cannnnccaa gttacnancn ncaattgcnt tcatttangg ttcaaggttc aaggg 775 

<210> 442 
<211> 804 
<212> DNA 
<213> Homo sapiens 

<400> 442 

gagnnngntt ctatacctgg gnncgatcca aancttncct attaccttgg atcttnngct 60 

atctcnaann aaaangcttn cgaattcggc acgaggccac ctgcactgag gtctgggccc 120 

ggggacaggg tgctttagcc aggcttgtct gcgcctcagg gaagggtgag cagcccaggg 180 

accagatgca agttggtggg cccctccacc cntccnacgc cactccccag tgtgctgggt 240 

cctaaccagt cgtcctatgg gagcagtcag ccttcctctc ctcctcaggg cagctctccc 300 

acctgctgnt ccccgcacac agaacctcat tgctctgagc agttgcttat tacccagttg 360 

ttgaaaaact agcatgtgan ggccgggcgc ggtggctcac gcctgtaatc ccaacgcttt 420 

gggangccaa ngcgggtgga tcatgangtc aggagatcaa gaccatcctg gcttaacacn 480 

gtgaaaccct gtctctacta aaaatataaa aaagtancca ggcgtggtgg tggcccctgt 540 

agtnccaann tacttgggaa gctnangcan gaanaatggc ntgaacccaa gaaggaagaa 600 
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cnttgcantg aancttaaaa ttgcgcccac tggaatttca aaccttgggc cnanaanaat 

tgaagaatcc cgtcttaaga aaaaaggaaa aaantttncc nttntnaaag gcccggccac 

aantnggctt taacgccctg gtaaatnccc aancactttt tggggaaggc ccaaaggcaa 
ggccnggatt caattttnna aggg 



660 
720 
780 
804 



<210> 443 
<211> 786 
<212> DNA 
<213> Homo sapiens 

<400> 443 

gnagccggat cttattattg gcnncgnttt aatgctggct aatntntcgt aatncttggt 60 

nnccccaann anrmaggngg ggngaattcg gcacgagcac catttttatt ttgatgctta 120 

cactcattta ttctgttttt gtaaaacagt ttcaagaatt taaaaatcct tccagttaat 180 

agagcttttg ttattatatt ataattttgt aaacccactt tgtttttccc actttaaagc 240 

cacagggtcg actcatggat gatacctcta ttgctgctgc atgatgttca agaccggccc 300 

ttggctgttg ttacagagat gttgggcaga gctatgcagg tgtttcattg ngaactctag 360 

ctttgatcat ggtaaaaagt taaccctttc tattttttaa tggatgttat accaactatt 420 

cagaggactc atacttcaaa aatattagga aaatctgtct tatagttctc taataaatat 480 

ctgaaatctc aagtacgaca tgaaagaatg tcagaccatt gntattggtg aaagtcattt 540 

gatgaatggn aaattctatg aaaagtaagt ggatttgcat ggattaatat cagggaaaat 600 

ttaagccttc ccaagtgtga ctgggccaaa gagagccaga tgcccccagt gcctgtgccc 660 

ataaagttcc cgaatccccc aatggggtct nttttcaaaa acttggncca gacccggaaa 720 

ataaaancat tcntcataaa ttcaannggg gncctcanga aacacnttcc cccancaacc 780 

cttngg 786 

<210> 444 
<211> 760 
<212> DNA 
<213> Homo sapiens 

<400> 444 

gnagnccggt tcnnangcnt nggctnnatc caatgctggc taaagttcna ananctggca 60 

acnccaggan ncangcgttg cgaattcggc acgaggagga attacaggta gcaaattatg 12 0 

gagttggagg acagtatgaa ccccattttg actttgcacg gaaagatgag ccagatgctt 180 

tcaaagagct ggggacagga aatagaattg ctacatggct gtttnatatg agtgatgtgt 240 

ctgcaggagg agccactgtt tttcctgaag ttggagctag tgtttggccc aaaaaaggaa 300 

ctgctgtttt ctggtataat ctgttgccag tgggagaagg agattatagt acacggcatg 360 

cagcctgtcc agtgctagtt gcaacaaatg ggtatccaat aaatggctcc atgaacgtgg 420 

acaagaattc gaagaccttg tacgttgtca gaattggaat gacaaacagg cttccctttt 480 

tctcctatng gtgnactctt atgtgctgat atnccatttc ctagtcttaa ctttcaggag 540 

tttacaatng ctaacactnc atgatngatt cantcatgaa cctcatccat gttcatctgn 600 

ggcaattgct taccttgggg gntcttttaa aaagtaccac gaaatcatca tattgcatta 660 

aaacccttaa aagttctggt gggnatcaca gaagacaagg ccnaanttna aagnggagga 720 

attttattat ttaaaagaac cttttgggtn ggatnaaaan 760 

<210> 445 
<211> 761 
<212> DNA 

<213> Homo sapiens 
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<400> 445 



tggtgccggt tcttantctg 
gccccnaggn tcccangnag 
aaagaacttg gtcataaata 
gtagccttca gttttgtgaa 
tttaaccaat gatggtgttt 
tggtggtcat tgatggcctt 
cttttaacca caatgaattg 
tggcattgct tccttaacca 
tattttttaa atgtaatgtt 
agtgccagtg gtagtaactg 
cacanccttc tgcctctgna 
aatntnaaaa tcatattntg 
aagattggat ttttattact 



ngctctcgtc ttccttctta tacctgggca 
ccnngcngng ncngattcgg cacgagattc 
tgataatgag aagacaaagt atttatatta 
aatagttttc agcacagaaa ctgacttctt 
gcttctagga tatacacttt aaaagaactc 
tagtaaattg gagctgctta atcatattga 
tccttaatta ccaacagtga agcactacag 
gctcatggtg tgtgaatgtt ataaaattgt 
atataagatg atcatgtgat gtgtccaaac 
tgtaaagttt ctaattcaca acnttaattc 
tttggaagtt gtcagtncaa ctcatcaaag 
ggaataattt ccctcttttg tagtctgccc 
atttaaacca gtggattaat n 



ncncttggcg 
caaaggtttc 
aaacagttta 
tagacaaagt 
actgtcccag 
tatctaattt 
gaggcaactg 
cactcagata 
tatggtgaaa 
ctttaaaatn 
aaaactgcct 
aagatcctta 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
761 



<210> 446 
<211> 770 
<212> DNA 
<213> Homo sapiens 

<400> 446 

tgggnnngnn ccnaangcng gggannnggt ccccgttcca anactggaan ncttggcann 60 

cgaactcgct cnannagnaa ggccgggnga attcggcacg aggccccgct ccatgagcag 120 

tgactcccca gctcctcctg gcaccagtcc ccagggctct cctgttggta gttcctgctt 180 

ttcttcttgg aaattcctcg tggacctcga gatctttacc ctaaaatagt tctgttgaat 24 0 

ttcaccctgg caatgtaaat tgatagctta tcttcacaga tgccagacaa tggacaactc 300 

accatcagtc ctctgctcac ctgagacaaa tgcatgtctg attgcttcct ctgccctatt 360 

ggntatgtga aaatgcagat tcactgagcc agactaaggc atcagtgact ggtcctctac 420 

ctgcctctca catggagatt gggtattcag tgaaaggctg atcaaagacc caaaggaatg 480 

caacagttta tctcttatct acctatgacc tgcganctgc caccaccccc agntggngcg 540 

cctttccaga cagaaccagt gtacatctta cacgtattaa atngatgtcc cnggggctcc 600 

cnaanangna tcaaacaagc ngggcctcga ccaccttggg cacatatccc nanggacatc 660 

annctggagg ctngngncac tggcattggc cctnaccctn ggcaaaataa accttctaaa 720 

attggnaaaa aanaaanaan aaaaacctng nnccctntna naacnntacg 770 

<210> 447 
<211> 757 
<212> DNA 
<213> Homo sapiens 

<400> 447 

tggtatnntt tnaangctgg nngnnnggcn ccgttccaat gnctggganc nttggcaatc 60 

gctctttccg nangatccca tcgattcggt ctgatgcagg agaattgcta aaacccagga 120 

gggagaggtt ncattgagcc gagattgcgc cactgcactc tagcctgggc gacagagcaa 180 

gactccgtct cgaaagaaag aaagagaaag gaaattcccc agggaagtac ctcggcttat 240 

ttcataaaca ggtactgaag gaagcagagg catgtggagg acttccccac ctcgtgcagc 300 

tatttgggcc gtggcatctg aaatttctta tttcagagtc acccctttga tgaccttggc 360 

agtgaactgc agtcatctgt ttaggccttt ccatggccca cgtcaatgcc ggtatttctg 42 0 

tttgttgcac atttgatttc cttgttgttg gcatttagaa ggccccccgt ttcccagatc 480 

acaccacggg catggaccac agagattgca tcttgtgagt ctgtagaaat ggtcaaggcc 540 

ttgtcctctc ttaagtccag agctcangtt aatgcaaaat tttnccggnc atctgtgctg 600 
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aaatcccttt ggggaagctc ctggctggtt tcctgtaggt aggacagcta cacgtnctgc 
cctttattgg cttcttttca tgaagctcct gccatntacn aaacatgtct cccttcttga 
atcacatctc tggtattgna actctanaat cgcccgg 



660 
720 
757 



<210> 448 
<211> 770 
<212> DNA 
<213> Homo sapiens 

<400> 448 

gggtgnnnng tttctaatgc ttggtngnnc nggnccnacn tttctaatgt tcggaanggc 60 

ttggctactc gntctttctn cangnagccc ntcggtncga attcggcacg aggtgtcttc 120 

atcttaccca gtggaaccta agaaattaaa ttctccagaa gaaactgctt ttcagacacc 180 

aaaatctagc cagatgcctc ggccttcagt gccaccatta gttaaaacat cactgttttc 240 

ttcaaaatta tctacacctg atgttgtgag cccatttggg accccatttg gctctagtgt 300 

aatgaatcgg atggctggaa tttttgatgt aaacacctgc tatgggtcac cgcaaagtcc 360 

tcagctaata agaagggggc caagattgtg gacatcagct tctgatcagc aaatgactga 420 

attttctaat ccttctccat ctacctctat tagtgctgag ggtaagacaa tgagacaacc 480 

cagtgtgatt tattcatgga ttcagaataa acgtgaacag attaagaatt tcttgtcaaa 540 

acgggtgctg ataatgtatt ttttcagtaa gcacccagag gcctncattc aggctgtttt 600 

ttcagatgcc caaatgcata tttgggcatt agaaaggtct gtcgcactta gtagcagcat 660 

cattttacag aggatagatt tggagttgtc cagacgacac taccagctat ccttaatact 720 

ttgttgacac ttgcaagang cagtcngaca agtactttaa cttcctcatg 770 

<210> 449 
<211> 792 
<212> DNA 
<213> Homo sapiens 

<400> 449 

ngagaaangt ttctaatgct ggnnnngnna gntcancctt tctaatgttc taatacttgg 60 

ganntcgaac tntcncnaca cagnnangcn ntgcgaattc ggcacgaggn cnnctcnatn 120 

atnacttgnt cncancggnc tggcatcnac ncgncacacc tacntnagcg cnttgtagcg 180 

caatatncac ctnntnaaac ccnnnagtcc cagggctctg ccnnnnnact gntcaactga 240 

cnaacnacnn nctancncaa cntnnnntta ngccnctgnc tgnctctatg gcacctnncc 300 

tnccntcncn cntnacccnc tacgctcagg gctatataca atgggaacct tnccaacagt 360 

aanccntgga tctnaggnat ggcccttgnc tggcggatca cagccttnna gcntatcagn 420 

atcttgagga agacaccatt ccgtcccnga ttntgaccaa ncnctcggat gtgnctatgg 480 

gctcnattga ggnacaacaa ctnncactgc nnataggcca tcctcnnnan nctacacatg 540 

ngactttncn nnncatntna aatgnnnana tgtctctcnc aagcatcacc cnctgtccct 600 

ncgncntcnt ggaagacctt ctgnncaact ganctccttc ntgnnncnnn ngattnttnc 660 

nnncnnaata tncntncccc aatgnccttg tnnngnattt atnangggnt ttccaatttg 720 

ggntaattca ntncccnccg nannctannn ncccatnaac cntcngngcc ttcttgnaac 780 

cttttnncct gg 792 

<210> 450 
<211> 848 
<212> DNA 

<213> Homo sapiens 
<400> 450 
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gnatgncccg atttccttaa tgatggggnn nnnnngagcg anncttccga aanttccaat 60 

annctgggng ntcgcaactc nctcnanaca gnaaggncgn gggctttgct ctctccattc 120 

caagttgntc tctgttctag aaagcagatg tagtagacat ctactgttgt tgcctgaaca 180 

gaatcccttt gtcctttttt tgntaaaagt actcatccct aatattcatt gtnctggaag 240 

gactgaaaat acagaactca caccatgatc ggccgggaca atcagattat ttcattccnc 300 

agcaaacgga gatcganccg aaaagtggaa anatgagcnc ttctttggng ttggcatatg 360 

gaccctgaga gaaagaactn tnattntttc tcttggactg caataaagta tagctgccta 420 

aaatacgntt cctgacactt ggaggnttgt ccacaatcgg ngaaataaag gcgagaccgn 480 

acactggatg aaaaaaanaa gnnnccngnn gaanacccac tnnnccannn nccnnnccnn 540 

tncnccanng nnganccnnn tanccgnnan naggccnnng cnntngcnnc nnngccnnnn 600 

nnnnnngggn aaacccnnnn gnnnnncnnn nnnnnnnncn nnnnannnnn nnncnncnng 660 

nnggnnctnn nnnnannnnc cccnncnncc cnncnccnnn nggnaanncc nnnnnnnann 720 

annnngggnn nnnncnannn ccnnnnnnnn cannncnncn cnnnnnggnn nnnnncnnnn 78 0 

nnnnnnnnnn ncnngngnnn acnnnngngn nnnnccnnnn nnnnnncngg nnnnncnnnn 840 

nnnncccc 84 8 

<210> 451 
<211> 765 
<212> DNA 

<213> Homo sapiens 



<400> 451 

gnnnnnnntt tcctaaatgc ttgggnnnnn nnngagngnn nttncnnagt ttcctaanta 60 

gcttnggcna ctcgttctnt ctncangcag nnnntgcgtn gncgaattcg gcacgagcat 120 

tcctcctttg ttaacgaagc aacatttaca caagatggac attacattat tagtgcatcc 180 

tctgatggca ctgtaaagat ctggaatatg aagaccacag aatgttcaaa tacctttaaa 240 

tccctgggca gcaccgcagg gacagatatt accgtcaaca gtgtgattct acttcctaaa 300 

aaccctgagc actttgtggt gtgcaacaga tcaaacacgg tggtcatcat gaacatgcag 360 

gggcagattg cagaagcttc agttctggta aaagagaagg tggggacttt gtttgctgtg 420 

ccctctctcc cgtggtgaat ggatctactg tgtaggggag gactttgtgc tctactggtt 480 

cagtcagtca ctggcaaact ggagagaact ttgacagtgc acganaagga tgtgattggt 540 

attgcacatc accctcatca gaacctgatt gctcctacag tgaagatgga ctcctaaagc 600 

tctggaaacc ataattcaac ttttcttttt taaatcaact cgaaagcatg tncttaaatg 660 

aacatattca tgtaangggc tttttttttt tgncactttt ctaagcaaat agatggctga 720 

attagtcacn gaataaattt gngaaaatca tggttaaatn ccaac 765 

<210> 452 
<211> 765 
<212> DNA 
<213> Homo sapiens 



<400> 452 

nnngnnnnnn ntttcctaaa tgcttggggn nnnnnngngn nnnnnttctn atgttcctan 60 

ngcnnnggng ctcgttctnt ctncacgnng ccngtgcggt gncggtctga ttgaaagctg 120 

ttcaggttta tcatgcaaat cctcgcctct ggctacggct ggctgaatgc tgcattgctg 180 

ccaataaggg gacttctgaa caagaaacta aaggccttcc cagcaaaaaa ggaattgtnc 240 

agtctattgt tggtcaagct atcatcgtaa aatagttttg gcatcacagt ctatacagaa 300 

tactgtttat aatgatgggc agtcttcggc cattcctgta ccagtatgga gtttgcagcc 360 

atatgtctca gaaatgcctt gttgctgctc ctgaagaaca gcaagatcca aagcaggaaa 420 

atggggctaa aaatagtaat caattaggtg ggaacacaga gagcacgaaa gcagtgaaac 48 0 

ttgcagcagt naaagccatg atggagatna attcattcca gcttcacctt cttctccatt 540 



115 



WO 99/33982 



PCT/US98/27610 



gagaaaacag gaattagaaa acttaaagtg ctccatactt gcttgcagtg cctacgtggc 
tctggctttt gggtgatacc tcatggcttt gaatcatgcn gatnaacttc ttcagcagcc 
caagctgcag gatctcttaa gttttgggac atttatatgc tgcagaaccc ttatcttctt 
cgacngaatn tctgtgccnt tctcacttga ccccgagaat gtnct 



600 
660 
720 
765 



<210> 453 
<211> 833 
<212> DNA 
<213> Homo sapiens 

<400> 453 

gngtnnnnnt ttctaangtt cntaatannn tnggctactc gttctttctc caggnatccc 60 

ntgcgattcg aattcggcac gagagaaacg ttctcaggtt gaccagctgc tgaatatttc 120 

tttaagggag gaagaactta gtaagtcatt gcagtgcatg gataacaatc ttctgcaagc 180 

ccgtgcagcc cttcagacag cttatgtgga agtcaagagg ctacttatgc tcaagcagca 240 

gatactatgg agatgaatgc actgaggacc catagaatac agattctaca ggggattaca 300 

agaaacatat gaaccttctt gagcacccca ggttttggca ttagaaaatg ggtacccctt 360 

ggttcaaaaa tgaacaaaga aagccttaga tttggatggg ggaacctgat ctgtccagtc 420 

tagaaggatt ccagtgggga aagtgtttcc atttccttng tcccctggct tggccaagga 480 

aagcgaaagc cctttcttga anagcaaccg tggatcattn gaccaggaac ttccttctgg 540 

ggtattaagc ttctttcaan tggaagggaa aggttccang gccaaaggaa aaatggaagc 600 

ccccaaccng atgggtttca ccctaantaa cctcaattgg aagggcttgg accaagaacc 66 0 

cnggaaaggc nanccattgc acccttaaaa ncaaggaaag tggaccacct ttggggcttg 720 

ncnttccntt ccgaaccagg ttgaaaangg gcttgaaaaa tggttgctta cccaaaaggg 780 

cgnacnttaa tggcaccaat tattcctntg gaccnttttt aatanccttt ngn 833 

<210> 454 
<211> 737 
<212> DNA 
<213> Homo sapiens 

<400> 454 

gnggnnnntt ctaagttcct aatnctgggc tactngttct ttctgcaggn atcccatcga 60 

ttcggcaaaa tcaatgtgga ctgaacataa atcacctgat ggaaggactt actactacaa 120 

cactgaaacc aaacagtcta cctgggagaa accagatgat cttaaaacac ctgctgagca 180 

actcttatct aaatgcccct ggaaggaatn caaatcagat tctggaagcc ttactattat 240 

aattctcaaa acaaaagaat ctcgcttggg ccaacctaaa gaacttgagg atcttgaagc 300 

aatgatcaaa gcttgaagaa agcagtaagc aagaagagtg caccacaaca tcaacagccc 360 

cagtccctac aacagaaatt ccgaccacaa tgagcaccat ggctgctgcc cgaagcagca 420 

gctgctgttg ttgcagcagc agcagcggca gcagcagcag cagctgcagc caatgctaat 480 

gcttccactt ctgcttctaa tactgtcagt ggaactgttc cagttgttcc tgacctgaag 540 

ttacttccat tggtgctact gntgtagata atgagaatac agtaactatt tcaactgagg 600 

aacaagcaca acttactagt acccctgcta ttcaggatca aagtgtggaa agtatncagt 660 

aatctggaga agaaacatnt taaccaggaa actgtanctg attttacttc caaaaaagaa 720 

gaagaggaga gccacct 737 

<210> 455 

<211> 718 

<212> DNA 

<213> Homo sapiens 
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<400> 455 

ggnnnnnntt tnnncngttn cntaaaannc tgggctactc gttctttctn cangnagccc 60 

ntgcgatncg aattcggcac gaggatgagg agtgtttaat cattgataca gaatgtaaaa 12 0 

ataatagtga tggaaagaca gctgttgtgg gttctaactt aagttccaga ccagctagtc 180 

caaattcttc ctcaggacag gcttctgtag gaaaccagac taatactgct tgtagtcctg 240 

aagagtcatg tgttttaaaa aaacctatca aacgagtata taaaaaattg atccagttgg 300 

agagatttta aaaatgcagg atgagctctt aaagccaatt tccagaaaag taccagaatt 360 

gcccttaatg aatttagaaa attctaaaca gccttctgtt tctgagcaat tgtctggtcc 420 

ttcagactcc tctagttggc ccgaaatctg gatggccttc tgcatttcag aagccaaaag 480 

gacgattgcc atatgaactt caggactatg ttgaagatac atcggaatac ctagctcctc 540 

aggaaggaaa ttttggttat aagttattta gcctgcaaga cctgttgttc tcgtcgctgc 600 

agtgtncaga ggatagagnc agaccacgtt ctaaaacnga gaaatcagaa gacatttnca 660 

gttatgtctc caaaagtgag tntcagctgt atgagttgac tctgctgaaa gtgacttg 718 

<210> 456 
<211> 739 
<212> DNA 
<213> Homo sapiens 

<400> 456 

gtggnnnntt ctnngtttcc aatangntgg gtctcgttct ttctnnacga tcnnntgcga 60 

ttcgcttggg aggctgagtc aggagaaatt gcttgagccc aggagatgga ggttgcagtg 120 

agccaagatc atgccactgc actccagact gggcaacaga gggagactcc gtctcaaaaa 180 

ctaaaaaaaa aaatncattt agtataccgg ggggtggggg ggagaaataa tgttatttcc 240 

tatgcgaaat gacgtgtatc cctgtaccca tgggtaaatg taaatatact gtgtctcttt 300 

tgggagagcc ttttagtaga ggagtcttat atgaagtctc tcataagtag ttcacttgag 360 

ttttgcagtt tgaaatctta aaggagcttt aattgacatt tattatacca attaagcttg 420 

gaatggggca atggatgcat ttccaaaacg tgtgaaagcc taacagctta tattgctgaa 48 0 

tgagaatctc ctgggtgtaa tttancactt agggaactgc gtgaacactc ccagccatta 540 

tgatgctggt accagcttta ntgtntaaat gccatganta ttctttctgn tctgttttgt 600 

gctctcttgg tncatttatt ttacccttta cngaataatt tcttgtaaaa tccntaaaaa 660 

tntttggcat ttaaaagtcc nntcttggan tnaananann nnnaanaaaa ancttncccc 72 0 

tttanaactt tngngggct 739 

<210> 457 
<211> 743 
<212> DNA 
<213> Homo sapiens 

<400> 457 

gtgnnntnnt tctnnngttt ccaattantc tgggngctcg ttctttctcn anncnnnnan 60 

tggttgncga attcggcacg aggnnanagg gnagctacat gnntnaccnt nttngnnctc 120 

tcagccangc tcnnctnnnn ctggtcctac tgctacatag aacacttgtt ntncnnggna 180 

actnntntat gtnnccnnga ntctctgnna ctngtttaaa tgctanttga taacaggcta 240 

tgcaaggnct gnaagtggan agcgtcatca ttcatcatnc ntnttanctn gantnnntgt 300 

atcctacatg ctttgattgg taaatgngcn tcagactggg actctcaata aatgnatata 360 

gangancttg ctgtggaaan ctgtcctctc ntatctntnc atgngnaant tccactncag 420 

tntgaactcc aaatgcnntn atnggnganc cctncttgta tagtggtgtc cattccaanc 480 

tgcnagggnc tagaaaccgt cggctntngg aaacnatggt gnnagttgan ctggtacang 540 

cngttntcac ctgcanctac cataaaatgg gnttacccaa gctttatcat ggaatggnta 600 

taaaaaacgc attnattgng cctttntaan cccattatnt gttnaatttn acttatggtt 660 
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ccccccattn aaattatnca attgggnann gangcttcna gtcnccatnt ttnaatggnn 720 
ttnncaaaaa aacgnttttt ttt 743 

<210> 458 
<211> 906 
<212> DNA 

<213> Homo sapiens 



<400> 458 

gnngnnnnnn nttnctaatg cttgggnncn cgtttctann nnnnnnnnaa nntttcctaa 60 

ttggttaggn gctcgnnctn tctccacnna gnnnngcggt gcgaattcgg cacgaggctg 12 0 

aatcaaggat cacaaactnc acatttngca cnttggtctn cacatncntg gttngggcag 18 0 

tcncagtnaa catggctntg gaaactnatn ttngnctngc ntcaaccatc tcgttcccng 240 

gggacccann ntccnnnatc ncgnnttncc tcgnnnatng gagngctnct tngnccannn 300 

atgggctccc nanaatangn ntnncnnngn nnatncancn ncngncaann ggntcnncnt 360 

nnrmnngccc tnntncctna tggnnngctn catgncccat nnnnnnggnn ancaataann 420 

naaanggtct nttcccncga nncccnnnnn ccnctaacan ngnacctcgc aaagggcccc 480 

aggcnttnnc tngnaaacca nnttngccaa nggtanttca aaggnngcct tngggacctc 540 

ccnannnngc cntggnnnta ccccgggnaa anggtngnaa acccnnccnn ngntgccnnn 6 00 

cccggncnng gaaanaaatt tccnnggnac ccagnntncc nccgnaannn anantannnc 660 

ccancccnaa cnttnngccc ncanccnttn gnnnntgnan tcnncnnncc ctttnnnntn 720 

nccaanncgg ccnggnnacn nncttnnacc tntttncncn naanngacnc caanntcctn 780 

nannaaaggg nggnnnnnnn nnncnttncc nnnggnagcc cnnnnnncct nncntnncnn 840 

aaaaattcnn cnntgnancn cccctnnnnt nangngnccc natnnnnnnn nnngnaaanc 900 

nnaccc 906 



<210> 459 
<211> 765 
<212> DNA 

<213> Homo sapiens 



<400> 459 

gnngnnnngg nttcctaang ctggggnccg ntctnnnnnn nnnnnnnngt tcctaaanac 60 

ctaggngctc gntctngctc cacgcagncn gggcgtgggc gaattcggca cgagcttctg 120 

ttgattggtt tgtttaaagt acctaagtac taccctttga ctccctacca aaagttcttt 180 

tgttttttaa acaactttta tttgtgactt actttcttga gaagngttct taatgaattg 24 0 

cataaaatag tggtagcagc ttatttctta agtncttnat tattggggct ttaccattca 3 00 

ggtcttatct ttaaccctta tttactcagt tttccatctg aatgatccta tctctaaatn 360 

aaggatttaa taaatgctgc aaattgtcca ctttgcaaat ngtccaaaag ctttagtttt 420 

ggaccttgng aacttttttt ttaataacac attatttggg cccggtcgtg gtggctcaag 480 

cctgtaatcg cagcactttg gaatgcctag gcagacagat cacttaangc ctggagttcg 540 

agaccagcct ggccaatgtg gtgaagacct ccggttctat tactaaaaat nctaaaaaat 600 

tancaaggca tggnggtgca cgcctgnaat ctcagctact tgagangcaa atcnggagaa 660 

atgcttgacc ngggangcan anatganccn anattgcacc actgcattcc acctggggan 720 

nanantgaga anctggctca aaacccaaaa acccaaaaaa aaaan 765 



<210> 460 

<211> 677 

<212> DNA 

<213> Homo sapiens 
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<400> 460 

gtttncgctg ggagccacca acatagcaga ttaccatgtg aagttgccac tgctgcatct 60 

cctgaaacct ggctgatggg agaggtctca ttttgtgtct gagaatgtcc aggttgtctg 120 

cagaccacag cactgatttc ccattagcag ttattatttc ctggccattt cttcctgaag 180 

gttttgtggt taaactccct gtcctcaata ttttatcagc agtagggctg tcattcttct 240 

ggttatcaac ctctacatta tgaagtaagg ttcaaccctt ctgcttttct caggccccca 300 

aaacggttcc tatccaatcg aacacaaaaa cgggtattga gaaggaattg gcagggctca 360 

gtggctgttt ccgttgctcc tacctcatgg agactcttac tcatgctgga tttattgaga 420 

gaacttctaa ctgaccactc acccccaccc actcttatgc agtctgttca ttcctgaaaa 480 

caccactttc atccctcctg cacacaaccc atgagggatt gctacttcct ataagattcc 540 

tcagtgagcc ttatagagtt gctgcgagaa ttacatttgg tcatgatgtc aagtgtctgg 600 

tatgtagctn atgcttattg aacacatagt aatttattgg aataattgnc atgatcactg 660 

gatgagaata tagcccn 677 

<210> 461 
<211> 787 
<212> DNA 
<213> Homo sapiens 

<400> 461 

gnnnnnnnag ggnnnngngg ggcctcncaa agcccgngcn acaggtcccc gttccaaagc 60 

ntgggnganc gcnncgcccc ancagnaagg cgggggaang cggcacgagg acatcatcnn 120 

cttattctag taagagaaag tacacagatt caactttaga gaggacnggg gggnnnncng 180 

gagcnaaatc aaggaaggan tatcacgngg ccncccnnga atataannnn gaagctgnga 24 0 

acagnaccat cagnaacann nnatggacag ctctgatggg gnnnatacca cggcactctn 300 

cnnaccnnng gnggaagcna tccggagnna tgactgangn gnaaagnggn nnactgnnag 360 

aanccnggng ngctaggann ctgggagagn cactttcang aagnnaccng gcgangagnc 420 

atcanaagaa cccgganaag ngagaagacn ggaaaaagnn cncancgnac ngagcccagn 480 

nannnncnct gagccanggg ctncgaaang ccccaccnga agcnccatca canggnacaa 540 

ggnnngggaa aaggaancna cnnngcngac angnccnccn aanagngcca aancacngcn 600 

nngcccncnc gcccaaagaa nacnggacng cnggcncnna ncanaaggag cncnanggcc 660 

cnnggnaang aaactncnag nagcccaanc ccaaaggccc cnanggannn ccnncaaggg 720 

gaaaacanna nncacccaag gggcctgggc naanaaggcn ncccacncng gcccnccnnc 780 

nnnaccg 787 

<210> 462 
<211> 747 
<212> DNA 
<213> Homo sapiens 

<400> 462 

ctaatggctt ggnnnnnnng nnnnccgntt cttaattgnc ttgggcnnct cgctctntct 60 

ccannnagnn nntgcgttng cgaattcggc acgagcctca gccccacacc agctctattt 120 

ca 99ggtgag agtcagagag cactgcaata tgtgcttcat gggatttcga ttcgaagatc 180 

ctagaccagg gagacactgt gagccaggga tacaacaaaa tactaggtaa gtcactgcag 24 0 

accgacctcc ctgcagtttg ggaaagaagc tgggtttgtg gagaatcaga gcatcttgac 300 

atgactgctg acctaaagat ccctggcatt ggccagggat cctgtggaac ctcttctagt 36 0 

tcaggggtgt gagcattaga ctgccagttg tctagtgaca tctgatgctt gctgtgaact 420 

tttaagatcc ccgaatcctg agcacctcaa tctttaattg ccctgtattc cgaagggtaa 480 

tataatttat ctggatggaa attttaaaga tgaatccccc ttttttcttt tctnctctct 540 

tttctttcct tctccctttc ttctttgcct tctaaatata ctgaaatgat ttanatatgt 600 
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gtcaccaatt aatgatcttt tattcaatct aagaaatggn ttaagttttt ctctttagct 
ctatggcatt tcactcaagt gggacagggg aaaaagtaan tgccatnggc tccaaagaat 
tnntttatgt tttagctatt taaaaaa 



660 
720 
747 



<210> 463 
<211> 750 
<212> DNA 

<213> Homo sapiens 
<400> 463 

tncctttcta angcnnntng nnaanngtcn ccgttctaan tncttgggca gnncgctctn 60 

tctncannca gncnntgcgt tgcgaattcg gcacgaggcg agatgaagct acactgtgag 120 

gtggaggtga tcagccggca cttgcccgcc ttggggctta agaaccgggg caagggcgtc 180 

cgagccgtgt tgagcctctg tcagcagact tccaggagtc agccgccggt ccgagccttc 240 

ctgctcatct ccaccctgaa ggacaagcgc gggacccgct atgagctaag ggagaacatt 300 

gagcaattct tcaccaaatt tgtagatgag gggaaagcca ctgttcggtt aaaggagcct 360 

cctgtggata tctgtctaag taaggattcc atatggctct catatcattc cattccatct 420 

ctgccaagat ttggataccg caaaaatttg tgttngngga agattctgnc tgaactcttt 480 

cattcaagga actactacca tgaatctgca ttctgntgcc cacactgagg ncttagtaga 540 

taattgggtg gtctgaaaca cctattatct cttatntctg gtctctangc tggnatgtta 600 

attcctctga aatgntaaaa gtaatgggtg anaccngaaa aagaaatttc aatnacagat 660 

caanntgggg ngcatgtatn attttcaagc gtcaaaatgg aataagggaa gantnctgga 720 

tacctgcttg gaaaaggaag natgtgtatn 750 

<210> 464 
<211> 748 
<212> DNA 
<213> Homo sapiens 

<400> 464 

gnngtgtctt tgnaaagcct ttggggaann gncnccttct aatgcttggc tatcgntctt 60 

tacgcagnnc ccatcgattc gaattcggca cgaggccggc cggcgacgct ggcgacgctt 120 

tcgcccctga ggtagtttgg cgaccgcgaa gaaggaaaaa gggcgggcgg gcggctgtcc 180 

tctcaccgtc ctcaccccgc gaggcccggc ccgctcctcc gtcgtggatt tcgcggcgat 240 

ccccccggca gctctttgca aagctgcttg aaacttctcc caaactcggc atggatacga 300 

ctgcggcggc ggcgctgcct gcttttgtgg cgctcttgct cctctctcct tggcctctcc 360 

tgggatcggc ccaaggccag ttctccgcag gttggntgct tctttcgttc tctcctctgg 420 

gggctctgaa gtttcaccag gtggacgctg gggagcgggc tcccgagcac ttgtctacct 480 

nccgccagtc ctgacaactt ttctggccaa cctacccagc ttcgcttggc tggcgagcgc 540 

atctgctgct ggggttcgcg gtgcaaatgg agacgcagtg gtggccagag ggtgatggag 6 00 

aagacgggaa aagcgacagc cacgctnctg gcttgaagcc gcaggacgca aataacttac 660 

tttggacctg acagttctac gttgntgtgg angccctgtt tcctggaaat aaaactcaaa 720 

atggtggttt tttggaaaaa aaaaaaat 748 

<210> 465 
<211> 863 
<212> DNA 
<213> Homo sapiens 

<400> 465 

gggnnnnnnn aanggnnnnn ggnnnnngtc ccgttccaan gaccnngaga tcgnngncgc 60 
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tccanaagaa aggcggtgng aattcggcac gagacctgta ccgcctggcc actggctgtc 120 

accggcgtga tgagctgccg gtgtttgaac gcngcctatg cngggacttt cccggcanan 180 

nggcnngaan atggccncca tncaggaagc cgcccagaac ctcctnggnn acacnacttn 240 

agngccttcn agtccgntgg nacccggncc aagccccggc aancnctgcc ccgggtcncc 300 

gttcccaagg ccaaccagcc ctgggnaccc ccggggagcc gaaacnctgg ggctnggana 360 

ccgngantga gagncncact tttcnntgta nacacgggcc cagganacan ctntgctcgt 420 

ggccccgggg naaaimnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 480 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 540 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 600 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 66 0 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 720 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 780 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 840 

nnnnnnnnnn nnnnnnnnnn ncc 863 



<210> 466 
<211> 713 
<212> DNA 
<213> Homo sapiens 



<400> 466 

ngtctttcga gcntggngnt cgttctngct cnannanatt ggttgnggga attcggcacg 60 

agcctcagcc ccacaccagc tctatttcag gggtgagagt cagagagcac tgcaatatgt 120 

gcntcatggg atttcgattc gaagatccta gaccagggag acactgtgag ccagggatac 180 

aacaaaatac taggtaagtc actgcagacc gacctccctg cagtttggga aagaagctgg 240 

gtttgtggag aatcagagca tcttgacatg actgctgacc taaagatccc tggcattggc 300 

cagggatcct gtggaacctc ttctagttca ggggtgtgag cattagactg ccagttgtct 360 

agtgacatct gatgcttgct gtgaactttt aagatccccg aatcctgagc acctcaatct 420 

ttaattgccc tgtattccga agggtaatat aatttatctg gatggaaatt ttaaagatga 480 

atcccccttt tttcttttct tctctctttt ctttccttct ccctttcttc tttgccttct 540 

aaatatactg aaatgattta gatatgtgtc aacaattaat gatcttttat caatctaaga 600 

aaatggttta attttttctc tttactctat ggcanttcac tcaantggac aggggaaaaa 660 

agtaattgcc atgggcttcc aaaagaattg ntttatgntt tagctatttn aaa 713 

<210> 467 
<211> 732 
<212> DNA 

<213> Homo sapiens 



<400> 467 

gnnnggtnnt ctaatncttg nnnnnnnntc ncccttctaa gccntggnct cgnctnnccn 60 

acnancnggc ttncgaattc ggcacgaggc gagatgaact acactgtgag gtggaggtga 120 

tcagccggca cttgcccgcc ttggggctta ngaaccgggg caagggcgtc cgagccgtgt 180 

tgagcctctg tcagcagact tccaggagtc agccgccggt ccgagccttc ctgctcatct 240 

ccaccctgaa ggacaagcgc gggacccgct atgagctaag ggagaacatt gagcaattct 300 

tcaccaaatt tgtagatgag gggaaagcca ctgttcggtt aaaggagcct cctgtggata 360 

tctgtctaag taaggattcc atatggctct catatcattc cattccatct ctgccaagat 420 

ttggataccg caaaaatttg tgtttgtgga agattctgtc tgaactcttt cattcaagga 480 

actactacca tgaatctgca ttctgntgcc cacactgtgg tcttagtaga taatttgggt 540 

ggtctgaagc acctattatc tcttatttct ggtctctagg ctggtatgtt aatcctctga 600 

tatgttaaaa gtaatgggtg agaccngaaa aagaaatttc aatacngatc aantttgggg 660 
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tgcatgttga atttgcaacc tcaaattgga gtaagggaan attctggata cttgctggaa 
aggaggaatg tn 



720 
732 



<210> 468 
<211> 748 
<212> DNA 

<213> Homo sapiens 
<400> 468 

gnnagnnttc taatngcttg tnnnnnnrina gacgttctaa nnctttggcn atcgttnttt 60 

ctncagnann ccntcgattc gaattcggca cgaggccggc cggcgacgct ggcgacgctt 120 

tcgcccctga ggtagtttgg cgaccgcgaa gaaggaaaaa gggcgggcgg gcggctgtcc 180 

tctcaccgtc ctcaccccgc gaggcccggc ccgctcctnc gtcgtggatt tcgcggcgat 240 

ccccccggca gctctttgca aagctgcttg aaacttctcc caaactcggc atggatacga 300 

ctgcggcggc ggcgctgcct gcttttgtgg cgctcttgct cctctctcct tggcctctcc 360 

tgggatcggc ccaaggccag ttctccgcag gttggttgct tctttcgttc tctcctctgg 420 

gggctctgaa gtttcaccag gtggacgctg gggagcgggc tcccgagcac ttgtctacct 4 80 

tccgccagtc ctgacaactt ttctggccaa cctacccagc ttcgcttggc tggcgagcgc 540 

atctgctgct ggggttcgcg gtgcagatgg agacgcantg gtggccagag ggtgatggag 600 

aagacgggaa aaagcgacag ccaagctcct ggctgaaacc gcaaggacgc aaaataactt 660 

actttgnacc tgacagtttc tnacgtttgt tgtggangcc ctgtttcctg ggaaataaac 720 

tcaaattggt ggtttcttgg aaaaaaaa 748 

<210> 469 
<211> 776 
<212> DNA 

<213> Homo sapiens 
<400> 469 

gggngntcta atgcttgnnn tgattctccg tctataacng gntaatnctt ggncctacna 60 

aaaggctang ngaattcggc acgagacctg taccgcctgg ccactggctg tcaccggcgt 120 

gatgagctgc cggtgtttga acgcaaccta tgctggactc tcccggcaga ctgcctggat 180 

atggtcgcca tgcaggaagc cgcccagcac ctcctcggca cacacgactt cagcgccttc 240 

cantccgctg gcagcccggt gccgagcccc gtgcgaacgc tgcgccgggt ctccgtttcc 300 

ccaggccaag ccagcccctt ggtcaccccc gaggagagca ggaagctgcg gttctggaac 360 

ctggagtttg agagccagtc tttcctgtat agacaggtac ngaggatgac ngctgtgctg 420 

gtggccgtgg ggcttnaann tnannnnnnn nnnccnnnac caantctncn nannannnnn 480 

ccnacnnnta aaantnncnn ncnnnnncan nnnnnnnnnc cnnnnanncc nnnncttnnn 540 

naancnnnnn nnnnnnnanc nnnncancna nnnnnnnnna nnncnnncnn nnnnnnnncn 600 

nnnnnnnnnn nannccnnnn nnnnantnnn nnannnnnnn nnnnnnnnnn aaannnnnnn 660 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nncncnnnnn nnnnnnncnn nnnnnnnnnn 720 

ccnnnnnnnn nnnnnccnnn nnnnnntnnn nnnnnnnntn nnnnnnnnnn nnnnnc 776 

<210> 470 
<211> 765 
<212> DNA 

<213> Homo sapiens 
<400> 470 

tatgnntttn ctaaaatncn tgggcaanac gtcctcnctt tcctaanagn ttnggcanaa 60 
cccttggcaa nacgccngtn acccanacnc agnnnggccg tggcgggcga gcgggcaaca 120 
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gctcttgagg agtgagactg 
tgctgagttc atcaagagga 
gaaggcctgg gattttttgt 
ggaatctgta gttcagcact 
tgctgccctg ttagacatca 
ttntcagatg agtaaaggac 
aaattcgttc aagaaaattc 
aactgangag aatgcannct 
ccagtcaaac ctacctacgn 
tccatgctga ggcgcaatca 
acctncgaca agaaggggan 



cnggagatnt gggccgtgcc aaagagatgg atgagactgg 
ccatcttgaa aatccccatg aatgaactga caacaatcct 
ctgaaaatca actgcagact gtaaatttcc gacagagaaa 
tgatccatct gtgtgaggaa aagcgtgcaa gtatcagtga 
tttatatgca atttcatcag caccagaaag tttgggatgt 
caggtgaaga tgttgacctt tttgatatga aacantttaa 
ttcanagagc attaaaaaat gtgacagtca gcttcagaga 
ggattccaat tgcccgggga acacagtaca caaagcccaa 
gggggactac tccagactcc cgnacncctt cacgtcctcc 
ccgcttctgg gncaagaagt tanaaacnct gggaaaaact 
catttanatt tacccnaaat gaana 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
765 



<210> 471 
<211> 820 
<212> DNA 
<213> Homo sapiens 

<400> 471 

cnnnnngggn nnngnggcgn cntccnaaan ccggggcgac agngccnnng ttccaacaga 60 

ccnggngngc cgncngngcc ccanacagca ngggnggggc nnnggggnnn cnncgncnnn 120 

cnnancnaca aagaactcaa caagaaaaaa acnaacccca caagcgggca aaggacgnga 180 

acagacantn cccaaaagaa gacatacaag caaccnaaaa taatcnaaaa taagnnncaa 240 

aaagaaaaaa ngcnagacag agnngngana gnactnagna aaaagngana tctagcggcn 300 

annagnangn nngnnnacgg ncngnnncna agaaanagnc nctggnnccc aagcnggagn 360 

acagcggcgc aagcnnggcn cactgcaacc gcgaacnccc gggctcaagc gaaccnccag 420 

cctcagcctc ccaagnagcn gnnaaaggca ngcaccacca cacccgacna aaatancngc 480 

nancaanaac ananaanggc nccccngngc nnanncagga aanaaacacn cnnangcnnc 540 

ngaaaanaan naancncncn cnnnacaaaa aaacnnnagc cnnagaacaa nnnnggaggc 600 

ggaanacggn nnancccgac anganaanga nacnanngan gganganngg gaccaaaccn 660 

cancccggga anggcnnngn aaaaaaaang ccnnnaaann gggggaaaan ncggngnang 720 

ccnaaagggc cnnaaanggg gaaacccnan naaaangccg ggcanannan aaccnagcnn 780 

nancnanccn nccaangggg nannnccncn nncnaggccg 820 

<210> 472 
<211> 738 
<212> DNA 
<213> Homo sapiens 

<400> 472 

gnngtgtctc taatgcttgg ctactngttc tttccgccga acncttgcta atgcttggcn 60 

ntcgttcttt ctccacnnac nnngcnntnc gaattcggca cgaggtcaca ganatnaaag 12 0 

tccaatcata ggggctggnc cnacntctnt gctnntccct gcangantca tangatcagn 180 

nanaccgtgc gnntttgnaa gcntttcaaa tgtgntacca tcgngttact tncnnnggca 240 

cctgntgann tnggttgnac tnnncnggat nctccaaanc caccnnnncn atgggntnng 300 

tgngcatgng ntggnncann nacagannna ganactttaa ngaannngnt tntgcaaccn 360 

tnggnnctag caancntgan antnccaggg nnggccacna agctgaaaat nnatgttana 420 

ncnnatgntg naatctctag natgacttcc ncannnancn aaactnangc anggctgcna 480 

tgttagaanc tanaggccna atttcttntc natgnaacca ntntatgctt ttaagaccnt 540 

caactgtnnc natgaagccc atntacatna ttncggtaat anggctatnc ttaaannnaa 600 

ctgctgaaaa tnatgatnca nctacgaaat cctnncancc ncatntggct naatcattac 660 

caaccatttg acaccnncat ngnctaccca cntgcattnc catgaccnan tccantgcca 720 
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cccgcncaga tntacctt 738 

<210> 473 
<211> 752 
<212> DNA 
<213> Homo sapiens 

<400> 473 

tatgnntncc taanagagtt ntgnnacacg gcccgccttc tnaaancttc ctaatncttg 60 

ggcgctcgtt ctntctncac ncagnnnntg cggtncgaat tcggcacgag gtccttttga 120 

accaccccaa agaactcaac atggcaaagc aaatggtaaa agcttcccga ctgttctact 180 

ttgggtccgc gcgaagccca ctcacgtgtg atctgtgttg cccctgggag gcccggggcg 240 

accggaaaag ggctctctca agttctgaaa agagaatctg ccaccagatc gaatttcgac 300 

ccctgagctt gttcggacgt atggtccaaa ttcagattaa ggtggtcacc caacccgaga 360 

tgtcaggaaa ggccttctgc agagaaaatg tccccccacc cgccatctgc agccaggtgt 420 

gtgccacacg gcagccttcc cgaaacatag tatggatttt aaaaatgtgt ntatttttgg 480 

ttctcaacca ctttataacg tattttttaa tttattttgt aatgtcttgt tttgaagtat 540 

tgctgctatc cttggtatcc ttcccactgg ttttatcact ganttatttt gngaaagttg 600 

ncactaatgt tctatgtcaa aatcaaaagt atttaatgaa atactanntc tatttaatgt 660 

ggntatggaa ccagctggaa acacaaaaca aacagtgatt gacancaagc tgggcccaag 720 

agncaggtca ttttgnacat atgccaataa ac 752 

<210> 474 
<211> 752 
<212> DNA 
<213> Homo sapiens 

<400> 474 

ttgcanacnn aatanttgct gtaaaagtcc cnnctttttn ccctttctaa tgnttgngcg 60 

ctcgntctnt ctccacnagn nnntgcgttn cgaattcggn tctnagccca tgccgggatc 120 

ttcccacacc cgtcctcaca gatccagccc cagcccctgt cttcccaggc catctctcag 180 

cagcacctgc aggatgcggg cacccgggag tggagccctc agaacgcatc catgtcggag 240 

tctctctcca tcccagcttc cctgaacgac gcggctttgg ctcagatgaa cagtgaggtg 300 

cagctcctga ctgaaaaggc cctgatggag cttgggggtg ggaagccgct tccgcacccc 360 

cgggcgtggt tcgtctcctt ggatggcagg tccaacgctc acgttagaca ttcatacatt 420 

gatctccaaa gagctggaag gaacggaagt aatgatgcca gtttggactc tggcgtagat 480 

atgaatgaac caaaatcanc ccggaaggga angggagatg ctttgtctct gcagcagaac 540 

tacccgnccg tccaagagca ccancagaaa gancctcanc cccagacagc acggnctaca 600 

cgcanctcgt gnacctggat gacntggaac anaatggtan cnaatgtggg accacngnct 660 

tgtancccna ggacaaggcc ctncnangct tgntggangg gtcnantcng anaaatggng 720 

gccactgccc aacccgcang aaganaacaa nn 752 

<210> 475 
<211> 742 
<212> DNA 
<213> Homo sapiens 

<400> 475 

gntttctntt aatncttttn naaangcgnn ntttaccntt ctangnntgn gnctcgttct 60 

ttcccacnna nnnnncggtn cgaattcggc ncgaggtgaa acagaaagtg gagatgcttt 120 

ccttgacctg aagaagcctc ctgcctccaa atgcccccat cgctatacaa aagaagaact 180 
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cttggatata aaagaactcc cccattccaa acagagcctt catgcctttc tgaaaaatat 24 0 

gacagtgatg gtgtctggga ccctgagaag tggcatgcct ctctctaccc agcttcaggg 300 

cggagctcac cagtggaaag tctgaagaaa gagttggata cagaccggcc ttccctggtg 360 

cgcaggatag tagatccacg agagcgtgtg aaagaagatg acttanatgt tgttctcagc 42 0 

cctcagagac ngagctttgg agggggctgc cacgtgacag ccgctgtcag ctcccggcgc 48 0 

tcangaagtc cattagagaa agatagtgat gggcttcgtc tgcttggtgg acgtaggatt 540 

ggcagtggga ggataatctc tgcccggacc tttgagaagg atcaccgctt aacgataagg 600 

acctgcggga cttgagagac agagaccnan anaaggactt caaggacaac gtttcangan 660 

anaanttttg gagaaagtaa ncntgtcttt tggtgancgt anaaanaaat gattcttacn 720 

cnnaanaaga acccgaatgg tt 742 

<210> 476 
<211> 1122 
<212> DNA 

<213> Homo sapiens 



<400> 476 












gnnngggnnn 


ttctaaaagc 


tgggnnnnrm nnngaggnnc 


ttctaatnct 


tctaatggtt 


60 


ggctctcgtt 


ctttctncac 


gcagcnnngc 


gnnncgaatt 


cggcacgagc 


ctgcagccac 


120 


taatgcattg 


tgtatgataa 


caaaaactct ggtatgacac attttctgng atcattgnta 


180 


attagtgaca 


tagtaacatc 


tgtagcagct ggttagtaaa cctcatgtgg gggtggggtg 


240 


ggggtgtatn 


cctngnggga 


nggnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


300 


nnnnnnnnnn 


annnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


360 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


420 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


480 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


540 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


600 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


660 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


720 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


780 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


840 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


900 


nnnnnnnnnn 




nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


960 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


1020 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


1080 


nnnnnnnnnn 


annnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nc 




1122 



<210> 477 
<211> 747 
<212> DNA 

<213> Homo sapiens 
<400> 477 

gnngtgcctt tgaaannccn tttgnnnnng nggcccttct aatgctnttn cgntcgnggg 60 
gtcgaactcg ccccacncng cnaggcgggg gctncaagcg attctaaacc acctatgagt 120 
atttctttta gggctcactt aaatacatgt ntgngnntac tgggggctag ccngaataat 180 
tttagatctg atcaggtngn ngctnaaatt ngaaaaanac cnnntngatg cttaaagaat 240 
tngcntccat ttttgagtct aaatctttta aaatntactg ngatccacat ctagngaaat 300 
gtcngtgtca anatattctn gatnatcgct naaatccnca ttaatactcn ttnggggtnn 360 
nnnatagngg aacttcntag nnntncnaaa agcacatngn cttcctgnct ccgctgctcc 420 
cacagnnggt nttgnaactg ggnaaatcag nnnnnngata gcgngngnnt ntnaganaaa 480 
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ntngatncac acatncttnn nnctcagncn ncacatngat tgaacactct ggccaagatg 
ctgnggngga tgangttgga gttcgannga agaagccngc gctggcctgg cttgnaagac 
ccnngncttt cccntnccct cnctngaaag ctgcccngac ngaggccnaa ngnaaatggn 
tganngnncn gtcnngcccn cttcngncnc ttngaaccnn nnagnggnnc tnnnngnacc 
cnnngnnntn cgngnaaccg nnccngc 



540 
600 
660 
720 
747 



<210> 478 
<211> 746 
<212> DNA 

<213> Homo sapiens 
<400> 478 

gnnnnnngcg cgnccttcta atgcttcnta attnnctngg atactcgttc tttctncagg 60 

natcnnntgc gnttcgcaag gagnagagtg atagnaattg gcagtgaaat atacgaacca 120 

ccctcctgcc ctctgggttc acaatacgtg tacacttgac tgtgaagtgg ctgtgagagt 180 

gggtggagag ttcttctttg accctcagcc tgcggatgcc tctagaaacc tcgtgttgat 240 

tgcaggagga gtcggaatta accctctgct ttccatcctg cggcacgcag cagatctcct 3 00 

cagagagcag gcaaacaaaa gaaatggata tgagatagga acaataaaac tattctacag 360 

tgcaaaaaat accagcgaac tcctgtttaa gaaaaatatc cttgatttag taaatgaatt 420 

tcctgagaag attgcatgca gtttgcatgt tacaaaacag actacacaaa tcaatgcgga 480 

actcaagcca tacatcacgg aaggaagaat aacggagaag gagataagag atcatatttc 540 

aaaagagact ttgttctata tttgtggccc acctccaatg acagactttt tctccaagca 600 

actggaaaac aaccatgtac ccaaagaaca catttgcttt gagaagtggt ggtaggaggc 660 

aagaccaaag gcaggaaaaa attaangagg tgagatctac tcaaggagag ctcaaaaaaa 720 

aaaaaaaaaa actngggccc tttaga 746 

<210> 479 
<211> 750 
<212> DNA 

<213> Homo sapiens . 
<400> 479 

gnnnnnnnnn nngngnnnnn ttctannntt cntattnnct nggagctcgt tctttctnca 60 

ggatcccntg cgattcgaat tcggcacgag ggtagactgg ctagggatcc tggacccagg 120 

gttccacgta gcaacacctg ctgagttctc tgggttttct tcctgcctca tgtagcccag 180 

acttggagct gaagaagctg gaaacatgga aacaccaaca gctacagacc aaaaaaagtc 240 

ccaacaaagg cctgtcagtc tgccagcctg ttctgtggat ttccaactca agattgcagc 300 

atcaactcac acctgaagtt ctggcttccc tacaaacttt gaacttgcca gtccccacaa 360 

tggcataagc caattcctta aaatgaatgt ctagttctag ataatgtgtg tattctactg 420 

gttctgtttc tctggagaag cctactaata gatcatttgt cttagtcaat tcaagctact 4 80 

ggtacagatt accatagact gggtggttaa aactaccaat cttattactc acagtttttg 540 

gagtctggaa agtctgagat cagggttcca gcaggattga gttctttggt gaacatnctc 6 00 

tttctggnct acagaatact gggttacttt aagtnggaaa aagtagggtg aagctgggtc 660 

ntttggcctc ttcttttaag ggggactaat tcatgaaggg ttccaccctt attgacctat 720 

tttaccttnc caaanggntt ccattttccn 750 

<210> 480 
<211> 714 
<212> DNA 
<213> Homo sapiens 
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<400> 480 



gnnnnnngnn nngnggnnnt 
gctaggcttg ngaattcggc 
ttatttgaac cctataccaa 
acagtgccct gcatgaaggg 
gtggctttgc tctctccatt 
tctactgttt ttgcctaaac 
taatattaca ttgttctgga 
ggacaatcag attatttcat 
agctcttctt tggtgttggc 
gactgcaata aagtatagct 
tcggtgaaat aaangcaaga 
tcgaccttta nactatnnga 



tcnaccgttc ttntgaccta gnctcgntnt nnccnacnna 
acgagataac acacatcaca gtatgctctc agaaatttct 
tatctgntga tcaatgacca tttttgctca gcatggagaa 
tagtgagaat aaaaaggatc ttaccacctt tatcatgagg 
ccaagttgtt ctctgttcta gaaagcagat gtagtagaca 
agaatccctt tttccttttt ttgttaaaag tactcatccc 
aggactgaaa ataacagaac tcagcaccat gatcggaccg 
tcctcancaa acggagatcg atccgaaaag tggaaatatg 
atatggaccc tgagagaaag aactttaatt ttttctcttg 
gcctaaaata cgtttcctga ccttggangt ttgnccacaa 
cgtacacttg gatgaaaaaa aaannnnnnn naaaaaaaac 
gtcgatacnt aatcngactg atagatcatt gnta 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
714 



<210> 481 
<211> 742 
<212> DNA 

<213> Homo sapiens 
<400> 481 

agccnttcta aangccnttt gnctnngnnc cccnttccta anncntggct aatncttggc 60 

nactcgttct ttctncacgc acccatcgnn ncgaattcgg cacgaggcat gaaaggagtc 120 

ggaagcggaa gcggtagccc ggacggtgct gtggtgcaag ggcttgtgga aaaattggag 180 

aaaaccaagt ccctggccca gcagttgaca agggaggcca ctcaagcgga aattgaagca 240 

gataggtctt atcagcacag tctccgcctc ctggattcag tgtctcggct tcagggagtc 300 

agtgatcagt cctttcaggt ggaagaagca aagaggatca aacaaaaagc ggattcactc 360 

tcaagcctgg taaccaggca tatggatgag ttcaagcgta cacagaagaa tctgggaaac 420 

tggaaagaag aagcacagca gctcttacag aatggaaaaa gtgggagaga gaaatcagat 480 

cagctgcttt cccgtgccaa tcttgctaaa agcagancac aagaagcact gagtatgggc 540 

aatgccactt tttatgaagt tgagagcatc cttaaaaacc tcagagagtt tgacctgcag 600 

gtggacaaca gaaaacagaa ctgaagaacc atgaagagac tctnctacat caccagaagg 660 

ttcagancca atgacaagac ccancaagca naagagccct ggggagccct ctgctgatcc 720 

caaanggcaa aaaatggggc cn 742 

<210> 482 
<211> 752 
<212> DNA 
<213> Homo sapiens 

<400> 482 

gnnngnggag nnttctaatg ctttgntcta gagtcncnnt tctaanggct tggnaatnct 60 

ngctcttgtt ctttntgcag gatcccatcg attcgaattc ggcacgaggc caagcctcgg 120 

cctccactgc acctgctgcg gagtggcacc tttgcctgca aggccctcta ccccatggcc 180 

cagtgtcatc tcagcagggt ctttggccac tcaggaggcc cttgtggtgg gttgctcagt 240 

ctgtccttcc ctcatgagaa gctactgctt atgtccacag accaggagga gctgtcacgc 300 

tggtaccaca gtctgacttg ggctatcagc agccagaaaa actagaggaa tcttatagat 360 

tccagaactc aggatacctc agggataggt cacagccaag agtacaaagg aatcttcagt 420 

actgaacaaa acagaaccct tcatgatttg acaaaggtca ctttctgttt gcctggacca 480 

agctactcca gatcatctga ccaactctta aaaatcacgg ccaggcacag tggctcatgc 540 

ctgtaatccc agcactttgg gaagcaaang tggcaggatc attccagccc aggagtttca 600 

agancagcct ggcaacacag tgagtgagac cctgtctcta tttaagaaaa aaattattaa 660 
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gaaattttat taaaaaagga agaatcagga aaccaaagtc aaccccaact taaccctcaa 720 
tgaaccagcc ctaacacaga tgangggatt tg 752 

<210> 483 

<211> 649 

<212> DNA 

<213> Homo sapiens 



<400> 483 

gnnnnnattn ccctttnaaa tnccncngaa ancccttgga agcactaccn ctcngacccc 60 

tttggaacgn cgactnctnn atatatcnng gatataatag gtgataagtt ctgncaatta 120 

gtaacatcng gaaaaaacag ctnngncctg ggngaaaaag gatgccaaaa tngcctggaa 180 

aagagcagng gagaggagtc cgggagatgn gngatgcatc gggacgcanc atngntnaac 24 0 

attcactggg tctgccaaaa atgtggattt gngggctgct tagatngtta caaggcaaaa 300 

ggaaaggaaa gagttctaga gataaaagaa ctatatgctt ggatgaagtg tgtgaaggga 360 

cagcctcatg atcaccaaca tttaatgccc aacccaaaat tataccnggt tctgntttga 420 

cagacttcta gatgccatgc acactcttag ggaaaaaata ttgggattaa ancccatngg 4 80 

cattggacta acaaacagga atttacaagg tnggaaantt ttncnaccaa tgaaaggggg 540 

gatcncaagg ttttccagaa nggntcntaa tcncaggnaa taaaaattnc tctngggcaa 600 

gccctgagtc ttaancagca aaaanactcc tcccgaancc tgnagaaaaa agggggggca 660 

gccaggcccn naaanggaan gtnaggcccn agatnaacaa ngtnacctcc ncccagnaaa 720 

ccccannccc caactggnac cngggnaacc cacaacnttt gcngaagncc aaaaaagncc 780 

nnnagangga aaaaaaaaaa naananaaaa aacctnnnag cccctaagaa accttagggg 84 0 

nggcccncc 849 



<210> 484 

<211> 1098 

<212> DNA 

<213> Homo sapiens 

<400> 484 

gnnnnnnnnt ttnnnnnttt ttgnaaaanc ccccttttgc naaatngncc ctttttntgg 60 

cangggatcc ccatntttat ntcggacatt ttcgggccac cggaaggggc cgggggcccc 120 

cgggccncca ggnccgggna aaggcccccc ttgggcggcc cccggncggc cccaatgggt 18 0 

tccaaaaagg gaaaaaaaaa aaagggggaa cctgggaagt tggcccanga aaangnaaaa 24 0 

aaaggnaagn aaaccttccg ccaatgggaa tggggaaaaa taattttttc ttgaaaaacc 300 

caaaaaagga atggttattt ttcaaattta aaaaaggaac nttgggaaga aagaattggc 360 

ttcccacncg cagaaagggc attactggct atgtcaagta aaagaagtcc ttcaaagctt 420 

agttgatgat ggtatggttg actgtgagag gatcggaact tctaattatt attgggcttt 480 

tccaagtaaa gctcttcatg caagggaaac ataagttgga ggttctggaa tctcaagttg 540 

tctgagggaa gtcaaaagca tgcaagccta cagaaaagca tttgagaaag ctaaaattgg 600 

ccgatgttga aacggaagag cgaaccaagg ctntgcaaaa agagcttttc tttcactttc 660 

gagaccaaag gggaaccagc tnnaagggcn agaaaagttn gaaaaaaatt ccaaaggaac 720 

tggtggaatc ccccaaaagg tttggttggg gaaagaaaaa ttcccgcccc aangccaaaa 780 

tttaaaaggt ttngccccca aagggaaaag ncttgncctt taacccagga attggggacc 840 

ctgggantta aaaccnataa ttttcccgcc naattnnaaa aaattcnttt nggggncccc 900 

naaaanggna aaaaaatttt nggggggttt tggnaaggna aaaatttnaa atttggattt 960 

ngaaactttt ttngggaatt ccccagaaag aacttttgac cttccnttng acctnaaaaa 1020 

ttttcccttg ggggggtgna anggatgttc ccaagctttg tggnatattg gtaaaatttt 1080 

naaccttttn tncttacc 1098 
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<210> 485 
<211> 798 
<212> DNA 

<213> Homo sapiens 



<400> 485 

gnnnnnnant nnnntttnaa atccttnntg aatcctttga antaccatcc cnttttncga 60 

attnggcacg aggaaaggtg gcgcgcttct cacggctgag ttgctgcgcc ttgcagacgg 120 

aagctcccca caggcagagc tgcttggatg tgtgagtcat gaaccagaga agccccgctc 180 

catgagcagt gactccccan gccctgtgac ctccctcctn cttgcagctc ctcctggcac 240 

cagtccccag ggctctcctg ttggtagttc ctgcttttct tcttggaaat tcctcgtgga 300 

cctcgagatc tttaccctaa aatagttctg ttgaatttca ccctggcaat gtaaattgat 360 

agcttatctt cacagatgcc agacaatgga caactcacca tcagtcctct gctcacctga 420 

gacaaatgca tgtctgattg cttcctctgc cctattgntt atgtgaaaat gcagattcac 480 

tgagccagac taaggcatca gtgactgttc ctctacctgc ctctcacatg gagattgtgt 540 

attcagtgaa aggctgatca aagacccaaa ggaatgcaac agtttatctc ttatctacct 600 

atgacctgcg aactggccaa caacccagtt gttgncgcct tttcagacag aaccagtgtc 660 

atcttacacg tattnaaatg gatgtcctgg ngtctnccta atatgtattc aaaagcaagc 720 

tggggcctng accacccttn ggcacatatt cctcanggac atcattcctg angctgtgtc 780 

actggcatgt ccttaanc 798 

<210> 486 
<211> 785 
<212> DNA 
<213> Homo sapiens 

<400> 486 

gnnnnnnttt gaaanccctt tcnaatnctt ggcattgntc tctttgcagg atccctcgat 60 

tcgctgacaa cttgattggg ttctccttca ggtttgaagc gccctcgaga agtgtctaaa 12 0 

ggagacagtt gatagccaaa caacagtttt ggattcactg actgattatg aaagaagcag 180 

tagactggta tcaagaatca gtcagcaagg aggccctcac cagacgccag tgccatgttc 240 

ttggacttct cagcctccat attcatgaac taagtttttg gaatccttag gcttccacgt 300 

gtggaaagcc tgagctaacc tactggagga tgagccatca cctggagcag attcaggcca 360 

tcctagttga agcctcccta ggccaagcaa ccgtccaact accagacatt gaccattcag 420 

ccttgaacat tcagcacaaa gacaaaacag accagaccag aagagtccca cagaataggg 48 0 

gaaactattc agagaaaact taagccacta agttttatgg tgttttgttc tgtagcagaa 54 0 

gcataggcat actgacaata caaaccgaaa tccttctaac gtagtggacc ttttcangcc 600 

agcatttttt ccttgaaaac ctggagcatg tatccatctt atagcagaga tcactttcac 660 

aatggttggg ctcttggatt tgaattgatg atgtaatgag ccctctttnc ngattgnaac 720 

ttaattactc tgggnatttg ntggattccc aaccttctaa tatttacttt tcctctttan 780 

taanc 785 



<210> 487 
<211> 797 
<212> DNA 
<213> Homo sapiens 

<400> 487 

ttgtnnnncc cttttnaaat ncctttggct anttgntctn tttgctngat cccatcgatt 60 

cgaattcggc acgagnnngg actaccttnc aaaaccnggt ngggaagcnt gttacagaan 120 

tgatntctan tcccctgnat tctggatgct gcagaccaac acctgccnac aanacncana 180 
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gananangac 
agaggctngc 
tgggcttcaa 
nctnagtggn 
gccagataaa 
nacnttnaaa 
tggngnctcc 
tgngatttng 
acttggntta 



nncatttacn 



cacacacann 



caancantat catgtaagac agnncgntna ntnnnnnatt ntnatncttn 
cantnttgta nantggntca tgngtctata natnnttgta antattntnt 
ganantctga atcttaagca tatgctccat cnttnnatat gctntggtgg 
cntnattcat nttnncatgg agncaagttt aatgcctcta gantacattc 
gcatncttat ttnnaactcc ctgagtgatg ggtggataaa tcnaacattg 
ntcaagacaa ctttgntggt ggttttgntc acaatcatga aaatggttnn 
tattttgata ttagntttcn tttttnatat anngcggtag gtttgaattg 
tgnntngggt tgtnaagaca ntggnttnca atnnaattta tnacatgaat 
cctttggnga aaccttaaag aanttntgna tacttcttca taaaagggtg 
naantttcgg gggttttnaa tttttnntga agcttatttc ntganaatnt 
ccaagcc 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
797 



<210> 488 
<211> 762 
<212> DNA 
<213> Homo sapiens 

<400> 488 

caaatcnntt gctctngttc tttttgcagg atcccatcga ttcgcgacag ctctccaata 60 

ctcaggttaa tgctgaaaaa tcatccaaga cagttattgc aagagtttaa tttttgaaaa 120 

ctggctactg ctctgtgttt acagacgtgt gcagttgtag gcatgtagct acaggacatt 180 

tntannggcc caggatcgtt ttttcccagg gcaagcagaa gagaaaatgfc tgtatatgtc 240 

ttttacccgg cacattcccc ttgcctaaat acaagggctg gagtctgcac gggacctatt 300 

agagtatttt ccacaatgat gatgatttca gcagggatga cgtcatcatc acattcaggg 360 

ctattttttc cccacaaacc caagggcagg ggccactctt agctaaatcc ctccccgtga 420 

ctgcaataga accctctggg gagctcagga aggggtgtgc tgagttctat aatataagct 480 

gccatatatt ttgtagacaa gtatggctcc tccgtatctc cctcttccct aggagaggag 54 0 

tgtgaagcaa ggagcttaga taagacaccc cctcaaaccc attccctctt caggagacct 600 

acccttcaca ggcacangtc ccccaaatga gaagtctgnt acccctcatt tcttnatctt 660 

tttacttaaa ctcaagaggc agtgacaggn agtcaggggc aagacattac atttttcata 720 

ctttcccaca tctgaaaaga tgacagggga aactgcaaag cc 762 

<210> 489 
<211> 822 
<212> DNA 

<213> Homo sapiens 
<400> 489 

ttnnnnnnct nnnggnnttt cnaatncttg tttctcgncc tttctgcagg atcccatcga 60 

ttcgaattcg gcacgaggat tttcgaaact cttcagctac ttgccctttt ttatctgaaa 120 

ccatcatacc ttctgaaaga aaaaagcata tcttcattga cataacagaa gtgagatggc 180 

ccagtcttga tacagatggt accatcntnt atatggagag tggcattgtg aagataacat 240 

ctttagatgg tcatgcatac ctctgcctgc ccagatctca gcatgaattt acagtacatt 300 

ttttgtgtaa agttagccag aagtcagact catctgcagt gttgtcagaa acaaataata 360 

aagccccaaa agataaacta gttgaaaaaa ctggcaaaat ctgtatacgt ggaaatttac 420 

cangacagag actgaagaat aaagaaaatg agtttcattg ccagatcatg aaatccaaag 4 80 

aaactttaaa gaagatgagt tgtgtaaatg gaactgaagg gagggaagag ctgccttcgc 54 0 

ctggtacaaa gcacacatgt gtatacacat gggtcaagca gtgctggtct gtggctgcct 600 

gtccagagga atgggaaata ttcctttgtc tttagcactt catttttcta aataaaaatc 660 

anccaatatg tctaaaaaaa aantttnttn ataataaacc tngaagccct nttanaacct 720 

tntnntggag gtcctnnttt accntatgat tcccggaact tggataagga atcccntttg 780 
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gattgganat tttgggccna aaacccncna nncttggaat cc 822 

<210> 490 
<211> 789 
<212> DNA 
<213> Homo sapiens 

<400> 490 

ntgtaancct tttcaaatcc cttggctact tgntctttct gcaggatccc atcgattcga 60 

attcggcacg aggccggacn gtgactctgg nnacgcttgc gnccntnacg tagntngnng 120 

accntgcang anggaanaan ggctggccnn cngntgtacn ctnaccgtcc taaccccgcg 180 

aggtccaggn ccgctccttt cggngnggat tctcgcggaa natccctccg gcagctcttt 24 0 

gcaaagctgn ttagaaactt ctcccaaact cggcntggat acgactgcta tagggctcgc 300 

tgctgctttt gtggagctct tgctcctcta tccttggcct ctcctgggat acggcccaag 360 

gccaagtntt cacgcangtt ggtacgctta tttcgttctg gactctgggg gctntgaann 420 

ttcaccacgt ggactgctgg ggancgggnt nccgancact ngnntacctt acnccanaat 480 

ctgacaactt ttctggacaa cctacccanc ttcaattggc tngngagcnc ntcngntgct 540 

ggggnntncn gtgcaaatgg agncncaatt ggtgggcaaa tngttgatgg ncaaaacggg 600 

aaaaagcaac nnncaangct tttggctnaa agccgatang acncaaatta nttnctttgg 660 

accttganaa tttcctcaan nnttttnagn anncnctttt ttncttggan aaanacttaa 720 

aagtgaacga ttnttgggaa anaaacaaac tataataact naaagctttt ntaaaaaaaa 780 

annaatnnt 789 

<210> 491 
<211> 790 
<212> DNA 
<213> Homo sapiens 

<400> 491 

tccaaaatnc ccttggantn attccccctt ncaatacctt tccttngnac actcccngtt 60 

tngntngatc ccatcgattc gaattcggca cgaggnaaca aagaaggaat gtcttcctca 120 

tgtttnggtc tatagaagac gttaaagaaa acttccagaa agtgggtttg aggcatgagc 180 

caccacgcct ggccaaagga tttaatgaat taatggatgt acagtgctgg ggctgttatt 240 

ctagggcctg cattgagact cacattttgc catcaaaagc cttttaagag gtggaggttg 300 

cggtgagctg acatggtgcc actgcactcc ggcctgagtg acagagtgag actctgtctc 360 

acaaaaaaaa taatgccctt taaataatga ataatagtga tagaaaatgt catttcttgg 420 

acaaatgaaa aattgaaatt aatgtatata attagatatt attagctact cttaggtagc 480 

ttcatttgtt gaaagtttga caagtgaatg aagttcacat ctggaaatcg ttgaacattt 540 

ttcgttcatg gaactcaatg gctacgttag tcgtttatgc ttttcactgt tgtggtaggg 600 

gctttggaaa gtnaatgcca tcaacaatgg atacagaang acctggattt ggaataaggg 660 

caaaaattta ttttgatggg gctgaattgc tctgccaggg agcattttgg gtattgagat 720 

gaaaatggcc tctctttgag actgagctgc cacctggcaa attattgnct gcttaanggt 780 

tctctttatn 790 

<210> 492 
<211> 804 
<212> DNA 
<213> Homo sapiens 

<400> 492 

tcnaaatccc ttttgnnagn ttcncncttt gtttcccttt nctnggctnc ttgttctttt 60 
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tgcaggaatc ccatcgattc gaattcggca cgaggtcctt ttgaaccacc ccaaagaact 120 

caacatggca aagcaaatgg taaaagcttc ccgactgttc tactttgggt ccgcgcgaag 180 

cccactcacg tgtgatctgt gttgcccctg ggaggcccgg ggcgaccgga aaagggctct 240 

ctcaagttct gaaaagagaa tctgccacca gatcgaattt cgacccctga gcttgttcgg 300 

acgtatggtc caaattcaga ttaaggtggt cacccaaccc gagatgtcag gaaaggcctt 360 

ctgcagagaa aatgtccccc cacccgccat ctgcagccag gtgtgtgcca cacggcagcc 420 

ttcccgaaac atagtatgga ttttaaaaat gtgtttattt ttgtttctca accactttat 480 

aacgtatttt ttaatttatt ttgtaatgtc ttgttttgaa gtattgctgc tatccttgnt 540 

atccttccca ctgtttttat cactgattta ttttgtgaaa agttgtacac taatgttcta 600 

tgtcaaaatc aaaaagtatt taatgaaata ctagttctat ttaatgtggg ntatggaacc 660 

ancttggaaa cacaaaacaa acaggggatt gtacaagcan gcttggggcc caagnaaggt 72 0 

caaggttcat ttggttacca tatgccnata aaacctcanc gaanttttaa aaaaaaaann 780 

nnnnnnaaaa aancttggng ggct 8 04 

<210> 493 
<211> 800 
<212> DNA 

<213> Homo sapiens 
<400> 493 

ggnncnnttt nccccccttt tgaaaacccc ttttgggnga ancccncttc tttnaaatcn 60 

cttggctact cgctctttnt gcaggatccc atcgattcga attcggcacg agtatataac 120 

aacttttgct ttcaaagttg ggtgggacta gaacacacaa tggaaggatg gagtcaggag 180 

acctggattc ttgtgcccgc tctggctttt acagtctgcc taactctatg cagtcacttc 240 

ctgccagcct gtttccttac ctacaagagg gagagacact ccctggccag cctagttctc 3 00 

agggtgaacg aaaggtcatt atcactgcat cctctagtca tttgcttctt cgctaattaa 360 

cacatcttga gcacctgcga tgttccagga acaggagatg gcagcgtgca agataaaagt 420 

ccctgacttc tagagactgc atgttagtgg caatcggcgt ctacccggcc ttcaataaac 480 

tactgaatga aggaaaattc tacctagcac cagacacaat tactgggttt ctaaaatgga 540 

attattcccc cggccccctg catccagcag cctgctgcag ggaagctcct ccgaagctgt 600 

aggcaggagc gggacaaatg cttgctatca gcttcacaga atgttaccta agtactattc 660 

ctacacagcg ccttacagaa caaacagtaa aaaccaaatg gnaagcatgc acnggcttaa 720 

aaactcaaac ttcctaacta ctcagtaatt anganggtca ttttacccca aaatagaatt 780 

ttcnatttat ccaataanaa 800 

<210> 494 
<211> 757 
<212> DNA 

<213> Homo sapiens 
<400> 494 

nggnttcnnt ctaactnaaa cngttnggna actcncctct ntctgtngat cccatcgatt 60 

cgctaacaag cgattctaaa ccacctatga gtatttcttt tagggctcac ttaaatacat 120 

gtttgtatat actgtattct agccagaata attttagatc tgatcaggta gtagctaaaa 180 

ttagaaaaaa acaaaataga tgcttaaaga atttgcatcc atttttgagt ctaaatcttt 240 

taaaatatac tgagatccac atctagtgaa atgtcagtgt caaaatatta tagattatag 3 00 

ctaaaatcca gattaatact catttggggt tttttatagt ggaacttcat agtaatacaa 360 

aaagcagatt gtcttcctgt ctccgctgct cccacagtag gtattgaaac tggtaaaatc 420 

agttttttga tagtgtgtgt atataagaaa aaatagatac acacattctt ttttctcagt 480 

caacacattg attgaacact ctggcaaaga tgctgtggtg gatgangttg gagttcgaaa 540 

agaagaagca agcgctggcc tgccttgaaa gaacccgaaa gtctttccca ttcacttctc 600 
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tagaaagctg ccaagacaga ngcagaaagg aaatggatga tagttctgtc aagcacactt 660 

ctgntctcnt agaacttaga aatggttcta agagaacaga agttatngag aacagttcnt 720 

gtggaattca acatcttggg tgggacncat tggcttt 757 

<210> 495 
<211> 756 
<212> DNA 

<213> Homo sapiens 
<400> 495 

ggnnnnnntc ttttcnaatg cttggctctc gttctttntg caggatccct cgattcgcaa 60 

gagagagtga tagaattggc agtgaaatat acgaaccacc ctcctgccct ctgggttcac 120 

aatacgtgta cacttgactg tgaagtggct gtgagagtgg gtggagagtt cttctttgac 180 

cctcagcctg cggatgcctc tagaaacctc gtgttgattg caggaggagt cggaattaac 240 

cctctgcttt ccatcctgcg gcacgcagca gatctcctca gagagcaggc aaacaaaaga 3 00 

aatggatatg agataggaac aataaaacta ttctacagtg caaaaaatac cagcgaactc 360 

ctgtttaaga aaaatatcct tgatttagta aatgaatttc ctgagaagat tgcatgcagt 420 

ttgcatgtta caaaacagac tacacaaatc aatgcggaac tcaagccata catnacggaa 480 

ggaagaataa cggagaagga gataagagat catatttcaa aagagacttt gttctatatt 540 

tgtggccacc ttcaatgaca gactttttct ccaagcaact ggaaaacaac catgtcccaa 600 

agaacacatt tgctttgaga agtggtggta ggaggcagac aaaggcagaa aaaattaaga 660 

ggtgagatct actcaggaga gctcaaaann aaaaaaaaaa aaactnggac ctntagaact 720 

atagtgagtc gtnttccgta gatccagaca tgataa 756 

<210> 496 
<211> 744 
<212> DNA 

<213> Homo sapiens 
<400> 496 

ctttnaatcc cttgcactcg tcttntgnag gaccttatcg attcgaattc ggcacgagat 60 

aacacacatc acagtatgct ctcagaaatt tctttatttg aaccctatac caatatctgt 120 

tgatcaatga ccatttttgc tcagcatgga gaaacagtgc cctgcatgaa gggtagtgag 180 

aataaaaagg atcttaccac ctttatcatg agggtggctt tgctctctcc attccaagtt 240 

gttctctgtt ctagaaagca gatgtagtag acatctactg tttttgccta aacagaatcc 300 

ctttttcctt tttttgttaa aagtactcat ccctaatatt acattgttct ggaaggactg 360 

aaaataacag aactcagcac catgatcgga ccgggacaat cagattattt cattcctcag 420 

caaacggaga tcgatccgaa aagtggaaat atgagctctt ctttggtgtt ggcatatgga 480 

ccctgagaga aagaacttta attttttctc ttggactgca ataaagtata gctgcctaaa 540 

ataccgtttc ctgacacttg gaggtttgcc acaatcggtg aaataaaggc aagacgtaac 600 

actggatgaa aaaaaaaaan nnnnnnaaaa aaactcgagc ctntagaact atgtgatcga 660 

ttcgtagatc cagaatgata gatcattgtg agtttggaca accacactng atgcagtgaa 720 

aaaatcttat tgngaattgn gatn 744 

<210> 497 
<211> 772 
<212> DNA 

<213> Homo sapiens 
<400> 497 

gnttgnngtn taantttnta aggatccctt tntntgaanc cctttctgca ggatcccatc 60 
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gattcgaatt cggcacgagg caggagnaat cacttgaacc ctggaggttn cggttgcagt 120 

gagcacagat catgccactg cactccagcc tgggcaacaa aacgagactt cgtctcaaaa 180 

aaaaaaaaca tagaatttgg atcctttggt cgggttctcc caaattcttt tgaggtgtcc 240 

atggtcaact gcttcagctt tgttttggca accccctgcc cgaagtcgca tataggctgt 300 

tcttcacctt gtttccaagg ctgaggaaca gaaagtagcc tctgttttga ggaggtggaa 360 

gttaagtata catttatttt ttactgtgac ttgttcagga ccacatttta caaaatgcct 420 

tgtttccttc attgtttctg gaaaggaaag ttctattaat attgntttac tttgaatata 480 

gaatagtttt tttaattagg gcttattttg aaaaattctg agtttaattc aaatgtatgc 540 

caataccttc caaagtaagg taatattcag agacagttgt tggtgatcag atggcttaga 600 

gaaaatttct ggaatattca cattcgaaga tccttattat gaatgtcttt gacttaaatc 660 

taaccaaaaa ctgcacatta ttctttgnac attttcatta tatagngtta acaagcttan 720 

ttgcaaacca ataaatactt aagctattta aaaaaaaaaa aaaaaaactc nc 772 

<210> 498 
<211> 773 
<212> DNA 
<213> Homo sapiens 



<400> 498 

nttnagcnta nnagccgttg tantgaagcc cntttgctac ttgctctttt tgcaggatcc 60 

catcgattcg aattcggcac gaggacccag gtagaccagc tcaagagttc atgttctttg 120 

tcatcctcct gtgagctctc tgtaagtctc tntcttgccc atcaccacat ccctagtact 180 

gggtatcagt ctggccactt ggctttctgg tttgccccaa tgtggtctat tcttgatgca 240 

gctaccaaag taatgttnta aaaccattat accaagttac tatccttgtc aaaaccccca 300 

gtaactgcca atctcactta gaataaaatc cggactcctg tgaagcacag nataaactgg 360 

cactgcctat gcagcaacct catctttacc gtttctgcct tgctcactcc cttcagcgcc 420 

ggtattcttc ctgatgcccc tagtacacaa caactccttc ctgctccaag agtaggaaaa 4 80 

tnactgtctc tctgccagtg agattcctct tctggtatta cctntgcttc attgctgaat 540 

cttctgcaat atcatcttct aaaaagagcc tttnaaaatc accttttcta ttatgcccta 600 

ctcantttcc agtccctgaa tggccattcc ccactttcat agccacttaa ttgctatctg 660 

aaattacact taaaatggtc accttcatga tgggaaggca attaattgcc tttgtcactg 720 

gtatgtctag agaacaagca gnttggctca tagtaggcac tcaacaaaaa ttt 773 



<210> 499 

<211> 735 

<212> DNA 

<213> Homo sapiens 



<400> 499 

gcttcaatan ctttttctaa ngctcttttt gcaggattcc atcgattcga attcggcacg 60 

agagtaccca nanttgcnag gagtntnntn actgatntag ccaggtggca atnatgagtg 120 

aatggatnaa naaaggcccc ttagaatggc aagatnncat ttacnnagag gtccnagtgn 180 

canccagtga cangaatgag tttnaaggga tgggttttaa ctacagaccc agnctctgcc 240 

aatatngacc ttgtgaactt ccttgaagat ggcancatgt ctgagaccgg aattatggga 3 00 

catgctgtgc agactgttga aactntgaat gaaggggacc atagagtgag ggataagctg 360 

atgcattttg ttcacgtctg gagactgcaa agcatacagc ccacaggatc tggaagagag 420 

aaagaacagc ctanagnaaa tggctngaga ngaaccacat tcccatcact gaacagggan 4 80 

.acgcttcaag gactctctgt gtggctgggg ncctgactat ngacccacca tatggtcana 540 

naaattncac cagctctnat gagantattn tgtcgcgtgt tcaggatctt antgaaggac 600 

atcttacant ttnccaanna naagncatga aatgtgacat tctgcttgaa naagacnata 660 

ttttatcctc atnaatgttt aaatgtaaaa nnnnananaa aanactcgag ctntnaaatn 720 
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tngtgagttn anang 735 

<210> 500 
<211> 926 
<212> DNA 

<213> Homo sapiens 
<400> 500 

tttaagccct ttctactnct cttttgcagg attccatcgn ttcgaattcg gcacgaggat 60 

ctctatacta gtgaacagtg ccagttccac actttggact tagaactgtt ctctagttat 120 

tgtaacacag aatactgtca atccctaatt tacttaatgt tacttattgg aagtggggct 180 

gatgaaatac gcacaggagg gaaatctact gtgtttaggc acaggcagnc ccagtgtata 240 

aggagatcat attccaaang gttgtcagtt ggntgtttgc aacctggaat gtattttcct 300 

ttagagacca ngttatccat ggtggttagg cccctagagc agctggaaaa agatgatcaa 36 0 

accaataggt tngctgacat cnaataatgt aataagtttg ctaaaggaat ctaccatcaa 420 

atntnatatt gnttccaggg aaggttgtnn nttaanntnc cntcttngtg ncatantgga 480 

cnntcccntn ccagtcatnt ncntnannnc tngggcnngt ntngnnttng tntntttngn 540 

cnnctnanca atatttcata tcncccctng ctaaaattct ttnanannaa nttctcantt 600 

tctcccttta ctanaanttt ngtntttnnt ccntttanta tttnnnccta tntntntcgt 660 

tcnnanatnt cattnnntnn ttntnngctn ntnnatcacc cttanctcnn tctcanntat 720 

cntnntcnta ttatctctnt attnntcnct tntnatnatc nttccnnntt gtntanncna 780 

ttatntcttg ttnntntnct cncatctctn tcntnttctc ngctnannnn actccnnnnn 840 

tcncnctcnt nnnnanatnc atatnctnct ttngntatat annnnnntnt ntacntanct 900 

cnnnatnnca tnncnatatn nttngt 926 

<210> 501 
<211> 706 
<212> DNA 
<213> Homo sapiens 

<400> 501 

naatncttgg ctcttgttct ttntgcagga tcccatcgat tcgaattcgg cacgagaatg 60 

caaagggctg cagttctcat tcaggctact ttcaggatgc acagaacata tattacattt 120 

cagacttgga aacatgcttc aattctaatt cagcaacatt atcgaacata tagagctgca 180 

aaattgcaaa gagaaaatta tatcagacaa tggcattctg ctgtggttat tcaggctgca 24 0 

tataaaggaa tgaaagcaag acaactttta agggaaaaac acaaagcttc tattgtaata 300 

caaggcacct acagaatgta taggcagtat tgtttctacc aaaagcttca gtgggctaca 360 

aaaatcatac aagaaaaata tagagcaaat aaaaagaaac agaaagtatt tcaacacaat 420 

gaacttaaga aagagacttg tgttcaggca ggttttcagg acatgaacat aaaaaaacag 480 

attcaggaac agcaccaggc tgccattatt attcagaagc attgtaaagc ctttaaaata 540 

aggaagcatt atctccacat tagagcacag tagtttctat tcaaagaaga tacagaaaac 600 

taactgcagt gcgtcccaag cagttatttg tatcagtctt attacagagc tttaagtcca 660 

aagatatcaa atatgcacgg gctgcacact aatcagtctt ctatca 706 

<210> 502 
<211> 784 
<212> DNA 
<213> Homo sapiens 

<400> 502 

ttnntttttt tggttaccct ttgctctngg nctttttgca ggatccctcg attcgaattc 60 
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ggcacgagcc ttccacggtt 
ctgagtgttg gaattgtaag 
caccataaaa cacacaggtg 
cacagaccct tagtcccttc 
tgggatgtga caccgagata 
tttggtgaag taggtggtct 
aacagtggtt ggtaggccat 
tgatgttatt agcaactgtg 
aattgttagg agttttatta 
ntncnnannn nccncccacc 
gnngaggccn tatttacgtt 
ntttngggcc aaacccccaa 
gnnt 



atttcacaga tatggagagc tggaagcagg gagtgagtct 
ggatcagaag cagggatcag aagcagtggt gaagttcatc 
actttgcctt gaatctgcag gactgaagcc aactcttggg 
cttggccact ctaagtcaga tagtccagag ccaggccctt 
aatcagagaa aagctgtgaa gcttggggaa cagagggact 
gcagtttcta tcttcttggg aaaagcaagc tggaaaagtg 
agtgctccca gctgggtgac ataatgacca cacagcacag 
tggtggagta gttgtgggct ggacaaatca atcgtgtgga 
cattaaactt gttaacctaa aataccatca aaaaaaaaaa 
nancntncna aaaaaancct cganccttta aaaacnnntn 
anattccaga cnttgaatan ggatnccatt tgnattgaaa 
ccttngaatt gccattngaa aaaaaaatgc cttttatttt 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
784 



<210> 503 
<211> 764 
<212> DNA 

<213> Homo sapiens 
<400> 503 

ttnntnttcc ttgaancctt tttctacann cncctttgca gatcccncgt tcgaattcgg 60 

cacgagagac aaagaaaagg tggcaatcat agaagagttt ntagtaggtt atgaaacctc 120 

tctaaaaagc tgccggttat ttaaccccaa tgatgatgga aaggaggaac caccaaccac 180 

attactttgg gtccagtact acttggcaca acattatgac aaaattggtc agccatctat 240 

tgctttggag tacataaata ctgctattga aagtacacct acattaatag aactctttct 300 

cgtgaaagct aaaatctata agcatgctgg aaatattaaa gaagctgcaa ggtggatgga 360 

tgaggcccag gccttggaca cagcagacag atttatcaac tccaaatgtg caaaatacat 42 0 

gctaaaagcc aacctgatta aagaagctga agaaatgtgc tcaaagttta caagggaagg 480 

aacatcagcg gtagagaatt tgaatgaaat gcagtgcatg tggttccaaa cagaatgtgc 540 

ccaggcttat aaagcaatga attaaatttg gtgaagcact taagaaatgt cattgagatt 600 

gagagacttt tataggaaat cactgatgac ccagtttgac tttcatacat actgtatgan 66 0 

ggaanattac ccttagnatc ttatggtggg actttattta aaaacttnca nnaatgttcn 720 

ttcgacagcc ttccatttta acttcnaagg cnncaangaa ttnt 764 

<210> 504 
<211> 795 
<212> DNA 

<213> Homo sapiens 
<400> 504 

ttgtacntct ttttnnaaac ccntngctac ttgttctctt tgcanggatc cctcgattcg 60 

ggaatctcct agaaagttgt gattttcgag ccatatcctt ctgtggtaga tcctaatgat 120 

cctcagatgt tggccttcaa ccccaggaaa aagaactatg atcgagtaat gaaagcactg 180 

gatagcataa cttctatcag agaaatgaca caagcaccat atctggaaat caagaagcaa 240 

atggataaac aggaccccct tgctcatccc ttactgcaat gggttatatc aagtaataga 300 

tcacatattg tgaaactgcc agttaacagg caattgaagt ttatgcatac tccacatcag 360 

ttccttcttc tcagcagtcc accagccaaa gaatccaatt ttagagctgc taaaaaactc 420 

tttggaagca cctttgcatt tcatggctca cacattgaaa actggcactc ctcctganga 480 

atggtctggt ngttgcttct aatacacgat tgcagctnca tggngcaatg tatggaagtg 54 0 

gaatctatct tagtccaatg tcaagcntat cattttgntt actcagggat gaaccangaa 600 

acagaaaggt ntcagcccag gacgagccac cttcaagcng ttaanaagcc agcaattaca 660 
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ttcacagtcn ccaggaaana aaaggncagn cctatccccc ctttncctgg caaaaggccc 720 

gtnaacctta aanaaactgc ctttagccct ttatnntgga aagtggattc ncncttnatt 780 

cttggacccc tgncn 795 

<210> 505 
<211> 774 
<212> DNA 

<213> Homo sapiens 
<400> 505 

tnntntnntt nantngaacc ctttnctctt gctctttttg caggatccct cgattcgaat 60 

tcggcacgag cacaaggaga agaagttaat taacattgaa ngatgagaag acatcttgga 120 

agaacttgaa ttgggccttg gaagaagaac agccattcaa atagatagaa ttgtggtagc 180 

aaaggcatag aggtaggaaa gtatagatct ccagggacag tagtcatggg gttggggcac 240 

tgttggaatt taaggttgga aggatatatt ggagcccctt gaatacggta acaaggcaca 300 

ccttgggcag tggagagtta tcagagtgtt tgaaaaggag ggttattgag taaataaata 360 

gactggtact ttaggaattt taaaatgtgg atcattgtac tactaataac tatttatttt 420 

atatttacta tctactaagt aatttacatg tattttcttg tactgactgt aaaccttctg 480 

ggtgtgggtg ttttaagtgc cattttactg atnaagaaac tgaggcttaa atagttgaaa 540 

taagtcaccc tgttagtgag tggccagaat gacaagtcag atctanggtt tgtctaactn 600 

ccaaagatria tataaaaata atggatctct ccttttccct tatgcataaa atatggggag 660 

cntttttaaa tcattaccca tncgattgnc caaaaaaata cctttnggga aaactgatta 72 0 

ttantattcc anaataaatt tcaacggcct gcntngnctn ctttacaact ttnt 774 

<210> 506 
<211> 796 
<212> DNA 

<213> Homo sapiens 
<400> 506 

gccnccccnn tttngntctc aacttgtacc ctttttgcan nancncgnnc tncttgcagg 60 

ntcccatcga ttcgaattcg ccacgangtt atattaaatt attctttgtt tttctttttc 120 

ttttaataaa gcctgcaagt tactaaattg tagtttcata aattctgtag taaagtatca 180 

tcttggcagt gtgccaaagg tgaaaatgat gctttctcta acagagaaat tcttagtgac 240 

tccagtcgta gaaaaacgtc tttacaacct gaataagatt gaagaattgt gaacatacca 300 

tggcctattg gatgaatcat ttgccgtagg ctaaatcaga ctgtagggtt tgtgatggat 360 

ttatggagta tgtgggtata gaaatcatga atctagcatt tgttttcaga gattcaagca 420 

tagtcttaag ggtanatcag aaatgacaaa tgaattcaaa acctagcagg tgcattgtna 4 80 

atgtgtgccc agttntgttt tggaaatggc agttccttgg ggtcatgttt ctactggcaa 540 

aatttgcaat antgtnctat tgtntgtaat ttcaaaattt ataagattat cccccgttcg 600 

cccaagtaaa acctgtnctg cccaatanaa tcctggantc gnngagaaat cgcntccatt 660 

cgnngntcaa ctcgggatnc ntcgncttaa naaaatnttn tccnggancc ccntcatnan 720 

gaanaacacc anactattnn gggnacctgn aangctcaat ngcccnngcc ncnnangncn 780 

nttttccngg naannn 796 

<210> 507 
<211> 774 
<212> DNA 
<213> Homo sapiens 

<400> 507 
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ctnntttntt ttngaancct tngctcttgt tctttttgcg gatcccatcg attcgtgaag 60 

aggagacggt gacctgggct ccttatgtgc ctgaaagagt ttgagtttcc tgttaactcc 120 

aaatcaacag tattttcaac aagaaatgtg caattgaaat caagtgctgt ttaagtgcag 180 

ctaggatttc cacaggaaga cacttgcagt gaacagagtt atggagcagc aaaaacacag 240 

atctatttgg aaaaagagaa aacatatgcg ttgtattttg cttcaattat aaaataccat 3 00 

cctctcaaag gtggttctaa attacaaagg actttgattt ctaggtagat tctgggtaga 360 

gacttccttt catattgagg cattaatgac accttttaac ctgggaagca atatgactgg 420 

agttgtactt tgagaagatt aatcaggttt ggttgcagaa tgaaagagaa gatgaagtca 480 

agagattggt ttagaggctc tagcagaagc ttagtcatat ttcaaaatga tcaaatatca 54 0 

agaaaaattc tgagctgcat aacttgtata aagtaatttt cagtgatttt ttcatggtta 600 

tgatnaaaga actggattta nccagaaacc tttacctgga ttcaagattt aatttttcct 660 

ttgagcctca tccttaaagg attttcggga aaacattaag gggagccaaa nccnattggn 720 

tggttgggcn tgccctnnaa ttgcctttgg acttttttaa ccgggctttt gniui 774 

<210> 508 
<211> 724 
<212> DNA 
<213> Homo sapiens 

<400> 508 

cttgcctttg aaaancgttg gctactngtt ctttttgcag gatcccatcg attcgaattc 60 

ggcacgaggc ggcgctgacc cggccggccc cacacccgct cttcctcttc tttgccgcgg 120 

actccctttc ctgcctccaa gacctggtgt ctcccactgt gagcccagct gtcccacagg 180 

cagtccccat ggacctagac tcaccttccc cttgcctcta tgaacctctg ctgggcccag 240 

cccctgtccc agctcccgac ctgcacttcc tgctggactc aggcctccag ctccctgccc 3 00 

agcgagcggc ctcagccacc gcctcccctt tcttccgggc cctgctgtca ggcagctttg 360 

cagaagccca gatggacctg gtgcccctgc gaggtctgtc gcctggtgca gcctggcctg 420 

tcctgcatca tttgcatggt tgtcgggggt gtggggctgn nntggggccc gtgcccacac 480 

cangcnancc cctgtatggg atcanaggcn cgaagangca ntgnangctg ntggcanntn 540 

aantactgnc tgggctggaa nangaactnn taaaagtcnt ngcccnnatc caccttggna 600 

cccnannttn nncchntant cnnngggntn angtggtnnn nnctngggac agntcnntnt 660 

ggnntgncna tngnncnnat gnanacttgg ggttcannaa ncntttccnn atgnaancng 720 

ngtc 724 

<210> 509 
<211> 803 
<212> DNA 

<213> Homo sapiens 
<400> 509 

tnnnnnttta tttcnttcgt tctngntttt attacatcag ctcttttctt tttgcggtcc 60 

ctcgttcgca attcagagac acacataaga aactggaaga agagaaaggc aaaaaggaaa 120 

aagaaagaca ggaaattgag aaagaacgga gagaaagaga gagggagcgt gaaagggaac 180 

gagaaaggcg agaacgggaa cgagaaaggg aaagagaacg tgaacgagaa aaggagaaag 240 

aacgggagcg ggaacgagaa cgggataggg accgtgaccg gacaaaagaa gagaccgaga 300 

tcgggatcga gagagagatc gtgaccggga tagagaaagg agctcagatc gtaataagga 360 

tcgcagtcga tcaagagaaa aaagcagaga tcgtgaaagg gaacgagagc gggaaagaga 420 

gagagagaga gaaccgagag cgagaacgag aacgggagcc gagagagaga gcgagagagg 480 

gaaccgggag cgagaaagag aaaaagacaa aaaacgggac ccgagaagaa gatgaagaag 540 

atgcatacga accgaaaaaa aaaaaaaaaa aactcgagcc tnttaactat agtgagtcgt 600 

attacgtaga tccagacatg ataagataca ttgntgagtt tggacaaccc ccacttgaat 660 



138 



WO 99/33982 PCT/US98/2761 0 



gcagtgaaaa aaatgctttn tttgtgaaat tttgngatgc tnttgctttt tttgtaacca 720 
tttttagctt gcaataaaca agtttnccac caaccanttg cnttcatttt nttntttcan 780 
gttcaagggg aagtttttgg aag 803 

<210> 510 

<211> 789 

<212> DNA 

<213> Homo sapiens 



<400> 510 

gntttnnnnc nnttttaatn tacatacanc tacttgttct ttttgcaggg atcccatcga 60 

ttcgaattcg gcacgaggga acccccacca ttaagctaaa gtaaaaccct tttgagggaa 120 

g a 9erg a 9 act ggggagaagg gaaaagagag aaggcaggga gagtagggag agaaaacctt 180 

ccagcagccc agtaaactgc gggcgaagag atctacccgt ctccctccct cccacagtta 240 

ccattggcct tgtcatcgca agcatttgac aaagacttgc ttgtttgggc ctgtcacctc 300 

ctgaaaggct gctttagctg tggatgccct tgattaaggg agagagcgcc taggagctgc 360 

ctgccccanc tggggtgacg gctgtagggc tgggtctatg ttgcaagccc tatatcctan 420 

catgcagtgg aaagtgctta gctctctccc tcctgacctc tgggcagcca gtcatcaaag 480 

cagagagacg tggcggcatg tgggcagcat gcccaggttc cttgctgact cagcacttat 54 0 

ttctgtagtt ttaaaaaaga atttaatgtt tttggttgta tttttttggg ggggtgaggg 600 

tgggcaaaaa catgggggta gttctgagtt gttagaaatg tttctgaatc aagtttgttt 660 

gaaaacacgt tgtgcctttg tacccattat aagatggtca taanacccaa gaactgataa 720 

gctttgggtt ttttttggtt tggtttggtt ttttgcttca ttttacccat tcatgcctag 780 

ggtttccat 789 



<210> 511 
<211> 776 
<212> DNA 
<213> Homo sapiens 

<400> 511 

catanagntc ttgccttttt gnaggacnct cgattcgaat tcggcacgag cccccatctt 60 

cactggttat tccacttatt taaaatgtcc agaataagca aatctccata tagaggaagt 120 

agattagtgg ttgcttcggg atgggaggaa tgggaagatt gaggtctttc ttttgcagtg 180 

ataaaaatgt cctaaaattg actgtagcga tggtcacaca actctgaata tgcttaagac 240 

cattgaatta cacactttac gttggtgaat tgtatggatg taaattatag ttcaataaca 300 

tagttacaaa agataatcaa aagcatgaaa gcactgttga tgtggnttgg atctgtgtcc 360 

tcaccgagtc tnatgttgaa atgtaagccc cctggtggga ggcgatggga ttatggggca 420 

gantcctcac aaacgggtta gcccacccgc tcaggctgtt ctcctgatat tgagtcctca 480 

tcacatctgg ttgcttcaaa gtgtgtggng ccttccctct atctcctact gctctggcca 540 

tataagangt gcctgcttct ccttcgcctt ntacatgatt gtaaagtttc ctgagcctcc 600 

tagaacnaaa gctgctgngc tttctgtcca tctacangan cgtgagccca attaaacctc 660 

tttttttttt ttnngaggnn ntttnntnnc nntccnnnca ntttnanann cctngnanng 720 

gtttnnaaaa anaananngn naannnnnnn nncccccngc ccttttaaaa taaaaa 776 



<210> 512 
<211> 917 
<212> DNA 
<213> Homo sapiens 



<400> 512 
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ttatttcata 
tgcgaggttt 
tccttaccct 
gggcgggggc 
gccgccgtac 
cagcttcccg 
caggtccccc 
nccncanggt 
aatgttaatg 
nttggcctta 
nnattncntt 
tttnnttnaa 
ntnangggaa 
nnnttaaaaa 
nttnnnnnna 
ttngnnngna 



nggggnnttt 
annntnn 



naaaacnttt 
nnantttttt 
annantnttt 
ntttnnnccc 



naaanccaan 



aactattgtt 
tcggctttgg 
ggtccgggtg 
ggaccacggc 
gggccccggt 
gggggccggt 
gcggggtccc 
tntncaagga 
anttggaaaa 



ctttttgcag gatccatcga ttcgaattcg gcacgagggc 
ctcctgatat gcagcgacag aattttcggc ccccaactcc 
gaggaggttg gggtagcgga agcagcttcc ggggaacccc 
cgccctcccc tcgagacggg tacgggagtc cgcaccacac 
ctaggccgta cgggagcagt cactctccgc gacacggcgg 
tcgggtctcc gtcccctggc ggctaccctg gctcctactc 
agcagcaatt cggctactcc ccaaggcagg annanaanca 
catntacacc atttggatca nggcgtntta naaaaaaaan 
ntatttnaaa gcctttnaat gnttnnnnna atccttnggg 
attntngtng gngggntntt aannccnnnc aantncnnnn 
nnnccanggn cnnaaaaaaa nggggnaann aaaaaacttt 
tggaaaattt naaancntng gaaaancntt tnnntngttn 
tgggnncnaa aaaacntttt naannnntnn nggttnnnan 
ccaannnnnt nnanngnanc ttttnnantt ngggantaaa 
tttnngnnna atttnnnnnn annnnnnnan nnangggnnt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
917 



<210> 513 
<211> 780 
<212> DNA 
<213> Homo sapiens 

<400> 513 

tnnnnnnttt aaatccatta gctacttgtt ctttttgcag gatcccatcg attcgtgcgg 60 

gagcacccga gcctgcggct ccagacggac gcccgcaagg tgaggtgcat cctgacaggt 120 

cacgagctgc cctgccgcct gccggagctc caggtctaca cccgcggcaa aaagtaccag 180 

cggctggtcc gcgcctcccc ggccttcgac tatgcagagt tcgagccgca catcgtgccc 240 

agcaccaana acccgtangt ggtccncggc ggcgcgggga ggcccagggc aatnngacag 300 

nccctccgnt tgactccgcc agtgctgcag nccctactct ttcanagttg ggagccctgg 360 

gacccaggca ccaattgttc ttgcaaactc accctgcggc acatcaacaa gtgcccanaa 420 

cacgtgctga ngcacaccca aggccggcgg taccagcgag cttttgtgta aatatgaaga 480 

atgtctnaag caaggggtgg agtacatgcc tgctgcctgg tgcacccgan gangaagang 540 

gaaggacaaa tggacngtga acggccttcg cccgcgggaa agcttctggg agcccacatt 600 

caatgatgaa gggggagctg caagtgatga cagcatgaca gacctgtncc cctgactttt 660 

caccagaagg accttgaaca cngaggatgg ggatggactg atgatttttg acaacaaaga 720 

ggttgaaagg caaancccca aaaaaaaggc cttgtgaagg cagganaaan acaacctntc 780 

<210> 514 
<211> 793 
<212> DNA 
<213> Homo sapiens 

<400> 514 

tttnnnngnt ttannncatt ttgctactng ttctttttgc aggatcccat cgattcggaa 60 

ttatagtatt gacgtgaatc ccactgtggt atagattcca taatatgctt gaatattatg 120 

atatagccat ttaataacat tgatttcatt ctgtttaatg aatttggaaa tatgcactga 180 

aagaaatgta aaacatttag aatagctcgt gttatggaaa aaagtgcact gaatttatta 240 

nacaaactta cgaatgctta acttntttac acagcatagg tgaaatcata tttgggctat 3 00 

tgtatactat gaacaatttg taaatgtctt aatttgatgt aaataactct gaaacaagag 360 

aaaaggtttt taacttanag tagccctaaa atatggatgt gcttatataa tcgcttagtt 420 

ttggaactgt atctgagtaa cagaggacag ctgtttttta accctcttct gcaagtttgt 480 
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tgacctacat gggctaatat ggatactaaa aatactacat tgatctaaga agaaactagc 

cttgtggagt atatagatgc ttttcattat acacacaaaa atccctgagg gacattttga 

ggcatgaata taaaacattt ttatttcagt aactttnccc cctgtgtaaa gttactatgg 

tttgggggta caacttcatt ctatagaata ttaagtggga agtgggtgaa ttctactttt 

tatggttggg gtggaccaat ggctatcaag agtgacaaat naaggttaan ggatgattcc 

caaaaaaaaa aaa 



540 
600 
660 
720 
780 
793 



<210> 515 
<211> 770 
<212> DNA 

<213> Homo sapiens 
<400> 515 

cttattncat nnagctcttg ttctttttgc aggatcccat cgattcgaat tcggcacgag 60 

gttgtattgg aaagcagtag tgtggacgaa ttgcgagaga agcttagtga aatcagtggg 120 

attcctttgg atgatattga atttgctaag ggtagaggaa catttccctg tgatatttct 180 

gtccttgata ttcatcaaga tttagactgg aatcctaaag tttctaccct gaatgtctgg 240 

cctctttata tctgtgatga tggtgcggtc atatttatag ggataaaaca gaagaattaa 300 

tggaattgac agatgagcaa agaaatgaac tgatgaaaaa agaaagcagt cgactccaga 360 

agactggaca tcgtgtaaca tactcacctc gtaaagagaa agcactaaaa atatatctgg 420 

atggagcacc aaataaagat ctgactcaag actgactctg atagtgtagc attttccctg 48 0 

ggggagtttt ggttttaatt agatggttca ctaccactgg gtagtgccat tttggccgga 540 

catggttggg gtaacccagt gacaccacac tgattggact gccctacacc aatcagaact 600 

cagtgcccaa tgggccactg ttttgactcg gaatcatgtt gtgcactata gtcaaatgta 660 

ctgtaaagtg gaaanggatg tgccaaaaaa ttaaaaaaaa ccnccaaaaa agcttccaaa 720 

aaaaaacctt taaactatag tgagtcgtnt acntagatcc aacatgataa 770 

<210> 516 
<211> 825 
<212> DNA 

<213> Homo sapiens 
<400> 516 

tttccagttt tanttttttc ancttttnga tcnntttgca ggatccntct tttcgaattc 60 

ggcacgagat tctccctaaa ttgtngatcc cactgtttac naaactgttc tnttgtgctg 120 

gcntgctnan tgctntgtag nncctttctg nacnntaggc attgctcttg gagaacnnga 180 

tgtgctttnt ntnaaanggc anaccagngn tgnnctgnnt ttaatgatgc agancctnac 240 

tttatccaca cctggcccgt ttnacatttn agtaangnac gatatttggc tgatggctga 300 

acantttctg aaatacacnt ttagtgtatg gaantacaag accnntaaag gnctgccagg 360 

ttancatctc atctngcatt cnnntccttt ggcnanaaag gganatntca gaattatatt 420 

tcttgatggg gtcttttcaa tcantgtatc tgtcgaaann tcttaganaa anctatgtgn 480 

tcncggtgtt gtctaaaaan atnctttcaa anatgacccc tggaattncc tgananangc 540 

ttaaacgtga gaagacnggt nggcaaaaca ccctncnaag gttnttggna angcccnant 600 

ntgttttgtc tggcccatat aancttngcn ccattnaagc cncgggngag ctttgnatnt 660 

atattngngg ngttactttc tttgnncctt tgcggggaac ancttnnata atgcttntcn 720 

ncccnanntg gacntttgct ttttgnnncc nnaccccccc aaagggngcn cacctccant 780 

gaaaaagtct ttttnnaaaa gggctccttn ctnaaaaaaa nnnnt 825 

<210> 517 
<211> 1444 
<212> DNA 
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<213> Homo sapiens 



<400> 517 

ctctcncnnc nnnncnnntc tctnncnntn nnnnntnntn nnnctcnnnn cnnnatctnn 60 

nnncnnctnn nnnnncntnn cntccntctc ttntntngct ctcntntctc ntncatcttn 120 

ccnctattnt cntnntnntc nntcntcnnn antnctnnnt tctncctnnc canctntcca 180 

tnntntactn tcnntnntct ggctnttnta tntggggggt ctatttnttn ncttaaatcg 24 0 

actngttcca agtctcntan cngcntctnt ctnnctntct ntgcnctncn ctggggcntt 300 

aattncccnn gctnttatan aagngngnaa ttaaggtntc nnntctanng ctntgcaagg 360 

ctaatgntta gatccngnta gaanncgnta catgttggga acngacanct tnctgcncaa 420 

agngggctna ggcanngnnn tntgcaaann ctcnnntntc nnancttgnn tcncgtagan 48 0 

cggnnncccc tgaattttnn ancnngganc nttaaatnnt ntngnggtac gannccncnn 54 0 

ncgnnnnnnc gnntannccn canngttaan tgcncccnna nnnantcaac tctntnntcc 600 

tnntnnaacn nnnttantct annatnntta cnnntnagnt tttcctcnct nacnnctctg 660 

tncttnttnn atcttntnct tctcncttna tttntatctc ntntntntnc tnccctnatc 720 

tatctnctac nctctnttcc ncttctccct nncntctctc atcatatccc acgcnactna 780 

ncccctctnn ctcttacctn nntnctctcn tcntatctcn nnaccctctt tctntntctt 840 

atnncnccta tcctctactt attctcctcc tattntncca ctcacccttc ntntntctnc 900 

nctnntcttn tnctatttnt actntcncta ttcctncntc tctnntgnct cccaccccct 960 

cttcctctcn ctctcctnnn nnnactactc tcaccntctc nnctntcnct ctacnnntnn 1020 

ananntcctt antttcctnc tcatcacant actcttccct ctcatnntca nanctaantt 1080 

ntnctctcac tctaccactc tntnctccac tcatatnana cttctatant nctaatccta 1140 

tcttcttaaa cntctcctct tatcnctcta anctcctctt cntcgctanc tccnntncaa 1200 

ctcgnaaatc tctccaatnc tnccccactc taaaaatnnc ncntcngant cccacttttc 1260 

ngngcanaat nnaacncnan tccnctccct ttagctatct ctctanaaac cccntttctc 1320 

aacaggnacc nccctntntc tcnaaatcct catnctncta ctttatatnt cnccaagcct 1380 

cncctntgta anagcatctc nctntccncc aatnnanatc tccctnctcc natanatntn 1440 

anat 1444 



<210> 518 
<211> 706 
<212> DNA 
<213> Homo sapiens 



<400> 518 

ctaatggctg gnngctcgtt ctttccgcaa cancccngcg antcgaattc ggcacgaggt 60 

ccgaagaaaa agactgtggt ggcggagatg ctctctccaa tggcatcaag aaacacagaa 120 

caagtttgcc ttctcctatg ttttccagaa atgacttcag tatctggagc atcctcagaa 180 

aatgtattgg aatggaacta tccaagatca cgatgccagt tatatttaat gagcctctga 240 

gcttcctaca gcgcctaact gaatacatgg agcatactta cctcatccac aaggccagtt 300 

cactctctga tcctgtggaa aggatgcagt gtgtagctgc gtttgctgta tctgctgttg 360 

cttctcagtg ggaacggact ggaaaacctt tcaacccact gctgggagag acttatgaat 420 

tagtgcgaga tgaccttgga tttagactca tctccgaaca ggtcagccat cacccaccaa 480 

tcagtgcatt tcatgctgaa ggattaaaca atgacttcat ctttcatggc tctatctatc 540 

ccaaactgaa attctggggg aagagtgtag aagcagaacc caaaggaacc atcaccttgg 600 

agctccttga acacaatgag gcatatacat ggacaaatcc cacctgctgt gtgcataata 660 

tcattgtggg taaactgtgg atcgaacagt atggcaatgt ggaaat 706 

<210> 519 
<211> 734 
<212> DNA 
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<213> Homo sapiens 



<400> 519 



tngtaccaat tatctgctgg ctanntagcc taaanagntt ggtcngggcg aattcggcac 
gagggnaaag cagnaagtaa tgagcttgtc cgtcagctgg tagctttcat tcgtnaaaga 
gataaaagag tgcaggcgca tcgaaaactt gtggaagaac agaatgcaga gaaggcgagg 
aaagccgaan agatgaggcg gcagcagaag ctaaagcagg ccaaactggt ggagcagtac 
agagaacaga gctggatgac tatggccaat ttggagaaag agctccagga gatggaggca 
cggtacgaga aggagtttgg agatggatcg gatgaaaatg aaatggaaga acatgaactc 
aaagatgagg aggatggtaa agacagtgat gaggccnagg acgctgagct ctatgatgac 
ctttactgtc cancatgtga caaatcnttc aagacanaaa atggccatga agaatcacga 
gaagtcnaan aagcatcggg aaatggtggc cttgctaaaa caacagctng angangaacg 
aagaaaattt ttcaagacct caaattgatt gaaaatccat tagatgacaa ttcttgagga 
agaaatgnga aagatgcacc aaaaacaana agctttctac acantnaaat ccnannaact 
ccatccntct anaactatnn gtgagtcctt nttacntcna tccagacatg antancnata 
cnattgatgg aacc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
734 



<210> 520 
<211> 701 
<212> DNA 

<213> Homo sapiens 
<400> 520 

ctaatgctgg ctnttgttct ttttgcagga tcccatcgat tcgaattcgg cacgagccca 60 

catgtaccag gttgagtttg aagatggatc ccagatagca atgaagagag aggacatcta 120 

cactttagat gaagagttac ccaagagagt gaaagctcga ttttccacag cctctgacat 18 0 

gcgatttgaa gacacgtttt atggagcaga cattatccaa ggggagagaa agagacaaag 240 

agtgctgagc tccaggttta agaatgaata tgtggccgac cctgtatacc gcactttttt 300 

gaagagctct ttccagaaga agtgccagaa gagacagtag tctgcataca tcgctgcagg 360 

ccacagagca gcttgggttg gaagagagaa gatgaaggga catccttggg gctgtgccgt 420 

gagttttgct ggcataggtg acagggtgtg tctctgacag tggtaaatcg ggtttccaga 480 

gtttggtcac caaaaataca aaatacaccc aatgaattgg acgcagcaat ctgaaatcat 540 

ctctagtctt gctttcactt gtgagcagtt gtcttctatg atcccaaaga agttttctaa 600 

gtgaaaggaa atactagtga atcacccaca aggaaaagcc actgccacag aggaggcggg 660 

tccccttgtg cggcttangg ccctgtcagg aaacacacgg g 701 

<210> 521 
<211> 784 
<212> DNA 

<213> Homo sapiens 
<400> 521 

naacacttng ctacnngttc tttttgcagg atcccatcga ttcgaattcg gcacgaggag 60 

atctctggga tgtcagtgag gctggttgaa gaccagaggt aaactgcaga ggtcaccacc 120 

cccaccatgt cccaggtgat gtccagccca ctgctggcag gaggccatgc tgtcagcttg 180 

gcgccttgtg atgagcccag gaggaccctg cacccagcac ccagccccag cctgccaccc 240 

cagtgttctt actacaccac ggaaggctgg ggagcccagg ccctgatggc ccccgtgccc 300 

tgcatggggc cccctggccg actccagcaa gccccacagg tggaggccaa agccacctgc 360 

ttcctgccgt cccctggtga gaaggccttg gggaccccag aggaccttga ctcctacatt 420 

gacttctcac tggagagcct caatcagatg atcctggaac tggaccccac cttccagctg 480 

cttcccccag ggactggggg ctcccaggct gagctggccc agagcaccat gtcaatgaga 540 
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aagaaggagg aatctgaagc cttgggtaag gatttggggc acagtaccag gaggggggct 
tggtgccaga cctcatgagg aagaaggatt ttcctatgta cagagaaggg gacccctgtc 
ctgttgggan gtgctgtgca aacctaacca aagttactaa cccctctggt ttctgnggtt 
acacaaangg ggataaatac aaagctttnc ctnaactagc caattctatt tgggtttcct 
gagt 



600 
660 
720 
780 
784 



<210> 522 
<211> 719 
<212> DNA 

<213> Homo sapiens 
<400> 522 

ttctaatttn aatccttnaa atnggttctt tntgcaggat cccatcgatt cgaattcggc 60 

acgagagaac acaggtgtcg tgaaaactac ccctaaaagc caaaatggga aaggaaaaga 120 

ctcatatcaa cattgtcgtc attggacacg tagattcggg caagtccacc actactggcc 180 

atctgatcta taaatgcggt ggcatcgaca aaagaaccat tgaaaaattt gagaaggagg 24 0 

ctgctgagat gggaaagggc tccttcaagt atgcctgggt cttggataaa ctgaaagctg 300 

agcgtgaacg tggtatcacc attgatatct ccttgtggaa atttgagacc agcaagtact 360 

atgtgactat cattgatgcc ccaggacaca gagactttat caaaaacatg attacaggga 420 

catctcaggc tgactgtgct gtcctgattg ttgctgctgg tgttggtgaa tttgaagctg 4 80 

gtatctccaa gaatgggcag acccgagagc atgcccttct ggcttacaca ctgggtgtga 540 

aacaactaat tgtcggtgtt aacaaaatgg attccactga gccaccctac agccagaaga 600 

gatatgagga aattgttaag gaagtcagca cttacattaa gaaaattggc tacaaccccg 660 

acacagtanc atttgtgcca atttctggtt tggaatggtg acaacatgct ggagccaat 719 

<210> 523 
<211> 710 
<212> DNA 

<213> Homo sapiens 



<210> 524 

<211> 730 

<212> DNA 

<213> Homo sapiens 

<400> 524 

ttnnnnnttt aancnttcaa atcnctaggc tacttgttct ttttgcagga tcccatcgat 60 



<400> 523 



tnnncttcaa atcgntngct 
cgagagatta tgagcatgta 
cagagagaca agatggtgaa 
ttcctgtacc tccaaagaga 
taatttcaga gagaggactt 
gtaaaggtca tgaggctgaa 
ataggctatt ccctaaactg 
gtaaaaagga agttcagacc 
aagattttgt tagcaataat 
ctgaattaga tccctttctg 
gtagaagcct aacagaagag 
aaagaaggca ggcaaagctg 



cttgttcttt ttgcaggatc ccatcgattc gaattcggca 
gaagatgaaa cttttcctcc tttcccacct ccagcctctc 
ggaactgagc ctgatgaaga gtcaggaaat ggagcacctg 
acagttaaaa gaaatatacc caagctggat gctcagagat 
ccagccttaa ggcatgtatt tgataaggca aaattcaaag 
gacttgaaga tgctaatcag acacatggag cactgggcac 
cagtttgagg attttattga cagagttgaa tacctgggaa 
tgtttaaaac gaattcgact tgatctccct attttacatg 
gatgaagttg cggagaataa tgaacatgat gtcacttcta 
acaaacttat ctgaaagtga gatgtttgct tctgagttaa 
caacaacaaa gaaattgaga gaaataaaca ctggccttgg 
ctgagtaata gtcagaccct aggaaatgat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
710 
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tcgaattcgg 
agacactctt 
aggccctaaa 
acgaggagcc 
acgagaagtg 
accgggacct 
tccccctgcc 
gcatccccct 
ctgggggccc 
gtaaaaccta 
tntngnnntt 
attaccttag 



cacgagccca cactcggaca ctgtggaatt ctaccagcgc ctgtcgaccg 
cttcatcttc tactatctgg agggcactaa ggcacagtat ctggcagcca 
gaagcagtca tggegattcc acaccaagta catgatgtgg ttccagaggc 
caagaccatc actgacgagt ttgagcaggg cacctacatc tactttgact 
gggccagcgg aagaaggaag gcttcacctt tgagtaccgc tacctggagg 
ccagtgacac cggcccctnc ctctacccac ccccttcccc cgcatgctga 
caggtaaggg ccctgccctg gaagactgga gggaggcccc aagccacggg 
ctcccaggaa gcagggaggg ggccgggagg ttttcctctc aagccccacc 
gggggcgagg gctgccccct cctcccctcc ccagtgaggg acattttttg 
ttttcatttt ggaaaatatt tatgaataaa tagttttata tgaaaaaaat 
nnnatnnnan aataaaancn tcgnncctct taaaactata gtgaagtcgt 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
730 



<210> 525 
<211> 711 
<212> DNA 
<213> Homo sapiens 

<400> 525 

gcngntnttn antttcaaat cgctnggcta cttgttcttt ttgcaggatc ccatcgattc 60 

gaattcggca cgaggataaa tacctcagcc cctcgccttc ctcaacccac ctggcaagtc 120 

ttcttaggat ctgatcccag ttttctggaa gcaatcctac cccagcccaa gcttcccaga 180 

gtcgagcctt aatccttctc acttctcagt gtcagagcag aaatgaatcc tggggttgac 240 

tgtgtccatt cgggttatta gcagctaaga agcccagacg agtagtgtga gctgccttgg 300 

gagcctcagt gagggcactg ggactggcct cactctcttg cccccagcct agtgggcttt 360 

ctcctctgtc tctccggtgg ccccaggcaa tcgactgcat cacgcaggga cgtgagttgg 420 

agcggccacg tgcctgccca ccagaggtct acgccatcat gcggggctgc tggcagcggg 480 

agccccagca acgccacagc atcaaggatg tgcacgcccg gctgcaagcc ctggcccagg 540 

cacctnctgt ctacctggat gtcctgggct agggggccgg ccaggggctg ggagtggtta 600 

gcccggaata ctggggcctg ccttagcatc ccccatagct tccacagccc cagggtgatc 660 

tcaaagtatc taattcacct taacatgtgg gaagggacag gtggggcttg g 711 

<210> 526 
<211> 692 
<212> DNA 
<213> Homo sapiens 

<400> 526 

tacangctac ttgttctttt tgcaggatcc catcgattcg aattcggcac gagagaacag 60 

ggagaagaga ggaagaggga gctgcaggtg ccagaagaga acagggcgga ctctcaggac 120 

gaaaagagtc aaaccttttt gggaaaatca gaggaagtaa ctggaaagca agaagatcat 180 

ggtataaagg agaaaggggt cccagtcagc gggcaggagg cgaaagagcc agagagttgg 240 

gatgggggca ggctgggggc agtgggaaga gcgaggagca gggaagagga gaatgagcat 3 00 

catgggcctt caatgcccgc tctgatagcc cctgaggact ctcctcactg tgacctgttt 360 

ccaggtgcct catatctcgt gactcagatt cccgggactc agacagagtc cagggctgag 420 

gaactgtccc ccgcagctct gtctcccttg ctagagccca tcagatgctc tcaccagccc 480 

atttctctac tgggctcctt tttgactgag gagtcacctg acaaggaaaa acttctatca 540 

gtactttgat atgtcacagt ttcatgttta tccagttcaa tgtattttta aatttttcct 600 

tgagacttct ttgactgata gattattgtg aatgtgtttt taaatttcca aatgtttang 660 

gattttcata tctttcttat gctgatttcc aa 692 
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<210> 527 
<211> 769 
<212> DNA 

<213> Homo sapiens 
<400> 527 

gttctngttc tttttgcagg atccctcgat tcgaattcgg cacgaggcca agcctcggcc 60 

tccactgcac ctgctgcgga gtgggcacct ttgcctgcaa ggccttttnc ccantgncca 120 

atggtanttt aaccagggtt tttgncnntt aaggaggcct tngtggtggg tngttaatct 180 

ggccnttccn tattgaaaag ctcctgttat tgtccacaga ccagaaggac ttgtaacctt 240 

ggtcccacag tctgacttng gcttttcaag cacccagaaa acttagaggg aatcttatag 3 00 

attccagaac ttaaggatac ctcaagggat agggtcacag ccaagaagtn caaaggaatc 360 

ttcagtctgg aacaaaaaca gaaccctttc atgattgaca aangtcactt tctgtttgcc 420 

tggaccaagc tactncagat catctgacca actcttaaaa atcacggcca ggcacagtgg 480 

ctcatgcctg taatcccagc actttgggaa gcaaaagtgg caggatcatt ncagcccaag 540 

agttcaagac cagcctgggc aacacagtga gtgagaccct gctctattta agaaaaatna 600 

ttaagaaatt tattaaaaaa gaagaatcag gaaaccaagt ncaacccaac ttaacctcaa 660 

tgaaccagcc cctaacacag atgangggat ttgggactga taagctctgt gctgngtcca 720 

tggcccgtca nttatcaagg ttgcactttt aaatgnggta tttttatgn 769 

<210> 528 
<211> 757 
<212> DNA 

<213> Homo sapiens 
<400> 528 

tnaatatcag ctcttgttct ttttgcagga tccctcgatt cgcangaggg tgttcgactg 60 

ctngagccna gcgaancgat gcctaaatca anggaacttg nttcttcaag ctcttctggc 120 

ngngattctg acagtgaggt tgacananag ntaancagga aaaacaagtn gctccagaaa 180 

ancctgtaca gaaacataag acaggtgana cttcgagagc cctgtcatct tctaaacaga 240 

gcagcatcng cagagatnat nacatgtntc atattgggaa aatgaggcac gttantgttc 300 

gcnattttaa aggcaaagtg ctaattgata ttanagaata ttgnatggat cctgaaggtg 360 

aaatgaaacc aggaagaaaa ggtatttctt taaatccana acantggagc cagctgaang 420 

aacagattct gacattgatg atgcagtaag aaactgtgaa attcgagcca tataaataaa 480 

acctgtactg tctagttgnt ntaatctgtc tttttacatt ggcttttgtt nnctnaatgt 540 

tctccangct attgtatgtt tggattgcag angaatttgn angatgaata cttnntttta 600 

atgngcatta ttaaaaatat tgagtgaagc tnatngtcaa ctttattaag gattactttg 660 

ctgccaccac ctagtgtcaa ataaaatcaa gtaatacaat cttaataaac ntttaaacta 720 

taaaaactcg acccttagac ctatantnag tcggttn 757 

<210> 529 
<211> 821 
<212> DNA 

<213> Homo sapiens 
<400> 529 

tnannnannc annnnnnnnn nnnnntttga agccattgct acttgttctt tttgcaggat 60 

cccatcgatt cgaattcggc acgagagcaa ttccactcct agctccaccc acaggaaatt 120 

gaaagcaaag acgcaaacag atgcctgtgc accaaagttc acgggcaagc atccttcggc 180 

cttaatgggc agcattccgt cgtcacaagc gggcattcat cctttcatca atagcgggca 240 

gcattccgtc gtcacaagcg ggcagcattc ctttcgccac aagcgggcag catcttgtcc 300 
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gtcacaagcg ggcagcatcc ttcgccaaag cgggcaagca tccttcgtca tagcggcagc 360 

atcctttgcc atagcgggca aggtggaaac cctgtccatc cactgaggcg tgcatagact 420 

aaacatggcc agtccaggca ctggaatcca ggcccgtaga acggcgccca cggtcaaaag 480 

gaatgagacc ctgatgcact gggcgacaca gacgggcgac acagacttgg agacatcatg 540 

ctaagtgaaa agccaggcac acggagcgga cggcgtgatc ctgctcacgt gatgtgtccc 600 

gaatgggcac gttcagaggg aagaagggag atggcgcttg ccggtgcccg gggacngggg 660 

ttgggagcga cggttgctgg tttggggttt ctttctgggg tgangaantg gttttgatat 720 

ttggnccgtt ggtgatgttt gcatacctct gaatatgctt aaganccaca gaattgacca 780 

ctttaaatgg atgaattgna tggtattggg aattacccaa n 821 

<210> 530 
<211> 765 
<212> DNA 
<213> Homo sapiens 

<400> 530 

gnntttnnnn nnnnnnnttt tatnnntaca gctacttgtt ctttttgcag gatcccatcg 60 

attcgaattc ggcacgagac taccccggct acggttcccc catgcctggc agcttggcca 12 0 

tgggcccggt cacgaacaaa acgggcctgg acgcctcgcc cttgcccgca gatacctcct 180 

actaccangg ggtgtactcc ggcccattat gaactccttt aagaaagacg acggcttcag 240 

cccggtaact ctggcacccc ggatcgagga caagtgagag agcaagtggg ggtcgagact 3 00 

ttggggagac ggtgttgcag agacgcaagg gagaagaaat ccataacacc cccaccccaa 360 

cacccccaag acagcagtct tcttacccgc tgcagcccgt ccgtccaaac agagggccac 420 

acagataccc cacgttctat ataaggagga aaacgggaaa gaatataaag ttaaaaaaaa 480 

gcctccggtt tccactactg tgtagactcc tgcttcttca agcacctgca gattctgatt 540 

ttttggtggt gtgtctcctn cattgctgtt gttgcaggga agtcttactt aaaaaaaaaa 600 

aaattttgtg agtgactcgg tgtaaaacca tgtagtttaa cagaaccaga nggttgacta 660 

ttgttaaaaa caggaaaaaa ataatgtaag gtctgttgta aatgaccaan aaaaaaaaaa 720 

aaactcngcc tntaaactnt tntgagtcgt nttcgtaaat ccaan 765 

<210> 531 
<211> 768 
<212> DNA 
<213> Homo sapiens 

<400> 531 

gnntttnnnn nnnnnnnttt taagntactg ctacttgttc tttttgcagg atcccatcga 60 

ttcgaattcg gcacgaggtt cttcaaagcc aaccaagaca ggcttagcag ttttagagct 120 

tcagaacaaa ttgccaaaag ccagagttgt ttatgctagt gcaactgggt gcttctgaac 180 

cacgcaacat ggcctatatg aaccgcttgg catatggggt gaggggtact ccatttagag 240 

aattcaagtg attttattca agcagtagaa cggagaggag ttggtgccat ggaaatagtt 300 

gctatggata tgaagcttag aggaatgtac attgctcgac aactgagctt tactggagtg 360 

accttcaaan ttgaggaagt tcttctttct cagagctacg ttaaaatgta taacaaagct 420 

gtcaagctgt nggtcattgn cagagagccg gntcagcaag ctgcagatct gattgatgct 480 

gancaacgaa tgaagaagtn catgtggggt cagttctggc tgtcaccaga ggttcttcaa 540 

atacttatgc atagcatcca aagttaaaag ggttgtgcac tagctcgaga ggaaatcang 600 

aatggaaaat gtgtngtaat tggctgcagt ctcaggagaa gctnnaacat tagaactttn 660 

gaagaaggcn ggggagaatt gatganttgg ttcaactgcc aaagtgtgtg cantcactca 720 

ttggaaaaca tttnctgctc cagcngggaa aacttatggt tacttggn 768 

<210> 532 
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<211> 761 
<212> DNA 
<213> Homo sapiens 



<400> 532 

cgtnttttnn nnccnannga aagcccttgg ctacttgntc tttttgcagg atcccatcga 60 

ttcgaattcg gcacgaggat cagcccacct cggcctcaca aagtgntggg attacaggcg 120 

tgagccacct tgcccaccca catcatacag ttgaaatgaa actttgccac aaccagcctt 180 

tgctgtacac acacatatat cactgaacct ggttgaaata aagntttttt tctttttcct 240 

ctggtattct gggttctgaa gtctggtatt ctggtattct gggttcaaaa gtatgacttg 300 

agagtgttgc tctggtattc tgagagttgc tctgtattct gggttctgaa gattatttga 360 

aaaataactc ctactacatt gaaatgcaga cttaaaaatt taaacattgg attaggcagt 420 

caaaaaaacc aagcaagcat aaaaggtcaa taagttgtaa tcttgatagt aaaggtggaa 480 

aacttattat aaatggaaag aaagtttatt tccttttttg gttgatgggc agtatgccat 540 

attataccca aagttctttt aaaaaatatt tccatcacca tttttattta aaataaacat 600 

ttgagggaag taccaaggca gcttttttcc tcaaaagtac ctggtcctct ttgggaatag 660 

cacattttan gggcattggg taatcctgag attttactca ntaaatcctg atggtactgg 720 

gtgtaaaata tctttagtng gattgaaggc cttgnggggg a 761 

<210> 533 
<211> 735 
<212> DNA 

<213> Homo sapiens 



<400> 533 

taaacatcng gctacttgtt ctttttgcag ggatcccatc gattcgaatt cggcacgaga 60 

cactgtccca ctccatcacc caggctggag tccagtggtg tgatcatagc tcgctgcatc 120 

ctccagttcc tgggttcaag ccatccctcc tgcctcagcc tccccagtag ctggaactac 180 

aggtgtgtgc catcacacct ggctttacat ttttctgtgg ggtcttacta tgttgcccag 240 

gccggtctca aactcctgag ctcaagtgat cctctgcctc agcctccaga gtatctggga 300 

ttacatatgt cggctaccgt gtctggccgt tcacatcttt ggccactatt tgcttgtgaa 360 

aaggtataat gaggtggtac ttatcatttt tactgngtct catgttttgt atatttttgt 420 

ttcatcaact aagatgcact gtaacatctc tgaaatctgg atatattatc aatggtttat 480 

catagttttg ttagcaatac actgtctttt agtggtgcct aaaataatgg tatagttgtg 540 

aggtgatctt agatttgatg aagcacagta tgcaggtagg cctaatgggg gaagatggta 600 

atataaaagc aagaagtatt ttttttttgt aatgactgaa agctgtctgt ggatgaccta 660 

ccctttnctt taaacacgat tntntcactt ncaactncaa acttgctcaa ctaatncttt 720 

aaaaataact tgagc 735 



<210> 534 
<211> 735 
<212> DNA 

<213> Homo sapiens 



<400> 534 

natngnttgc tcctngttct ttttgcagga tcccatcgat tcgagacaac ccagaaacaa 60 

attcatacat ctatggtgac cacttttgac aaaggaatga agaacataca ctgggggaaa 120 

agataatgtc tttaataaat ggtgctggga aaactggntn tccantntgc agaagaatga 180 

aactagaccc ccatctctta gcatatacaa aaatcaaaat taattaaaaa gttaaatcta 240 

agacctcaaa ctatgaaaca gctaaaagaa aacatcgggg aatctctcca ggacattgga 300 

gtgggcaaag atttcttgtg taatacctga caaacaggca accaaagcaa aagtggacaa 360 
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atgggatcac atcaagttaa aaatcttctg cattgcaaag gaaataacaa agtgaagaga 
cacccataga atgtgagata atatttgcaa actatccatc tgtattaggc catttttgaa 
gtctacaaag aaatacttga gactgagtaa tttataaaga agaggtttaa ttggctcacg 
gttttgcagg ctgtcaggaa gcatggtgct aacatctgat cagcttgtag ggaggcatca 
ggaagtttcc acccatggtg gangcaaaag gggaataagt ttctccatgg caggtgcagg 
gcaaaaanan gggggaaggg aagtgccnca caaccagatc ttgtgagtnc tcagatttgn 
ggngggngct tgngg 



420 
480 
540 
600 
660 
720 
735 



<210> 535 
<211> 735 
<212> DNA 

<213> Homo sapiens 
<400> 535 

tnaannanag ctacttgttc tttttgcagg atcccatcga ttcgaattcg gcacgaggtc 60 

catacatgga gctccctgga cccgtgtgct ctcgtgtgac tgaacgtttt gtgatgaaag 120 

gaggagaggc tgtctgcctt tatgaggagc cagtgtctga attgctgagg agatgtggga 180 

attgcacacg ggaaagctgt gtggtttcct tttacctttc agctgaccat gaactcctga 24 0 

gcccgaccaa ctaccacttc ctgtcctcac cgaaggaggc cgtggggctc tgcaaggcgc 300 

agatcactgc catcatctct cagcaaggtg acatatttgt ttttgacctg gagacctcag 360 

ctgtcgctcc ctttgtttgg ttggatgtag gaagcatccc agggagattt agtgacaatg 420 

gtttcctcat gactgagaag acacgaacta tattatttta cccttgggag cccaccagca 48 0 

agaatgagtt ggagcaatct tttcatgtga cctccttaac agatatttac tgaaggaatc 540 

taggttgtat tttcagtgga caatgggaat aaagcatttc taaagcaccg actggagagg 600 

aaggcaacag aaacaaggag agaagcccga gagacatgtc tgcgtgctgc cacgcatctg 660 

ancgattgct cttgtgaaga gtttgtcact gaacattttc aggggaggct gtttacccag 720 

cnatgtnctn aacan 735 

<210> 536 
<211> 785 
<212> DNA 

<213> Homo sapiens 
<400> 536 

gccccccnnn nnnnnnnttt tcaaannccn ttnnnnnnnn nngnnnnttt tannnnnttn 60 

ttannnnaca gctcttgttc tttttgcagg atccctcgat tcgattcggc acgagctacc 120 

ttgggctggc cctctatnat gctntgaggg gagctgggac agatgatcnt nccctcntca 180 

gngtcatggn tnccangngt gagnttnatc tgccnnacat ngtgacggag tttaggaaga 240 

atgntgccnc ctctntttat tccatgatta aggganatcc atnnggggac tataagaaaa 300 

gcnnttttnc tgctntgngg ncaanangan tnacnngncc cgggnnanag ctcctatgct 360 

gtntgcctgc accaccccct gccttccttc atacctttcc ntggatatgn atgccagggc 420 

ttnncacatt gcctnattna tactnacntg ctnatgacca anacatncac gtgataacac 480 

aaacantggg tgcttgnttc tgatcnctag aggnganctn ttggnnngnt ggagnactna 54 0 

antnttctna gtgtnacttn agttcaatgc ctggccatnt gcnatnacct tatatcntnc 600 

aaagaggcta ctgtgctttt ancctttttt aaaacctcca tctgtattac attgnnaacc 660 

angtttcttt aatnaggagc ttgacctcta nantgggaac tcttgggaat ggncttagtg 720 

aagttcgcna ctaacttaac ctgaaaatta tnatgnnctg tttnacctat catgttnata 780 

actnt 785 



<210> 537 
<211> 967 
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<212> DNA 

<213> Homo sapiens 



<400> 537 



agtanggcgn ttcctaatnn 
aataccgnnc gaggggggat 
atntctttac cccccnannn 
cacacacang caagcactnn 
gtagacatnt nagacacgtc 
ngcanacnca gacggcacnn 
nncgntgggc cngtagtagt 
acgnnctttn aactaaaaan 
antnncttga gngggatgaa 
aattganatt attncntant 
tnaaanttnn ctnttttnan 
ggtanggann naattgtncc 
atnngngngg nccntntatg 
gntaacanan gnacnatggn 
nnnactatgn naaatattct 
tntgcatagn nactatncnt 
gcttnnn 



annnggctaa gcgactttna aagangaggc tngcgtgntg 
nacaatagta nacnnggtnc caatncatgc ttaacaccgc 
ncacanatgc agacncacac atngcanncg nacacncaga 
catgcatggc ccatgctcac acacntgnan nnaacatgcn 
atgtnacaca tgnnacacan gnnnaanaca ctgctttnca 
ngagacanac atgcnnaaac aacatgctcn ctcacntnna 
gtactgtggg tgnnactggg tgccatcnac nnngtatttt 
cttggagcct tnanttnntn tggtgantnc aatncctana 
ccctaananc ctggccctnn tnccnctttc aaggccnagn 
ngnncacgaa gcttntggta ncangngncc cgagnnctnt 
aatnaaacat tttancggtt ctnaggancc gngcctncng 
tgggnatagt tctcacaant natnttnaag gggnnaagng 
nggcnngcca annaangggg tcgnngttaa natattccaa 
accnatccct ntnngaagna aggaactncc tgnncgacta 
cacatntaca naaaaagnag gnnccnnggt ncttnaagnt 
gggacgngtt aacnnanatt ntatgcttta nnngatnggg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
967 



<210> 538 
<211> 892 
<212> DNA 
<213> Homo sapiens 

<400> 538 

gctagttnga agaggtgttt ctaangnntn ggaatcgaca tctnnnnagg cngnccntgc 60 

gattcgcttt gctctctcca ttccaagttg ttctctgttc tagaaagcng atgraigggnt 120 

acatctactg tttttgccta aacagaatcc ctttntcctt tttttgttaa aaggctcatn 180 

cctaatatta cattgctctg gaacgantga caataccana actcagcacc ntgatcggac 240 

cgggacaatc agattatcta attcctcagc aaacggagat cgatccgaaa agtggaaata 300 

tganctcntn ctttgtgntg gcatatggac cctgagagaa agaaacttta atcttttact 360 

cttggactgc aatnaagtnt agctgcctaa aaatcnnttt cntgacactt ngnaggtttg 42 0 

tccacaatcg ggngaaatta nngggtnnga cntaancact ggatgaaaaa aaatnccgnt 480 

tanttntatt ncnnttccan ncttntnaaa tanananttt ntcanccttn nntaatacta 540 

ttanntatat ntnttnnncc cnnatnnncc ttcttnctcc tacnncnntn cnatntnnnn 600 

nnangntcnn cnannnnttc tnttatttct annatatntc ntancnttna ctaaaacctc 660 

cnctcgtnna nattncnnta taatattntc tctaganntt ntnntntntt gnnncttaaa 72 0 

anctcntcta tccctantat nantnattct taccatnaaa tacactanaa gtnntntcac 780 

gagacncgnt atgttantnc anactataat cgcttncatn tanntatatn taaaantgct 840 

atncagnnag nngntnttat atntttanct ngnnaggnta tcctcnatan cc 892 

<210> 539 
<211> 751 
<212> DNA 

<213> Homo sapiens 
<400> 539 

gnnnaggttn tagancagct cttgttcntt gngcaggatc cctcgattcg aattcggcac 60 
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gagagtgtca gttttcctaa tctcagtcca ggtaggaatt aagaaatatc tcaagtgttg 120 

atgctatcca agcatgttgg ggtggaaggg aattggtgcc cagaaaatgg gactggagtg 180 

aggaatatct tttcttttga gagtaccccc agtttatttc tactgtgctt tattgctact 240 

gttctttatt gtgaatgttg taacatttta aaaatgtttt gccatagctt tttaggactt 300 

ggtgttaaag gagccagtgg tctctctggg tgggtactat aatgagttat tgtgacccac 360 

agctgtgtgg gaccacatca cttgttaata acacaacctt taaagtaacc catcttccag 420 

gggggttcct tcatgttgcc actccttttt aaggacaaac tcaggcaagg agcatgtttt 48 0 

tttgntattt acaaaatcta gcagactgtg ggtatccata ttttaattgt cgggtgacac 540 

atgttcttgg taactaaact caaatatgtc ttttctcata tatgttgctg atggttttaa 600 

taaatgtcaa agttctcctg ttaaaaaaaa aaaaaaaaaa actcgancct ntanactata 660 

gtgagtccnt attacgtaga tccagacatg atnagatcat tgatgaattt ggaccaaccc 720 

aactagaatg cagtgaaaaa aatgcttttn t 751 

<210> 540 
<211> 761 
<212> DNA 
<213> Homo sapiens 

<400> 540 

gntnggntcn agancagcta cttgttcttt tgcaggatcc ctcgattcga attcggcacg 60 

agcctgcagc cactaatgca ttgtgtatga taacaaaaac tctggtatga cacattttct 120 

gtgatcattg ttaattagtg acatagtaac atctgtagca gctggttagt aaacctcatg 180 

tgggggtggg gtgggggtgt attccttggg ggatggtttg ggccgaatgg ggagtggaat 240 

atttgacatt tttcctgttt taaattctag gatagatttt aacatccttt gcggtcccag 300 

tccaaggtag gctggtgtca tagtcttctc actcctaatc catgaccact gtttttttcc 360 

tatttatatc accaggtagc ccactgagtt aatatttaag ttgtcaatag ataagtgtcc 420 

ctgttttgtg gcataatata actgaatttc atgagaagat ttattccacc aggggtattt 480 

cagctttgaa accaaatctg tgtatctaat actaaccaat ctgttggatg tgggttttaa 540 

aaaatgtttg ctaactaccc aagtnagatt tactggatta aatggccctt cgggtctgaa 600 

aaagcttttt taacttcttn gcttaaaatg ccgtttaatt ttgataagat ncttnaaatn 660 

gcctccaaaa gtgttananc caatcatttn aaataaaccn ggntgtatat tgcatnatgt 720 

gtacatgcnt atncccttct ggttaaaact naaaaaaaaa t 761 

<210> 541 
<211> 748 
<212> DNA 
<213> Homo sapiens 

<400> 541 

ggtttanttt aaatccntnc ncagctactt gttctttttg caggatccca tcgattcgaa 60 

ttcggcacga gcggacccat cggagcgtaa cctggatctc cgcaggcctg gcggaggccg 120 

gccacctgga ggggcattgc ttggttcgcg tggtancaga ggagcttgag aatgttcgca 180 

tcttaccaca tacagttctt tacatggctg attcagaaac tttcattagt ctggaagagt 240 

gtcgtggcca taagagagca aggaaaagaa ctagtatgga aacagcactt gcccttgaga 300 

agctattccc caaacaatgc caagtccttg ggattgtgac cccaggaatt gtagtgactc 360 

caatgggatc angtagcaat cgacctcagg aaatagaaat tggagaatct ggttttgctt 420 

tattattccc ttcaaattga aggaataaaa atncaaccct ttcattttat taaggatcca 480 

aagaatttaa cattagaaag acatnaactt actgaagtag gtcttttaga taccctgaac 540 

ttcgtgtggt cttgnctttg gttataattg ctgtaaggtg ggagccagta attatctgca 600 

gcaagtagtc acncttttca gtgatatgaa tatcatcttt ggcttggang ccantngaca 660 

acctgncatt actgactttt tgaaaanaac cctctggata ttgatgcctc gggtgtggtt 720 
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ggactgncat ttagtggacc ccgaatcc 748 

<210> 542 
<211> 784 
<212> DNA 
<213> Homo sapiens 

<400> 542 

gtnnnnntng tgtaatcgct tggctgcagg atccctcgat ggcgaattcg gcacgaggtg 60 

ttgctcaang agcagacccg actccntaag gtcatcattg aatgggcatn atangtttga 120 

anactgtcca ananantang ngtcaataca tcaacnnctt tanntgcttg atattgnnat 180 

tgaanaacac angnctcngn ctagttcgcc tganatgatg tttaagatac tccggaagga 240 

gacanantgt tntgantgcg gattaganac cacngaagnn acactnaagg ancancatct 300 

ccacctngna actgnattnn cngaccanaa aagngaactg gaccaaatgc tctcaaaggt 360 

gctggcagct taanagcgtg ttangactct gcacgaagan gacaggtnnt ntgagagcct 42 0 

ggnnannaca ctctcccaaa ctaaactgna nctttcaaca nangggancc ccannttggt 480 

ggagaaatca ggtganctgt tggcccttcc acaaagangc aaattctntg agggcnagac 540 

ttnanccttt ttgcngaacc agtncttgac tgactaaatg aaagcttttt aagccaggtg 600 

gcccancctt aangaagcna ctttttaatc cancggaacc ngcttgagan aaaaccnttt 660 

ttgacccaaa accnggagaa ccagctggcc taccaaaggg aaatgggccc ccatttgaac 72 0 

ttggggttnc ccangaacaa nccttgnccg ggncaaagcc cnttgttgga aaggacctca 780 

acct 784 

<210> 543 
<211> 764 
<212> DNA 
<213> Homo sapiens 

<400> 543 

ntantaaatc ccttgctctt gttctttntg caggatccca tcgattcgaa tncggcacga 60 

ggacccggcg gcgcggacag gcttgctgct tcctcctcct nngactcacc attncaganc 120 

agaanntgaa aaaatggnng anctcaccca ggtaanggat gatgaagtnt tnatggctnn 180 

tgcatactat gcannanttn tncttntgna aatgatgcnt atgagtactg taanngnntt 240 

ctatncattg ncaagaangg ntnttgncaa tncatangac tgtgtagcat tcggcanagg 300 

agaaaatgnc aagaactatc ttcgaacaga tgacanagtg taacgggtac gcagagncca 360 

cctgaatgac cttgaaaata tnattccatt ncttgnaatt ggcatnctgt attccttgag 420 

tggtcccgac ccctctacag cnntcctgta ctttagacta tntgtcggag cncggntcta 480 

ccacaccatg tgcatatttg acaccccttt cnnatccaaa tatagctatg actttttttn 54 0 

gtaggatatg gannactctt tccatggctt acacgntgcn gtaaagtaaa ttggccctgt 600 

gcagaaaaac attccactca gtnttccaan tggcttntta aggaattctn gaccttgcaa 660 

ttnatantgg agnnctttcc ttaagattta aaggtttgan ggngagccnn aggaattntn 720 

aaccnggggt aaaccctttt tggaattttn agcnttgnca anaa 764 

<210> 544 
<211> 755 
<212> DNA 

<213> Homo sapiens 
<400> 544 

gatgctggnt ncnnatgctt gnngatccct cgattcgaat tcggcacgag gaaatgtgta 60 

tttcagtgac aatttcgtgg tctttttaga ggtatattcc aaaatttcct tgtattttta 120 
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ggttatgcaa 
atacaggata 
caaagtttga 
ggaaacattt 
tgagagatcc 
attttacatc 
tgaaggtgaa 
aaattcaaag 
atatgtcnta 
gctgnaatgc 
aactttcata 



ctaataaaaa ctaccttaca ttaattaatt acagttttct acacatggta 
tgctactgat ttaggaagtt tttaagttca tggtattctc ttgattccaa 
ttttctcttg tattacattt tttatttttc aaattggatg ataatttctt 
tttatgtttt agtaaacagt atttttttgn tgtttcaaac tgaagtttac 
atcaaattga acaatctgtt gtaatttaaa attttggcca cttttttcag 
attcttgctg aacttcaact tgaaattgtn ttttnttttc tttttggatg 
cattcctgat ttttgctgat gtgaaaaagc cttggtattt tacattttga 
aagcttaata taaaaggttg cattctctca ggaaaaagcc atcttcttgn 
aatgtatttt tgncctcata taccggaaag ttcttaattg gattttacca 
tttganggtt ttaaaaataa taacattttt aataattttt taaaaggaca 
atnatcccgg ngntcctttn ccrmn 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
755 



<210> 545 
<2ll> 767 
<212> DNA 

<213> Homo sapiens 
<400> 545 

agnttttnaa tcctttggcc antcgcnctt tntgcangat cccatcgatt cgaattcggc 60 

acgagaaaaa gtnaagcttt tcatgagcac anntnccttg cattgttnga tgttactgat 120 

attcgtaaaa tgaatatttt ctgttttgtt ctgttnnatt tttttgagac aagtcttgct 180 

ttgttgccca ggctggagtg caatggcatg atcttggctc actgnaaccc ctgccttgcg 240 

agttcaagtg attcttctgc ctnagnctcc tgagtagctg ggattacagg cgctcaccac 300 

cacacccagc taatttctgt cttttnagtn gacacagggt tttaccatgn tggccaggct 360 

ggtctcaaac tnctgacctg aaactnctca caccngtnat ctcagcactt tgggaggctg 420 

angtggaaag gatcacttga agccatgagt ttgagaccag cctgngcnac acagcngaga 480 

ccccngtgnt gtacaaaagc ttncnacatt tanctggctg aggagtnnct cacccntaac 540 

ttccancnan tcnnttaagc nnanncatnt tgaacacntg agcccannta nggtcgatgc 600 

tnntagtnaa ccgtgactgg accacttaca gtccaagccc gggtngcctt ataaaagaan 660 

cggaaaacat ttcnttaatt cgggttnnag cnttanctat ttcggaatnc cttgngtttt 720 

naaaaacttg aatctccaan aaacagggtt ttttcttttg gnccann 767 

<210> 546 
<211> 989 
<212> DNA 

<213> Homo sapiens 



<400> 546 



tncccttgtt 
tgaggcaatt 
aagttggggg 
gtccttttac 
agaaanttgc 
nttncnnccc 
cttnantnta 
tnncntnncc 
atgncttccg 
nctccngctc 
ttancnncnt 
aatcgcgnna 
tcatctcccg 



gaaanccctt 
aaactggaaa 
actgaaggag 
agatataata 
agnttnacaa 
ttatngnaca 
ctcannagnn 
nnatancacc 
aatggantct 
nccctcattt 
ntncnttncn 
aataagtntt 
cnncactcac 



tgctcctttn tnctnccgtt tgncatncna ttcgctcagc 
agaaatagat tgaaaagata ctntngaaga agcagtacag 
agggagccac tgcaggtgct agctgcttaa ggggatacca 
gatacagctt ctgaggtgga gggtgatagg agtgtgtatg 
ctgctcntgc ctcctnggca anaggannan cntttcnccn 
cacattgncc tgattggncn tnccncngct agcttncagt 
nntnggggaa cncnctntcn nantatgntc ccttttcctc 
ccnctcnctt tcctttctaa acttncacan ntccctgana 
tngaatttct ncgcccctnc ntcntcataa tcnttttgct 
tnctacgtnc cnccttctnn ttnactgnct ttaaatntta 
atctncaant tttcnnnccn acnnnntttt nctnntnnca 
gcncactcnn ntnctancnt attntccctc gcnnttntcn 
ntnnncnnnt caattnntnn nnacncncnc tgctctacnn 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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ncnatntctn tncctncaca ccctntancn tntcnctcan aatgcctttt ctnccttann 
nctntcnttc ncnnatctan ccaantttnc tttnacatcc cctncnnntc tnncccgacn 
atatntnacc tcttnnatcn cagngcntan natcnccccn ttntcnctnt cnctctcann 
cttntnttna tcttcatnna tcanncncc 



840 
900 
960 
989 



<210> 547 
<211> 781 
<212> DNA 
<213> Homo sapiens 

<400> 547 

tgtnnctttn cnncctcnnc cgaaatcnct ttgnttctaa ctttcctaat tacctgggct 60 

acttgcacta tcccntcgat ncgcatagat ggccnngtta ctaanggtga ntttccagcg 120 

cggggggcac gtggagtcac tggaacattt gngcaatgct ggtgggaatg tcaacccgng 180 

cnggcctctg gaatangcct ggcnnntcct gcnagagtta ccntgtgacc cagcaattcc 240 

actcctagct ccacccacag gantngaaag cnaagacgca nacagatgcc tgngcnccaa 300 

anttcacggc agcatcctnc gccatantgg cancatccgt cgtnacagcg gcatcatcct 360 

tcatcattac ggcancatcc gtcgtaacag cggctacatc acttcgccac agnggcagca 420 

tctgtngtca cagnggcngc anccttngcc aaagcggcag cntccttcgt catagcggna 480 

ncatnctttg ccatancngc naggtggaaa ccctgnccat ccactgaggc ntncatanac 54 0 

tanncatggn cagtccaggg cactggaanc cangccgtng aacggcgccn acggtnanna 600 

ggaatganac cntgatgcnc tggggccana catactggct anacanactt ggagacatca 660 

tgcttanttg nannnccant cacacttgcn nncggcgtna tcctgctcac gtgatncgac 720 

ccgaatgggc acttcaaatg ggaanaaggg ngatggcact nccggtnncc tnganagggg 780 

n 781 

<210> 548 
<211> 735 
<212> DNA 
<213> Homo sapiens 

<400> 548 

tctaaacgct tggnncttgc tctttctnca ngnanccnnt gcgntncgaa ttcggcacga 60 

tctagatatt gcccaatcgc tgcccacagt gcacatacct ttccaccagt cacatgtgag 120 

agggcagatt ttccaaatgc tcatcaccac ttggcactgt gtggactata attttggcca 180 

gttaggaaat ggcatctcat tgttttcatc t&aatttgcg tcagcctgat tactcattga 24 0 

aacttgtgag gttgagaaac ttttcttaag cttattggcc attcaagttt cctcctttat 300 

gaaatggttg ttcatgtcat ttgctcattt ttatattaga ttgtttttct tttttccagc 360 

tgacttgtag gaactctaca tcttatcaat attaatcatt tatcgaaaac tatttgggtg 42 0 

ccattatctt ctcctagtca atgttttttg tttgtgatat cttttataat atataagttt 480 

ttaatgttgg cagaagtaaa gttaatcttt ttggctgtgt tgtgtgtctt gtttgatgta 540 

aagatagttt ctgtaatagt tttgcagttt gattgntcat ctttaggtct tcaattcaac 600 

ctgcacatcc atcccctcta tcctctttct tactctgttt ttctccatac cacttatcat 660 

ccaataatat ggtcatgccc tttattnacc ngntttgcat atataatttg gcttgtnccc 720 

ggttccttcc ctana 735 

<210> 549 
<211> 812 
<212> DNA 

<213> Homo sapiens 
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<400> 549 



ttctaatact tggctctngt 
ggaaggagcg ggcgtgaggc 
ggccagctga ggcatggcga 
gtgacccctg ggaaggancg 
gggacttggg ggccgacctt 
aggacggcgt tgggatangc 
tncttggtca aggcttcang 
ccttccatgg nctctgatgt 
atcttgggan gattangccc 
gcttgtttaa cnggacaact 
naanttcncc ttaaaccttg 
tttccaaaaa ggggtncccc 
gggaaaaaat gccttncatt 
aaaatggaat gaattttccc 



tctttcngca ggatcccatc gattcgaatt cggcacgagg 
cagctgaggc atggtgaccc ctgggaagga gcgggcgtga 
cccctgggaa ggancgggcg tgaggccagc ttgaggcatg 
gncgtgaggc cagctgaggc atggtgaccc ctgggtacgg 
ggtttgccca gggcccctnc tgcaccacgg ccacatgcgg 
tccctgggtc cacagcttct gcccgtgtat tggggaaccc 
ctcttggcag atggggcaag gaaccctgag gcttccgcgc 
gggacacttg aacgangcac gattctgaag gactccatgg 
accttcngtt ggtggncnaa agccgtcctt ncggggcccg 
tttcnggtcg ggcttgttgg gccccaatcn ttgggttggg 
ggcccgncct tttaaccttt tttcccaatc ttttgacctt 
tgggcttttt ngggncaatt ggttccgggg gccaaaggtt 
gggnaaaacc ctggatccct tgttaancct ttgggagntt 
cccgggcttt tt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
812 



<210> 550 
<211> 742 
<212> DNA 
<213> Homo sapiens 

<400> 550 

ggnnantcna tgctggtctt gtctctntct aaaagttggc nattcgaatt cggcacgagg 60 

ttctgtggct ggcatggtct gcctgctact ggagagatct cctgagantt cagttttgga 120 

ttggtgctgt catcttcctg ggaatgcttg anaaagctgt cttctntgcg gaatttcaga 180 

ntntccgntc caaaggagaa tntgtccagg gtgctttgat ccttgcaaag ctgctttcan 240 

cagtgaaacg ctnactggct cgaaccctgg catcatagtc agtctgggat atggcatcgt 300 

caagccacgc cttggagtca ctcttcataa ggttgtagta ncaggagccc tctatctttt 360 

gtnctctgca tggaaggggt cctcagagta ctgggtattt tncttatccc ttgactctga 420 

tagtaaacct ggccctntca gcagtttgac gcctgggtat ttatggatat taattagcct 480 

gactcaaaca atgaagcttt taaaacttcg gaggaacatt gtaaaactct ctttgtatcg 540 

gcatttcacc aacacgctta tttggcagtg gcagcatcca ttgggttaat catctggaca 600 

acccatgaag tcaanaatag tgacatgtca ntcggactgg ccggnagctn ttgggtagac 660 

catgccatnt ggcgccttgc tggtcttcca tgancccctt tggcaatcat gggtcttntg 720 

gcgaaccatt ttgcaaacaa ct 742 

<210> 551 
<211> 736 
<212> DNA 

<213> Homo sapiens 
<400> 551 

agtctaatgc tggtcttgtc ttttctaatg ctnggcgatt cgtcctggtg tcaaacacta 60 

taaacctttg accagctgag ctgtgactgg ctgtcacntn tctgagtcct gtgtgcacag 120 

tantntcctg ggtcaggtaa aatccaggtn ttcaagtttt aaggnttttt tgaanaattc 180 

gggcttnttt aanacgatcc ntgcccaant ccacaagctt gttgacagtg gnttacagtt 240 

ngngtggcaa agtccaagtt gttacactgn gctttaaaaa aaatcttatc tgcatgtatt 300 

gttaacttag agaccatgag atctatttat caggaccagg aagatncaca cttcaggtcc 360 

attgcaactg acttttttct tgtttttctt aaaaccctgg tggagcctgg gaagggggcc 420 

tccacaattc tgtggctttg atattagccc caattttaca agcacataca agccccataa 480 

ttgccgcagg aaaacacaag atggaaaatg caataaccca tgcactgaga cttagaaaat 540 
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catccttact aggcaaaatg tattatgatg caataagtgc cactgggnat tttnacgttg 600 

ggactggnca ggaactgctg caaagaaaaa taacagctcc ttctccatta tttacattta 660 

agatgttggt ggggggaagg ttgggagaaa ttagttctga gggtatcata tgcctttttt 720 

aaagaaaatg ggaata 736 

<210> 552 
<211> 733 
<212> DNA 
<213> Homo sapiens 

<400> 552 

nagtttaann gtatgtcttg tcttttccaa gatcctatcc gattcgaatt cggcacgaga 60 

agtgtcagtt ttcctaatct cagtccaggt aggatttaaa aantntctca agtgttgatg 12 0 

ctntccaagc ntgttggggt ggaagggaat tggtgcccag aaaatgggac tggagtgagg 18 0 

aatatctttt cttttgagag tncccccagt taatttntnc tgtgcttnat tgctnctgtn 24 0 

ctttattgtg aatgttgtaa cattttaaaa atgttttgcc ntagcttttt aggacttggn 300 

gttaaaggag ccagtggtct ctctgggtgg gtnctataat gagttattgt gacccacagc 36 0 

ttgtgtggga ccacatcact tgttaataac acaaccttta aagtaaccca tcttccaggg 42 0 

gggttccttc atgttgccac tcctttttaa nggacaaact caggcaagga gcatgttttt 480 

tngtnattta caaaatctan cagactgtgg gtatccatat ttnaattgtc gggtgacaca 540 

tgttcttggt aactaaactc aaatatgtct ttctcatata tgtgctgatg gttttaataa 600 

atgtcaaagt tctcctgtta aaaaaaaaaa aaaaaaaaac tcgagccttt anaactntnt 660 

gagtcgtnta cntagatccn gacatgataa gatcatgatg agtttggaca accncactng 720 

aagcagtgaa aaa 733 



<210> 553 
<211> 870 
<212> DNA 
<213> Homo sapiens 

<400> 553 

nagttaanag taggtcttgt cttttgcaag atcntancga ttcgaattcg gcacgagtat 60 

ataacaactt ttgctttcaa agttgggtgg gactagancn cncantggaa ggntggagtc 120 

agganacctg gattnttgng cccgntntgg nttttacagt ntgcctaant ttntgcagtn 180 

acttcntgcc ancctgtttc nttacntnca anagggaaag acantccttg gccagcctag 240 

ttttnagggt gaacgaaagg tcnttntcac tgcntcctct agtcatttgc ttcttcgnta 300 

attaacacat cttgagcacc tgcnatgttc caggaacagg agatggcanc gtgcaagata 360 

aagtccctga cttctagaga ctgcatgtta gtggcaatcg gcgtntaccc ggccttnaat 420 

aaactactga atgaaggaaa attctaccta caccagacac aattactggg gtttctaaaa 480 

tggaattatt cccccggccc cntgcatcca gcagcctgnt gcagggaaac tcctccnaaa 540 

ggcttgtaag gcaaggaanc cgggacaatg gcntggctat ttaagcttnc aacaagatgg 600 

ttacccctaa gtncctaatt ccctaacacc aagggggccc tttaccagga aaccaaaacc 660 

aggttaaaaa accccaaagt tgggnaaaaa gccatttgcc anccggggcc nttttaaaaa 720 

aaacctttna aaaacctttc ccttttaaaa ctttaccttc aagntaaaan tttaagggga 780 

atgggnccaa nttttttaac canccccaaa aaaaanttng gnaatttttt ttcccnaaat 840 

tttttnaant tccccaaatt tnggaaaang 870 

<210> 554 

<211> 766 

<212> DNA 

<213> Homo sapiens 



156 



WO 99/33982 



PCT/US98/27610 



<400> 554 

tatcaatgnt atgtntggtc tnttcgaaag anctagncgg ntcgaattcg gcacgagcca 60 

acacccagtt ctnactctgt catccaggct ggtgtgcagt ggtgcaatgt gggcttactg 120 

cagccttgac ctccaggaca agtgatctcc cacctnagcc tccggaatag ctgggactac 180 

agntcaacaa cgcccctctg aaagtaggac tcttggaaat gaaccttgtt gggagtaaag 240 

ctgaaccttc acctctcctt tccaggattc tactccattc atacggcctc acactgaatt 300 

aatgtttnta gcagccacat cacttngtta cccaattgat ctagtagtaa agtcttccca 360 

tctnttcatg taaaaaaaaa aannnaaaan gggnnaggaa ccntnangnt nnnaanaaaa 420 

aaaaaaanca ggnggngngc nttttttaac ctataacctg ntttnaggcc tttccccang 480 

tttntccnaa ncnnggttan taggggccna aagctaaccg natttttgnt cccntnaggt 54 0 

taggccngaa attaaccngg gtttaaagaa cncattgant aaagccttgc ctnggccaat 600 

tccgggaaaa gggaanagcc tccttgtttt acanattggg aaaaattggc cccaangggg 660 

gttaaccang tttgcccntt aataactnaa anggattttt gncaaaacct ggttccaagg 720 

ntttaanccc aanccttttn aaanntnggn cnctttggat gnaann 766 

<210> 555 
<211> 770 
<212> DNA 
<213> Homo sapiens 

<400> 555 

gttatccnat gngcgtntgt ngnnnncnct aanananttg gctngncgct gggccttgct 60 

tctctgagaa aactttggtc acacntccaa agccagggtg ggtgcctccn tgnaggaggg 120 

ggctttcctg gttggtggcn cagnaggagt ccaggctttg taccgtggac accatgggct 180 

atggcaacac cttcctaacc atccttccat gaggacctcg gnaganagtg gacatgaaac 240 

cctttgtgct ctgaancatt caacagaagc tttctggttc tgtgcctatt tctttggcac 3 00 

ttgancgtgt ttgcaggttc attacncaca tgatgaaagc tctggcccat agcactagaa 360 

ttcatgtttt nagggtttgt gagtgtgaca ggtgctatgg tttggatgtg gtttgtttcc 420 

accaaaactc ttgcttgaag tttaactgcc agcatggcaa ttgttggnag gtggggccta 480 

ccgggaggtg attgggtcat gggggcttga accctccgga atagattacn gctgcctcct 540 

ganaaagttc tacctgtcat gggggctgga tcagtcaaca ttgannantg gggttgttat 600 

aaagcaagac tnactcctta tgcaccgttt ntttgcatat gcccctctgg gggnancttc 660 

tttggctgct aacatttttg gacccaaccc aatgggcctt nacccagaaa nccggaacaa 720 

aatgccnnnn gccattcctt tnngganctt tccaacttnc canaaataat 770 

<210> 556 
<211> 756 
<212> DNA 
<213> Homo sapiens 

<400> 556 

gtngtcnatg anatgtcttg cctnncgaag aacnaggcgn ntcggtagaa cagaaaatga 60 

gcatccgatt tcttcactaa aggagaccaa actggttcct tgcggcctag tnttnaagan 120 

ctggancttg aaagtcctcc ttntaccaac tccacntcca ccccntnatt cccnttntcc 180 

caaagtncta ctgntgttgc ntgacanccc caaatntgtn ctgtcaacac aaacctgcct 240 

ttggngtata aacagggcnt tacagaatgg tncaccctat atatttctgt tcagtatcca 300 

ttcactagtt cttcattaat aaatatcatc ttccccattc tgctgctgaa tgccacacat 360 

ccatccagtc tgagaaagtg agagaggcaa tcatgccaag aacaagccag caaagctctt 420 

tcaccagatg tagactgtag ccctgctgcc ttccctccag cgagtctgcc agcatgcttc 480 

ttcatccttt taatatgtcc tttgcttcct acttccctgn cttccaacat actgtcactt 540 

actctggcag tcttctgctt ttcattaagc ctcaaaatct cctctgtcta cttggcacca 600 
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caagctatgt cctatatatg natttctgga cttggcangg atagttcaag gggtcttggc 660 

aagtttttat ttaccttcat tatttaaaan gggccttttg gggatgttgg cctntttaag 720 

gagccttttt ggggaaatca atacttctct taanaa 756 

<210> 557 
<211> 742 
<212> DNA 
<213> Homo sapiens 

<400> 557 

tcgtcnaaan nnatgtcctg gctatccgca ggatccaggc ggntcgaatt cggcacgagt 60 

gatttttttg gttttttttt ttgntnttgn caaaagctta ntcntttcan ttaaaantgc 120 

cactantttg actttttaag taaaaantgt agggggtttt aaanctactt tcctnctncc 180 
aaaaantcag aaagtttcta nctttntaaa ttgggaaagc aagcantgtt ttaaaancac ■ 240 

tgaaggaatc tctttnttcg ngnccttttg ttaaactcgg tttaagctgt agaccttntt 300 

taaantaaaa tttaccacag aacaggaaat agaanctgtg gaagactcga aatacacctt 360 

tgtncttctc tgttcttcac ctgctctctc gctgtctcta cacacacaca cacaaacaca 420 

cacacaccta tatttgcatt aaaaatgggt agtaaaagca gtgaagggca aacagaaggt 480 

ccattncatc aagtaagagg ttgaatataa actggaccaa gtcttaattt tttatttcct 540 

tcattcggat ncgtttacta atttctttgc tagctttaag acttttaaaa cattctttgg 600 

ccctgggagg gagttgttta cccctaaact tggagaatcc tggccctaga ataaatgttc 660 

cttttaaacc cccanggccg gaaaattgaa tncngctgtg ccaaaaagga aaaaannnaa 720 

aaaaaaactc gnggcctnta na 742 

<210> 558 
<211> 730 
<212> DNA 

<213> Homo sapiens 
<400> 558 

gggtcnntaa tntnnagcnt gtnaaacccc tgagncttnc gggncgttca caaagaaaca 60 

tttaataggg acttncaanc aaataattnt cggtttntca ggtggcagca agacaagatg 120 

gtggatcccc atgccattac ctgctagact cagggttnat atactgtagt ggaaaggtga 180 

ttccgaagga atgttgtaag acaattgaag tgcagtanca tcaaagttat ttgacctaag 240 

ggcaggagtt ncagtaagta tccactttta tncaagaaac antagataaa ctggaaatct 300 

tggagccctt cctggaactg gggttaatga gaagtcaaca tggtggatta ncatggaaga 360 

tggagttgct tagtctccca ttcaagatgg agtttcttta gcctccattg atagggagtn 420 

tttaacaaaa ncangaaata agtctttgat ccattgaatc tctaagagtg agcccttgat 480 

gactcaggtt taaacagtnc tgagacaatt taggagatag ttttgaagnt caatttgaat 54 0 

tgtaaaaggt caggattttt taactttttc acatctttga anaaaagccc atagagcgca 600 

agttttcagc aaganctgga aancnatatt nctatggaat taaatagctc ctcagggcaa 660 

tcaattnggc ctggganaac ataatgcttc aanggctgan gnaatctgga atttctatgg 720 

gatttettca 730 

<210> 559 
<211> 743 
<212> DNA 

<213> Homo sapiens 
<400> 559 

gttagtctat aangtnngnt atgtactngc cctttccgnn ggatcccntc gnttcgaatt 60 
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cggcacgaga 
gctagacact 
cttatttact 
cactgggtct 
acagccaaac 
taacttaaat 
tcatttattg 
gggagacatt 
gctganggtg 
tcagcttgga 
caattcactt 
ggtggttaaa 



ggaaacaccc ccttataaaa ccatcatntc aggctgggtg atctgacaga 
gtcaaacaaa caaacaaaca aacaaaaaaa ccccatcaca tctcatgaga 
atcatgagag cagctcagga aacacccact cccgtgattc agttacatcc 
gtcccacaaa ttgtgggagc tacaattcaa gatgaggttt gggtggggac 
cctatcacca tgtaaaataa tatctaattt gtagagatta aagaacaaga 
cttggatgta agttaagaga gtggtggtca gagttaaatc attttaaggt 
tctggacaag aataaaattt tgattatcag gaaatacaag taaaaccaca 
gnttatatcc aaattgtcaa aaattacaaa gtcttataat accaagtttt 
tggagcaaca gaaacttttg ttcactggtg ggtatataaa ttgaataatt 
cattacctag caaaattgaa ggctgtatac gtacatacct accaatctag 
ctagatatta agtcttgaaa aactcacatg tttccagaga cgtgttaaaa 
tcattntgng aat 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
743 



<210> 560 
<211> 833 
<212> DNA 

<213> Homo sapiens 
<400> 560 

atccngttct ntannnngtc tngttctttc tncacgatcn nntgcgattc gaattcggca 60 

cgaggggtcc tggtgggagt tccatccagc agtgagtgca ttttttcccc agagcagtta 120 

agggtcttat taaaagccac cactttgctg aggcctgtac aggccttggg ggtttgggga 180 

agagaantaa ggcaggcact tgtcccttca gggagggact tgtccntact gggaggtttg 240 

gggttgacct tggctccagc agagataccc agcctggcnt ggaagggcag gtcttgagct 300 

tacgcttgac tgcaagggca agctgcaggc ctcttctgcc ttcccctgca ttcaccaagg 360 

acaagtagga ccaagaagtc aagggaaaag tgccaagata gatctattcc catttctttc 420 

ttccacctgg agaattcctg agctatgctt caaacctctt ttgggccagg gaaagactgg 480 

gggacatttt ttagtcaagg atgctttaag aaagtaaatt cctgcttggg ggcccaggcc 540 

ttctttttca agggcttgct tgtgaatgcc caaccaaaaa aaaggggccc ccaaggccca 600 

atcccttact tcctnggtcc ccccaaaaag ggatnccaan ttggggaatt gggaaaactt 660 

gggcanncac ccnaanccca ctttggtagg anttnaccaa cccaaccaac ccaaaaccan 720 

cccacccaaa ttnaaaaaaa ggccaaaacc accaaccaac cnaaacccnn annnnnnnnn 780 

nannnnnnnn nnnaaaaaaa ctttgangcc ttttaaaaac tntttngngn ggn 833 

<210> 561 
<211> 773 
<212> DNA 

<213> Homo sapiens 
<400> 561 

tagtctaatg tnnnaaantn ngcnctngtt ctttctgcag gatcccatcg attcgaattc 60 

ggcacgagga agaggaggct gtgtatgagg aacctccaga gcaggagacc ttctacgagc 120 

agcccccact ggtgcagcag caaggtgctg gctctgagca cattgaccac cacattcagg 180 

gccaggggct cagtgggcaa gggctctgtg cccgtgccct gtacgactac caggcagccg 240 

acgacacaga gatctccttt gaccccgaga acctcatcac gggcatcgag gtgatcgacg 300 

aagctggtgg cgtggctatg gccggatggc cattttggca tgttccctgc caactacgtt 360 

ggagctcatt gagtgangct ganggcacat cttgcccttc cctctnaaca tggcttcctt 420 

attgctggaa gaagaagcct gggaattgac attcagcact cttncaggaa taggaccccc 480 

agtgangatg aagcctcagg gcttccttcc ggcttggcag actaacctgt caccccaaat 540 

gcagcaatgg cctggtgatt nccacacatn ctttcttgca ttcccccgac cttccagaca 600 

gctttggctc ttgcccctga caggatactt gagccnagcc cttgcctgtn ggccaaaccc 660 
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tgaattgggc cacttgccaa acttgcnggg gaaagggttc cttgaaacaa gggggccatt 720 

tttggggaag gcttcttggc ttggcctttt ggcatttnaa tttggccttt ttt 773 

<210> 562 
<211> 655 
<212> DNA 
<213> Homo sapiens 

<400> 562 

nnatanacat taangnnaga ngntgagnan ttnccntcgc tctntganna naaggcgncg 60 

cgaattcggc acgaggccac cggtctcttc ctaatctgca cattntattt tgggtatttc 120 

tgggcgggca gttcctttgc atgtttcggg agaggtttgt tgatttgggg cttatatgtc 180 

aggcctttgg tttgcgtctt attttagggg ttgtttgggg gcctgggtgg tcggcctcac 240 

atgggaaggg gatgggtagt ggatggggtt tctgtcgnat cttgnggccg gtgattttgc 300 

tnncgnnctg tttcacattc ttccccctcc acaagccaaa tcgttcattt ggntncactg 360 

tgtggactgt ctgagcttgc cctgccagaa aaatttgggg ctaggcaccc aggtgcanac 420 

tttggaagaa gcantccacc tgtgggtacc gcatctcgtg ngtcccactg gcaggctgaa 480 

cctacttgaa catggaaaca gcatgcccat atggcaaagg ggccnnnacn nnngnnnaaa 540 

tnnannannn ncngacannc nncnnaatca ngannntcna cannnatcnn annnnancnn 600 

nncaantacn ncnaaaacac accnnccana annnnnaann nnnnnncann nnnac 655 

<210> 563 
<211> 738 
<212> DNA 
<213> Homo sapiens 

<400> 563 

tnntaatgct ggaattcctn atncttgggc tactcgttct ttctncagga tcccntgcga 60 

ttcgcagaaa agagtatagt aggggatgac caaggtcaaa gtgggtaaag aagactcatc 120 

atccactgag tttgtagaaa aacggagagc agctcttgaa aggtatcttc aaagaacagt 180 

aaaacatcca actttactac aggatcctga tttaaggcag ttcttggaaa gttcagagct 240 

gcctagagca gttaatacac aggctctgag tggagcagga atattgagga tggtgaacaa 300 

ggctgccgac gctgtcaaca aaatgacaat caagatgaat gaatcggatg catggtttga 360 

agaaaagcag cagcaatttg agaatctgga tcagcaactt aggaaacttc atgtcagtgt 420 

tgaagccttg gtctgtcata gaaaagaact ttcagccaac acagctgcct ttgctaaaag 480 

tgctgccatg ttaggtaatt ctgaggatca tactgcttta tctagagctt tgtctcaact 540 

tgcagaggtt gaggagaaga tagacccagt tccatcaaga acaagctttt gctgactttt 600 

atatgttttc agaactactt aatgactaca ttcgcttatt gctgcagtga aaagngtgtt 660 

tgccatcgat gaatgctgca gaaatgggaa gatctcaaat tctttgctca aaaacgtgaa 720 

cttaaccaaa atgatggt 738 

<210> 564 
<211> 798 
<212> DNA 

<213> Homo sapiens 
<400> 564 

nggggngtct aatgctgcnc nnatcnannc anggnnctcg ctctngctcn acnnanaagg 60 

cgntgngtgt gccaccacac ccagctcatt attattatta ttattattat tattttgaga 12 0 

cgaagtttca ctcttatccc ccaggctgga gtgcaatggt gcgatactgg ctcactgcaa 180 

cctctgcctc ctgggttcaa gcggttctcc tgccttggca ggcacctgta gtgtcagcta 240 
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ctcgaagctg aggtgggaga atcgcttgaa cctggggggc ggagattgca atggtgtggt 
ctcggctcac tgcactcgag cctggcgaca gagcaagact ctgtctcaaa aaaaaaaaaa 
aaaaaaactc gagccntnna actattngng aggtcgtatt acgtagatcc agacattgat 
aagatccatt gatgaagttt gggccaaacc ncaacttgaa tgcnnngaaa aaaagcttaa 
ttgggaaaat ttgggaatgc ctatngcttt atttggaacc ctttntaagc tgcaantaaa 
acaagttaan caccncccaa ttggcntcca ttttaatgtt tncagggttn aggggggaag 
gttttgggaa ggttttttna aattcncggg ccnnggggnc ccaatgcttt ggggccccgg 
gtncccaann ttttgggncc cttttaangg gngggnttan attggccccc cttgggggna 
aaancgnggn anatacctng gtcccctgtg nanaaatngg nttcccntta caaaatttcc 
cacnnanatt tnngnncc 

<210> 565 
<211> 744 
<212> DNA 
<213> Homo sapiens 

<400> 565 

ttntnngttt naatnntcnn ggnntcgntc tnnctcnaan nanaataggt ttggcgaatt 60 

cggcacgagc atgctggcca gcatccctgc ctgtgcaagc tctggatgag ctgtgtgccc 120 

ctgccacnca cacccngcac tccctgccag cctggcctca gggcctctga tccatgtgca 180 

ctggagtggt gatgactgac agggccactg gggcatttnc acgttaacag cagctgccac 240 

tggcaaaaga agtgactcgc caatggtggc atctcagatg tgggcccagg agtctgggga 300 

gctactttga acagggctat ccattcattg tcccaccaaa ggctatggag cccacccacc 360 

atgtgctgga gtagtcaagg gaaataagac actctccttg tccttgttaa ctcaatcaac 420 

aagcatttgc agagcaccgc ctatatgccg gcgctgtccg aagtgctgaa gatacagcaa 480 

tgagctaagt aagcactgac ttcgtagaaa accataacat cggccatctt tggaaaagag 540 

aaaaacaatg gagttactta tttaaaaaaa aaagaaagaa agttatctct tccanganag 600 

gctagaagta cttttctgct ttttggccag tgcccantgg aatgcctggt ttgggggaag 660 

aagaagggac tgggttaact gtggtgcttt tgttgtaaaa aggcanctgg cctttgtact 720 

tgaggagaaa natggagcct tggg 744 

<210> 566 
<211> 756 
<212> DNA 
<213> Homo sapiens 

<400> 566 

gnagtnntat tgatttntct ccgtgaatcg ttctnnctnn annanaagtg ngttnngccg 60 

ctggctatgt ggacgctggg gcagagccag gccggagtcg aatgatcagc caggaagagt 120 

ttgccaggca gctacagctc tctgatcctc agacggtggc tggtgccttt ggctacttcc 180 

agcaggatac caagggtttg gtggacttcc gagatgtggc ccttgcacta gcagctctgg 240 

atgggggcag gagcctggaa gagctaactc gtctggcctt tgaggtaatg gggggtggcg 300 

gtggtggggg gtgcttantg gctatgctca ccccgctnca ttangcctat tttggtctgc 360 

tgtttccaaa tgcttctana tctaggcatt tggtatccaa cctattgcca cantgcctan 420 

aactncanac ccccngccnc tatgntnana cctacttggc acaagaacaa nngnanacnt 480 

tgnnnatatn ccanaangnn naanattaca nantnttata ataccaattn ntnttgangg 540 

tgttnnnnnc anaaacnttt gntnacngnn nnnnntatna atnnataatt nnnnntttgn 600 

nancannanc tatgnnnaat taaangnntn tntncnnnnc nnnacnnnna nnnnnnttan 660 

nnanttncnn ttnnnntnnn nnnnnnnnnt tnaanaannt nnnnnttnat nnnannnncn 720 

nctnnaangt ntntttnnnn nnatnnnnnn nnnncg 756 



300 
360 
420 
480 
540 
600 
660 
720 
780 
798 
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<210> 567 
<211> 746 
<212> DNA 
<213> Homo sapiens 

<400> 567 

gnntgtnntt nncnnannnn anganagagn tactcgctct ntctctacga tanantgngt 60 

tncgaattcg gcacgagatt tcctccagtc ctgggcccca tccttnaggg ccttcccagc 120 

cagccagcag gagaggcaag aactggggga acacaggaac ctaggggagg aggggagcgc 180 

tgggcatcct caggctggcg gccaagcctg cccctggagg cactagagga gggcatctgt 240 

ctgtgggagc ccagagctgc agggaggagg aggagggagg tatctggtgt gagcgttgcc 300 

cctgcgacat ttgggaccac acaggtgggc ttccttattc cctgacaaag cctctgtttc 360 

cagctcttcc gccctctctg gatgagggaa cagaagtgga ggaaacaaaa gaagcagcag 42 0 

cacgcacagt cctgtcgctg ggtgcggaga cagcctggca aagtcccact cagccatggc 480 

ctgatgcang ccccagccct nctttcttgg gtgtcaaatg actgtgtcct ggacatctga 540 

tgcaccacct gccctgcctg ttgcaaacgt gatgctcccg gatggaatgg agaaactagg 600 

agactgggac aagcaaaang ctgcaaacaa cccagaaccc attcttagaa nactggagaa 660 

atgattgagg aatcattggc accgtggncc tgtgcttcat nacaaacacc tttnagaaca 720 

acttgggatt gaaaaaccaa gacant 746 

<210> 568 
<211> 738 
<212> DNA 
<213> Homo sapiens 

<400> 568 

gnnnntngtn gttcttanng ttnggatctc gttctttctn cacgatcncn tcgattcggt 60 

ctgggcagcc tacgctttcc ggataaaaat ggcagaatga aagaaattat gagtggaact 120 

agagaatagg aaagacatga accaacgccc aaaatgagaa agaaggacat ataaagaaaa 180 

agacaaatac aagtgaaaaa aatagactaa tggattaacg tccctgtcgt gtgacatttt 240 

ctgctatgga aatgatatta gacaaaaagc acttcaagtg gttttcttat ttgagttcaa 300 

aatgggtcat aacgcagcag agataacttg aaacatgaac agcgcatttg gcccaggaac 360 

tactaacgaa catacagggc agctgtgatt caagaagttt tgcaaagcag actagagcct 420 

tgaatatgag gaacacagtg gccagccatt ggatgcttca cttcttgaag catcttgaca 480 

gctttttgca ggtgaaatgc ttncacacca gcaggatgca gaaaaatgct ttccaagagt 540 

ttgttgaatn cagaacatgg atgtttatgc tgcaggaatt aacaaattta tttctcgttg 600 

gcaaaaaagt gttgattgna atgggtccta tttgattaat aaagatgtgt ttgagcctaa 660 

aaaaaaaaan nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn rmnnnnnnnn 720 

nnnnnnnnnn nnnnnnat 738 

<210> 569 
<211> 753 
<212> DNA 
<213> Homo sapiens 

<400> 569 
gtttntgant ntgattctta tgcttngnct 
gctggaggag aggagctcag agttctacag 
tgaagaggtg gaagcaaagt tcaagcgata 
ggccaaaatt cttcagtgtt accgtgagaa 
ggccacccag tatatgcact gtgtcaatca 



aatgctttnt ctnnangatc 
agtctttnct gaacaatatc 
tgagtctcat ccagtctgtg 
cacccaccag accctcaaat 
tgccaaacag agcatgcttg 



ccnncgattc 60 

agaaagctgc 120 

ctgatctgca 180 

gctccgctct 240 

agaagggagg 300 
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ataaaaactt tcagaatgag caaaacacca tcaacgttaa ttccagagat ggaacatttt 360 

ttttcctagt gagaaaacaa cccatttgaa gagaagaccc taatgagaag accctaaaga 42 0 

gagacatcaa gaatggattc agcagaatca tttcacgttt tgaacagcag cagtttgaan 480 

ggccaaagcc tttgatcagg gatcccgtca ttaaaggaca ctcttgagta ttagtaaacc 540 

ctcttatgat gattaaaaga gaagggcagc cctnttcacc tttttggtct ttctattcaa 600 
cttgcctgac cataaaatgg ttctcttctg nacaaagccc catcatttgg tgaacctcac , 660 

ccttaacaaa gtaggattgg ggttgggggg cttaattaat tggaatgggg ccaaggagaa 720 

gagcccgaaa ccttagatnc cangggnana agt 753 



<210> 570 
<211> 832 
<212> DNA 

<213> Homo sapiens 



<400> 570 

tnatnaataa ggtttgantt cttatgcttn ccaanngctt ggacctannt anccangcgg 60 

tgcgaattcg gcacgagcca ggccccaata atctgggntt naaactttga ggaaatgcca 120 

gtgacttatt ccagagtgcc tcagttaggg gaacttctct gtaaagaacc ctgggtattg 180 

agcaaaaacc ttattatcgt taatgaccta taattggaag cttcctgcct ttttctttgg 240 

ttgctcctgt ggaaaatact gaaaagatta ctttgtttta ttttgttgtc tttttataaa 300 

a ggggaggtg gagagacccc ttcagagcag ggattgtgcc gggagagtgc ctctgacttt 360 

gggacatttc atccacagaa atttncaagc caatggtttc ttttgggttt tgggttttta 42 0 

tgtttgnttt ttgggggttt ggaaaaacat gcatttttac cgtgcacgta aaattggtca 480 

nagaaaaagg gagcccagaa aangcagcan atgggccatg cccctttgct gggttttcct 540 

tttcttttgg gactgtnaag gggaaatggg tttttanaag gtgaaggttt ggtcctgttg 600 

gaaggaaaag aantgtctct gttngggggg acaanaaggn acccttgggg gaggtccatt 660 

cgcaatggtn cctaccaaaa cnnggntctt taanaacacc ngggcctttg ncccaggnaa 720 

aaaaccctgg gcccctttaa naaactttgg nangggaacc ccggaaaacc cccttgggcc 780 

ttnccaaatc ttttttccca aagncncccc cggggggccc aaaaaaaaac ct 832 

<210> 571 
<211> 748 
<212> DNA 
<213> Homo sapiens 



<400> 571 

agtnttaatn ntggacttct aanganttng gctnntcgnt tggaannnnn cagtnctcta 60 

nnagcccatc gatgcgaatt cggcacgagg ctaggattac aggtgtgagc caccatgccc 120 

agccacttat ctttaaagga ttaagtttat gtttcctact atgggaaacc atcccacccc 180 

aaacttgatg accgcattat gtgcttttat agaacatggc acttctccag gatagcattt 240 

attctgtttt gtaagtgtga atgtaattac cctacacaca gcatacacat aatcttcata 300 

ttctttgcct tgtcttgtga aggcaagggc catgtctatc ttattcgtca ttagattccc 360 

acatccaaca tagtcctggg gacagcacca atgcactttt ggtgcataag caaatagtgc 420 

atttatagct cttacctaca atatctgata gactaatcaa atatagtagg ttatctgggc 480 

ctttttgatt catgtctcta gcttaacttt catttttttc ttatttggta tctctcactt 540 

tgccttttga tatactctta cagtttcgct cactgagtaa aagaaaatnt aaacagcaag 600 

aagtaaactt gtgttttatg gatttngata acatcttcta aaagaccccc caagattgtt 660 

gatgtctaaa aaaattaaag ggccttcaac tcataataat acttaatagt tcttaaaata 720 

ttacaaactg attggaacat tgcctaac 748 



<210> 572 
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<211> 755 
<212> DNA 
<213> Homo sapiens 

<400> 572 

agtcttatta nnnngttcta atcctttctt aangagnnta ggctactcgt nctttctgca 60 

ggtatcccnt gcgatncgaa ttcggcacga ggctgagcac ctttggaaac aacatttaag 120 

ggaatgtgag cacaatgcat aatgtcttta aaaagcatgt tgtgatgtac acattttgta 180 

attacctttt ttgttgtttt gtagcaacca tttgtaaaac attccaaata attccacagt 240 

cctgaagcag caatcgaatc cctttctcac ttttggaagg tgacttttca ccttaatgca 300 

tattcccctc tccatagagg agaggaaaag gtgtaggcct gccttaccga gagccaaaca 360 

gagcccaggg agactccgct gtgggaaacc tcattgttct gtacaaagta ctagctaaac 420 

cagaaaggtg attccaggag gagttagcca aacaacanca aaaacaaaaa atgtgctgtt 480 

caagttttca gctttaagat atctttggat aatgttattt ctatctttat ttttttcatt 540 

anaagttacc anattaagat ggtaagacct ctgagaccaa aattttgtcc catctctacc 600 

ccctnacaac tgcttacaga atggatcatg tcccccttat gttgaggtga ccacttaatt 660 

gctttnctgc ctccttgaaa gaaagaaaag aaagaagact gtgtttttgc cactgattta 720 

accatgtgaa actcatctna ttaccctttt ctngg 755 

<210> 573 
<211> 743 
<212> DNA 
<213> Homo sapiens 

<400> 573 

cangtctaat gctggctctn atcggttctt nnnantnaag ntactcgttc tttctncang 60 

natcnnntgc gntncgctca cacagcatgt gtcagatcca tggggtagga gtcggccaga 12 0 

gacttggtaa cagacagatt gctggatccc acccctagac tctctgattc agttagtttg 180 

gggtaaggcg caagactgaa tttttcacaa gtttcccagt ggtgctgata cttctggtcc 240 

aggaacttag tggggagaga acgactaatc tagaccattt cacttcacat tctgagcttc 300 

ttgtcactgt cacactgcat ccttttaaca atgcattccc tatcctattg caatactgac 360 

atctcatcaa tattttaaaa catgcgtttt cagaaacaat attttatatc aaatactcac 420 

ttttagtaat atttctgcaa ttttgcccta tggatctgag atctaacaaa tactattctg 480 

gacatgggct acaacagttg aggctggaag taaaaatgtt aaaccctgct gaccacgtta 540 

ttttaaagtg tattttagtt aagaataata tggcttagga gcagggctaa acagtagcag 6 00 

tcacatgggg aatgatactt tgcttttgca cataaaatgt cctgaaggga aaaaataaag 660 

cagaaaattn ncagatgaac tgaaaatctg tacaaatgtt gggctgaata ctgccagcgt 720 

tgangtgtag gaaaatgaac cnt 743 

<210> 574 
<211> 737 
<212> DNA 
<213> Homo sapiens 

<400> 574 

ccgtctaatg ctggnttcta atcgctttct taangctcnn gggctcgntc tcnctncacg 60 

cagcccggcg gtgcgaattc ggcacgaggg gattacaggc atgacccacc gcgcccagcc 120 

tgtnatttct tatactntgt attttggnct tgtattatgc ttctgatacg ctataattat 180 

ttatgtccat gtncntttct tcaatagact gtgaactctt cgaatgtngg actcctagag 240 

ctagatnctc nattattnnn tattaaattg aatgacttgn aactacagat cctttattta 300 

aacttcccaa atttctgctt tatctaggcn actctttaaa ttcttttatc tcatgtagat 360 
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ttcanaggct gaaataattg agatttttag tttgaagaaa 
gcnttattat tatattttta atggctgttt gggagtnagg 
cctcctaaat ncttaaatat ttcctaaaaa caggncattc 
ggctctggcn tccaggctgg antgccgngg cccatcttgg 
cgattcncgc tggtctcctg nctngctgct cgggaggctn 
ccggaggcgg aggttncnan agcctnnacg ggccctnggn 
cggacctccg nctgnat 



agagaactgn ggatttaatg 
ttgcagacat tggtcacttt 
tttnttttnt tatggagtct 
cttactgcag ctccccttcc 
aggccnggga atcgttgacc 
ctcccggctg ggtacnngac 



420 
480 
540 
600 
660 
720 
737 



<210> 575 
<211> 766 
<212> DNA 

<213> Homo sapiens 
<400> 575 

gnagttnaaa agcggntttt antcctctcn aatcngnttg ggctactngc tctttctgna 60 

ggnatcccat cgattcgaat tcggcacgag ctttctccct ctgtgcctcc tgcttccttt 120 

ctctctcctg cctctcctct gctccccatc ccactttctc atctgcctcc ttttctcact 180 

tctgtcagtc tgtaagcttt gataacctgc ttaatactcc aaagtgtgag ttcctctgat 240 

ctcttgattc cttagttcta atctcacgtt ttgtttttaa gagatggagt ctctcactct 300 

gtggcccagg ctggagtgca gtggcatgat catagctcat tgcatccttg aaatcctggg 360 

ctcaggtgat cctnccgcct gagcctcctg agtatctggg actacagatg cgtgccacca 420 

agcctggcta attttgtctc atgtcttcta aaaattattt tgtgaagccc cttcacaaaa 480 

aaccttaang gaaatctgat ggtgctcagg aatctaactc tccctaaacc atcctctttt 54 0 

aactgcttct aaaatatctc tggtggcctt tcttagcctt tttctggttc attcaatgct 600 

tcaaagcgct ttttgnttct aagttgagtn ctttgggggt ttgacaggta gtgacgtgta 660 

gttttgacac tgttaacttg ttnaatacag tgaaaangtt tgtgaagtga aaaatgcttg 720 

anaaagaatg gnaatgcctt tntacaaata aaagtnttgt taaaat 766 

<210> 576 
<211> 761 
<212> DNA 

<213> Homo sapiens 
<400> 576 

ggggtnnnna gngnnttgan cccctttctt attatcaagg ngctngcnct nnctnnannn 60 

ancacaggcg ntgngaattc ggcacgagaa gataacctct taatgcattc atgttgtata 120 

tgaaggaaat gagagcaaag gtcgtagctg agtgcacgtt gaaagaaagc gcggccatca 180 

accagatcct tgggcngagg tggcatgcac tgtccagtag tatttattgc tttagagatt 24 0 

gcttgtcgta cctgtatgtc gtcccttttt aaatatgttt tcctttttct tgaaactgta 300 

taaagttttt ttccccctta gcataagcat cttatatata acaactcatt tgtacaaggt 360 

ttttaagttt atatataaaa tgtgtatata tatttttgnt tccccttttt gacttttttt 420 

ttctgtatga aacccagatg tcaccaaatg gacattaata gttgcattaa ggatcagtag 480 

cattaacaaa agttgcttta aaagccatta tgtaaaacaa gacttgaaaa tgagtgaggg 540 

aattttagcg acactgtctg agcacagtgg gaaccatctt cgtttcccct ttgaactcca 600 

antgggatgc cctaccctgg cgccccttag gaccccggac tggcccgngt acaaaacttt 660 

accgtgccaa aattcttaag tgaatttacc tttctncctc tttttgaagc tngaaatttt 720 

tggtcatcan gntttgcttg tgatngtaca tanggtngaa n 761 



<210> 577 
<211> 803 
<212> DNA 
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<213> Homo sapiens 
<400> 577 

gggtngttnn nnngtggnnt tnttnnnngt ttctaatnnt cgnggngntc ganctnnctc 60 

nananagaat aggtttgnga attcggcacg aggtctcccg cccggcgccc ccagtgtttt 120 

ctgagggcgg aaatggccaa ttcgggcctg cagttgctgg gcttctccat ggccctgctg 180 

ggctgggtgg ggtctggtgg cctgcaccgn catcccgcag tggcagatga gctcctatgc 24 0 

gggtgacaac atcatcacgg tccagccatg tacaangggc tgtggatgga ctgcgtcacg 300 

cagcctctag aactatagtg agtcgtatta cgtagatcca gacatgataa gatcattgat 360 

gagtttggac aaaccacaac tagaatgcag tgaaaaaaat gctttatttg tgaaatttgt 420 

gatgctattg ctttatttgt aaccattata agctgcaata aacaaagtta acaacaacaa 480 

ttgcattcat tttatgttca agttcagggg gaggtgttgg aggtttttta aatnnncggc 540 

cncngcgcca atgcattggg ccccgtaccc acttttggtn cctttaantg aagggtttaa 600 

tttgccccnc tntgccgtaa ttcatgggnc atanncttgn tttcctggng ttgaaaattg 660 

gntaatcccc ttcnacaaat ttccnccaca atcatttacc aaaccccngg gaggcctttn 720 

aaagnngtna aaanccctgg gggtggccct taatttaagt ggnnccttaa ctcncnttta 780 

antgccnttg cccttcactg cct 803 

<210> 578 
<211> 738 
<212> DNA 
<213> Homo sapiens 

<400> 578 

tcgtcccntn gatcggggta acgtccttnc ctatnaaant tctttcggga aagcagaaac 60 

caagctggca gaagcacaga tagaagagct ntcgtcagaa aacacaggag gaaggggagg 120 

agcgggctga gtcggagcag gaggcctacc tgcgtgagga ttgagggcct gagcacactg 180 

ccctgtctcc ccactcagtg gggaaagcag gggcagatgc caccctgccc agggttggca 240 

tgactgtctg tgcaccgaga agaggcggca gatcctgccc tggccaatca ggcgagacgc 300 

ctttgtgagc tgtgagtgcc tcctgtggtc tcaggcttgc gctggacctg gttcttagcc 360 

cttgggcact gcaccctgtt taacatttca ccccactctg tacagctgct cttacccatt 420 

ttttttacct cacacccaaa gcattttgcc tacctgggtc agagagagga gtcctttttg 480 

tcatgccctt aagttcagca actgtttaac ctgttttcag tcttatttac gtcgtcaaaa 540 

atgatttagt acttgttccc tctgttggga tgccagttgt ggcaagggga ggggaacctg 600 

tccagtttgt accatttctt tgnatgtatt tctgatgtgn tctcttgatc tgcccccact 660 

gtcctgtgaa ggacagctna ngncaaggag tgaaaaactt tacttcttaa aaaaaaaaan 720 

nnnnnnnnnn nnnnnnaa 738 

<210> 579 
<211> 758 
<212> DNA 
<213> Homo sapiens 

<400> 579 

gnngtgncta nctaaatnnt tggntntaaa cgtncnttct gcatnatccc tnnttgacga 60 

attnggcacg agacagagtc ctgaaatatg caaatgaagt aaattctgat gctggcgcct 120 

tcaagaaaca gcctaaagga cctgcctgat gtgcaagagc tcatcactca agtgcggtca 180 

gagaagtgct ccctgcaggc cgaagccatc cttgatgcaa acgacgctca tcaaacagag 240 

accttctcct cccaagtcaa agggacaaat aagcctctgg gttgaacggt ttgagacatt 300 

ctgccttgga ccttcccttg tcaccaaaca agccaacctt gtgcacttcc accaggcttt 360 

cacccattcc ctgcaagcct tggtcttttg acctggccct caaccatgtg gctttccacc 420 
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ccttgaggac aagttggaac agaagaccaa gagtggcctc actggataca tcaanggcat 480 

ctttggattc aggagctaac caggctcttn ctcgggggcg ggggagattc tgactcttaa 540 

tctggattgt gagaaaaatc cagcaagttc catgatattt aaatccaggt ctgcattggc 600 

ccggggcaag agtttaacat cttcgggccc tgcatttcct acatcttggg gtctgtacac 660 

gttcttaagc aagcgtgtca ngagagcacc ctgttggctt cttggtaaaa tgtgtgcaag 720 

gtcatnctgt cttctgnacc ttctggggaa aagggncc 758 

<210> 580 
<211> 816 
<212> DNA 
<213> Homo sapiens 

<400> 580 

tttctaaatn gcttgggttt cnaaatccct tggttgacgc cctcgcctaa nntggcgtgn 60 

nantgcccnc gattcgctgn caagtctgga antcatattg gagcctgngt ngactgaaaa 120 

ctcagcanga gttgatgtta aagtcttggg tctgaaattn gtngggcagg agattaggct 180 

ggaaactcag gcagaatttc tgtgttacaa tcttgaggca taattcttct ccaaaaaaat 240 

ctccattttt ttctcttaaa gccttggatg agccttggat gattggatga ggactaccca 300 

cattatctag ggtaatctcc tttgcttaaa gtaaactcac tgtgttaatc acatcaacaa 360 

aataccttca cagctacatg tagtgtttga ccaaacaact aggcaccata gcctagccac 420 

ataaaattac tatcattata ctttgtctta tcacatactt ctaccttgga agggatattt 480 

cccagttggt atagctacaa aacagaggca gatcatttag cctgcattng attngtantg 540 

aaaaataagc ctttggtgng tttaaccact gaaaatgttt gcggcctatt agtantngca 600 

caacttatcc tatnctggcc aaacatagaa tgctttcggt ttgcaaggta acangatccc 66 0 

ctttacagnt gtacnaaaaa tnancnntaa aaaaactnga gccctntaga acntnntagt 720 

ggagtcggan ttaacgttng ancccagacc ntggattang gatncattgg atggagtttg 780 

gacataccac cancttggaa tggcnantga aaaaaa 816 

<210> 581 
<211> 868 
<212> DNA 
<213> Homo sapiens 

<400> 581 

ccnnganncn nnccnnnnnc nnacaaaanc nnnnnnnann nnnnnnancn nnnnnnctct 60 

tcnaannctg ctnacgccca nagcatgacc cacgatcgaa tcggcacgag gttgcaagca 120 

gccttggaat agtaactctt ctcatttgtt tgggatctgg ccaccaagtn ccagaatgat 180 

acacggatca gngcanaagn tcatcaggct ctcggacctt agggctgntg gagaagcttc 240 

agcagcagaa ctgatggtga aggctcgtgt tctccatcct caactttctt tgcttcgatc 300 

atacacaaga atacattngg aagggcaaaa aaatgaacac tgtcgnncat tgcagcccgn 360 

gtttngtgac acagatgcac agtctgcttg tgaagacctt ctctcaagtg gcatttggga 420 

gtccatgcca gancatggtg cttcatgaga gactgacagc tatcaggggt tgnggcactt 480 

agngaggact ctcctccccc agtgtgtgct gatgacacat cacacctgac aatagctnga 540 

agnctnctct gacccntntt actctgtagc caacatacca catganttta aaacccnttc 600 

taaaatatcc aancaatggg gtcatacntg gcccaaatgc cagaantcna gagcctaata 660 

ggacttccaa tnattaactt tnccaaannc gaaaaaagna gggcnttccn nttatggcaa 720 

aaaaatnaan nnaaaaggan atntggnatn gttngccnaa aaaaaaagcc cnntnngaaa 780 

cctaatanga ggaggtccca cttaaccggn cgnancccca gaacantgga atacaggant 840 

accnatngga ntgaanattt ggggancc 868 

<210> 582 
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<211> 745 
<212> DNA 
<213> Homo sapiens 



<400> 582 



ttctgaatac cttnttacnc 
cctttctgcc ttctgtttgg 
agggctgtgc tatccaatac 
tcacataaga ttaaaaattc 
gtatgtagtt catggctact 
attctagaga actaaactgg 
gtctccctgg gtaggagggg 
tgttgtctgg gacccttcct 
cnngtcttct ctgaccccan 
ntggccatta nctnccntna 
ntncccccaa ngncctttcc 
ttttanaant gnnnnnntgn 
ttntnaaaaa aaannannaa 



gccttcttca ggantttcaa gacctaattc ggcacgagac 
gacccagctg gtgttctttg gtttgctttc ttcaggctct 
agtaaccaca tgcggctgtt taaagttaag ccaattaaaa 
cttcctcagt tgcactaacc acgtttctag aggcgtcact 
gtactgacag cgagagcatg tccatctgtt ggacagcact 
cttaacgagt cacagcctca gctgtgctgg gacgaccctt 
ggggaatggg ggaagggctg atgagacccc agctggggcc 
ctnctganaa gggaggcctg gtggcttaac ctgggcangt 
tggctgcngt gaaggggaac cacccttcct tgcttgacca 
ccacttgnaa cccanggtcc canctggctg ggaccctntt 
cttgggctnt nttggantga gcaccttctn tgtnngcacc 
tactgatttt tttgntaaaa agannttaaa anctggnant 
aannn 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
745 



<210> 583 
<211> 748 
<212> DNA 

<213> Homo sapiens 
<400> 583 

gnttctaatn cttggcctac tcgcctntct ncaggatctt atcgatncna attcggcacg 60 

agatatggta tagttggaaa taggttattg tgagttattt gtagtcatgt ctttaatggc 120 

ccttgcatgg tgtctaactt ctgcaataaa tgatctgcca gtcctagtgt ctgggcttta 180 

tgcaatttgt tttcctttgt ggatgaagtg ggagtaagac ttgttgctgt gaggatcaga 240 

tgaagtggct aggatatgga cacactttac ttgaattgga aaacaagcca tgtatcccta 300 

atctgcaaaa tgtggcatgt cacacgtgta atctctgagg tttagttttt gctcaagatt 360 

gcaaaggtga cttgcttgat gctttctttg cttgagcaca catctcattc attaaatggt 420 

gtctcctttt ttgcacacag gatgcagaac ataattgacc ttttccaagt ctacttagca 480 

gaaatgaaaa tggaatcata taaatacagt attatacttt aaaataaaaa ggctgtacaa 540 

aagtttggct gacatagctt gcttctagta atctgaatgg cttatttaaa taaagttgga 600 

tctatggact cttcacagnc tagatattat cctactggaa gatgtgcctc gaaagctgtt 660 

gaaccacngc aaaaaaaccc ttcagtcagc acgtgagaaa acctgcgagc ccacatttcc 720 

cccgggacca ttctgaacat cctactgg 748 

<210> 584 
<211> 773 
<212> DNA 
<213> Homo sapiens 

<400> 584 

tttaatgctt gttacacgcc ttctgcagga tttatcgatt cnaattcggc acgaggctat 60 

gtattgtgtc ctaccatgaa ttcactccat gctagccaca ttggcctgta tggctattcc 120 

ttggacacac ctaggatgtt cttgcctctt agcttgccta cctttctctc atcatttggg 180 

cctcancgag gatatcatct cctcagagaa gccttctgtg accatgctat ctaaaatact 240 

ccagcacttc agtcaccctt tatcccatta ctctgctttt tcagaaacat tggtgctccc 300 

tgaaacatat ttgtttactt gcttagtgtc ttttctcccg cactaccatg taagcttctt 36 0 
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gagggttaag ggaccttgtt agggataacc actgtatcct tagagtgtga cacatagtag 
gttctcaata catatttttg aaactctacc ctgatgcaaa agagatatca aataattata 
gtttttgcat tataaatggc tttggtgaaa tccctggcac aaaactaata ataaaagaaa 
taaacagata atgttgaagt tctgggcctg caaaacctaa ctcttttaaa gcagtcccag 
taaatgtgtc attgggatcc ataagacttt gtgggaaagt caacataatt ttattnggga 
aaaagcattg aaccttcaaa agtnaaaact ttatnggncc aaaatctcaa ttactggggg 
gccgttcttt aagtcatttt aaaccctttg angccnacag ttttacacca aat 



420 
480 
540 
600 
660 
720 
773 



<210> 585 
<211> 745 
<212> DNA 
<213> Homo sapiens 

<400> 585 

ttcaatacnt ntttcnngcc ttttgcagga tcnctcgatt cgatggaaca tgagtggaag 60 

tgggcagtct ttttctttcc c tat cage tg agtgaatgaa gatttagagg gcagcagagt 120 

catgacatgg atgacgttgg gtctctggat ggctaaatgg aagacccgcc ccccaacgcc 180 

actctacccc ectgetttga actatgettt gagaaatgag cttatgagac cactgagact 240 

tgggggctgt ttgttcagca gttcacctac acttattagg aaaggttgac ttcttgtact 300 

acgcctttcc ttaaatcatc ttttgtataa ttctcagaac actgctggtt tgggtggtct 360 

cacacatttc tcacatccaa attttaaaga tttcatgaat gttcattaca gtggatttat 420 

ttttctcttt ctgcttctcg gcatgccctc tcaatttggg agaaatctct aattggatga 480 

ctttggtggg accacaggag tgtaaggatc gtaattccct cacttcatcc ectgeaaatt 540 

aaagcctggg cacttaagac tcactcaact gaatcttgat atgtgggact ttanatctta 600 

agcaaatgan gcaaaagaag gaaaagacag ttgagaaaat caatctctga agttcagcac 660 

ttgatttcca ccgtggaccg gactcctgca netttgeatt ngccttggtt cctggccatt 720 

ttccnaaccc gggttccttt ttgan 745 

<210> 586 
<211> 749 
<212> DNA 
<213> Homo sapiens 

<400> 586 

tgttctaata ctaggtntac tegecttttg caggatctna tcgattcnaa tteggcaega 60 

ggggtcctgg tgggagtnnc atncagcagn ganngcattc tttccncaca ncagtnaacg 120 

gtcttattaa nagccaccac tttnetgang cctgtacagg ccttgnnggt tnggngaaca 180 

gaaatnncgc aggcacttgt accttcaagn anggacttgt gectnactgn nagggttggc 240 

gttgaccttg gctcnacnga catacccant ctgacttnna aegngenegt ctnagcttac 3 00 

getagactge aennccaagn ttgeangect nttntgnctt ccctgcattn accaatgaca 360 

gtacgaccaa cagtcaanga aaagtgccaa gatatatcta tcccatttct tctacacctg 420 

tanattcctn actatgetea aactatgtgg ngcaangaan actggnngac atttttagtc 480 

aatgatgctg acaattaatt actggtgngg ccaggcatat nttcaegget gcttgtgatg 540 

ccaacnaaga acgggcccca gcccatcctt actcctngnc cccaaanaga tccagtgnga 600 

atgggaagct gnnannacca acccaactnn tgatttacca ccaacnccaa anatcacgea 660 

tgnnnacagc aaaacaacaa cncnatgcac ttaacaagna nccnaaaant naactcgngc 720 

ctctaaaact attngggant cctttanct 749 



<210> 587 
<211> 783 
<212> DNA 
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<213> Homo sapiens 
<400> 587 

gttctaatnc ttggcctact cgcctntctg caggatcttn tcgaccttat tcggcacgag 60 

cccaaggcaa gctgttaaca aaatcaacct gggccaatca tcaaagggtt ggacctaagg 120 

ttgctatact caatagaaca agcattttaa ataaatttct cgtaagttgt tgctttcttt 180 

stStg^tggg tgtggcttta aagagcacaa aaccacaaca aatcaaagag tagctcgggc 240 

ttgtcttttg ctttatggct gagggtttga aggatgattc atggacttgt gaatgccagc 300 

cccagtcccg gcttaggtct atctgccaat accaccaggg ccaacaaatt cacgcaacaa 360 

attctctcat tttttacagt ttatcagttg cactcatagt tattgtcata atcactcccc 420 

acagtaacct gtaaggcata taaagtagct attttagtaa gataaatgat attttatata 480 

tgttatgata agataaatct tatcatttta agaagaaact gagctcggag agatgaaatg 54 0 
acttcctcag ttgctgctgt aataaaagtc tactttttgc taaaaaaaaa aaannnaaat . 600 

atnntntann attnnantaa naanaaaaac ttcgagccnt tttnaaactt tnantggagt 660 

cnntttntcc cgtaaaatcc nnnactttgg atnaanannc catttngatn aagtttttgg 720 

gacaaacccc ccaacttaga aattgcnntn ggaaaaaaaa ntgcntttta ttttgnggaa 780 

aan 783 



<210> 588 
<211> 771 
<212> DNA 

<213> Homo sapiens 



<400> 588 

tcttctaatg ctggttacan gccttctgcn gatccctcga ttcgaattcg gcacgagata 60 

ctttttaaac cttttttggc agctcagatg gtgtaaattt taaaattttg tataggtatt 120 

tcataacaaa aatatgtatt tcttttttgt tattttatct tgaaaacggt acatatttta 180 

gtatttgtgc agaaaaacaa gtcctaaagt atttgttttt atttgtacca tccacttgtg 240 

ccttactgta tcctgtgtca tgtccaatca gttgtaaaca atggcatctt tgaacagtgt 300 

gatgagaata ggaatgtggt gttttaaagc agtgttgcat tttaatcagt aatctacctg 360 

gtggatttgt ttttaaccaa aaagatgaat tatcaatgat ttgtaattat atcggttgat 420 

tntttttgaa aagatgaacc aaaggatttg actgctaata ttttattcct tacacttttt 480 

tctgaataag tctctcataa tgagtgcagt gtcagactgt gcctactctg atggtatngt 54 0 

gccatttgta aaatnanaat aagagcagaa aaaacacaaa nangagaaca ctggnttcag 600 

acattcantg gggcaagtta aattatggga ctgcaaaaat aatggatttt ttattcaaag 660 

aaaagcttta aaaagtttta ttatccanat ttacaaccca ctanttaagc taaataancc 720 

tactttnaaa aatngnaaat ggttnctatc tttataangt gccaanttna n 771 

<210> 589 
<211> 844 
<212> DNA 

<213> Homo sapiens 



<400> 589 

tncactnnaa tccttntnta aaaagccttc tgcntgatcc catcgattcg aattcggcac 60 

gaggccagag cctagaggag agatcaaaga cnttngccga agtgaagccc attctgcaag 12 0 

caactgggtt cccatggcat gtggtggcct tagaggaggt gttcagcctg ccaccgtcgg 180 

tgctttggtg ctctgcccag gagctggtgg gatccgaggg ggcctacaag gcggccgtgg 240 

acagcttcct ccagcagcag catgtgctgg gggccggggg tggtcctggc ccgactcaag 300 

gggaggaaca gccaccccag cccccgctgg acccccagaa cctggcaaga ccgcctgccc 360 

ctgcccagac tgaggctctt tcccaactgt tctgctcaat gaggacactg actgccaagg 420 
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aggagcttct gcagaccctg cggacccacc tgatcctnca 
actccaaggt catgactggg gacagntgca cacgcttggc 
tgcnctgggt ccaaagggcc ttcctggcct gggatacngg 
ccgggngaac gtnggtggtn ggtgccggnn cattgcctgg 
angaangntc gnttatttct aacaaaaccc ggnccttgnt 
tggnnnnttt tnaanacncn annncccaat tngnaanaac 
aaaaaaangg ggccnatatn ntntntcana cccgnggnct 
tnct 



cgtggcccga gcccacggct 
tatcaagctc atgaccaacc 
ctttcttgga tgaaccngna 
gaaccaccac ccccttnaaa 
tcntaccntn ttccctntct 
ccnaaaangg gnccctttgn 
ttgaatnngg aaaangccnc 



480 
540 
600 
660 
720 
780 
840 
844 



<210> 590 
<211> 767 
<212> DNA 

<213> Homo sapiens 
<400> 590 

tctaatgctt ggntctngcc ttttgcggat ctttcgattc gnattcggca cgagagaacg 60 

ttctcaggtt gaccagctgc tgaatatttc tttaagggag gaagaactta gtanntcatt 120 

gcagtgcatg gataacaatc ttctgcaagc ccgtgcagcc cttcagacag cttatgtgga 180 

agttcagagg ctacttatgc tcaagcagca gataactatg gagatgagtg cactgaggac 240 

ccatagaata cagattctac agggattaca agaaacatat gaaccttctg agcacccagg 300 

tttggcatag aaatggtacc ccttgttcaa aatgaacaag aagccttaga tttggatggg 360 

gaacctgatc tgtccagtct agaaggattc cagtgggaag gtgtttccat ttcctcgtcc 420 

cctggcttgg caagaaagcg aagcctttct gagagcagcg tgatcatgga cagagctcct 4 80 

tctgtgtata gcttcttcag tgaggaangt acaggcaaag aaaatgagcc ccagcagatg 540 

gtttcaccta gtaactcatt ganggctgga cagaaccaga aagcaaccat gcaccctcaa 600 

acaaggaagt nacacctcng gctggccttc ccttccgaac aggtgaaagg ggcttgaaaa 660 

atgttgctac cccaaaggcg acattnntgg caccaaatta tccctcttga ccnntttaat 720 

accttttgat tncatttngg caaaagactt tgnaccagcc nnggaga 767 

<210> 591 
<211> 765 
<212> DNA 

<213> Homo sapiens 
<400> 591 

tctttgaatc cttttgtaaa agccttttgc atgatccctc gattcgaatt cggcacgaga 60 

cttcttgttt gcctttttta taaggaaatg ttggagagtt acatcattgc taatgtagaa 120 

atgttaagtg gaaaaatata cagtttggta aaataaacta gattctacat ttatttgtgg 180 

gtttttttcc cctcctttct ttccacagca cttttgatat caagcaagtg gcttcctttt 240 

tgagatatta aaaaaaaaaa gaaaaggaaa aaagtaaatg aagcccaact acctaaccct 300 

ttcttatttg tatttgtttt agtattgtga agttgtgtta aatagtacta gctagaaata 360 

caaatttctg gttatcattt ctcttccctg tggcacttga cattttaatt gtcttaaagt 420 

ttttgaagtc atcttctggc cccttgagta ctgccagagg caaaagatgt ttgtttctta 480 

ttcattccac ttttgtctcc tgggatccct tctgtagcct aaagtatggc tgggaaatgg 540 

acttgagaag attggcttga attangatca taatcatgtg tgatcccatc atgaattcat 600 

tggaatntgg ggtncatgta angcaatcnt tctggtgtaa atcttccttt ttttaatgna 660 

catatanttt tggaaaaaat tttgaattaa ccctgaaaat ttttaaaaaa gccctcttan 720 

aactattann ggaggtcnca ttaccctaga atccanacat tnant 765 



<210> 592 
<211> 757 
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<212> DNA 

<213> Homo sapiens 



<400> 592 

tnttcnaana ctngttctng ncttttgcag gatcccatcg attcgccaaa tctgcctaga 60 

gattgagttc acagtgtatg ttctgggggc gctggtgcag tcagcggtcc agtctccagc 12 0 

ctgcaggcgt gcacactggg gtggacgatg ggtggccccg caggtgtaca catttgggtg 180 

gccccggccc ctatacccca gtgttctctt tgatccagtc ccgaaacaga gggagccttg 24 0 

tgtacacgcc tncaaagtgg agctgggagg tagaagggga ggacactggt ggttctactg 300 

acccaactgg gggcaaaggt ttgaagacac agcctccccc gccagcccca agctgggccg 360 

aggcgcgttt gtgcatatct gcctcccctg tctctaagga gcagcgggaa cggagcttcg 420 

gggcctcctc agtgaaggtg gtggggctgc cggatctggg ctgtggggcc cttgggccac 480 

gctcttgagg aacccaggct cggaggaccc tggaaaacag acgggtctga gactgaaatt 540 

gttttaccag ctcccaaggt ggacttcant gtgtgtattt gtgtaaatga gtaaaacatt 600 

ttatttcttt ttaaaaaaaa aaaaaaaaaa actcgancct ntanaactat tagtgagtcc 660 

tatttacctt agatncagac atgataagaa tncattgatg aattttggac aaaccacaac 720 

ttggaatgca ntgaaaaaaa atgctttatt tgtgnat 757 



<210> 593 
<211> 766 
<212> DNA 
<213> Homo sapiens 



<400> 593 

tcttgaatnc tngttnntgc ctttttcgga tccctcgatt cgaattcggc acgagagaac 60 

attggtgtgt gagtgttttt tgatggtgca ggacccggag gtgctttcct tgccaagaat 120 

agaaacatcc agaatgctcc tccccatccc ccaatcccag acagcaatta tgtcagccct 180 

gtaaggcatt gcctgctctt gaccctttgg cccatctttt tatttttaaa aaattcccat 240 

gtcacagatg ccctgtctat gcagagggtg gcgtgggatg ggtgaccact aagtttaggc 300 

tggtgaaggt ggtgagccct tctgaggccc tgatagaact ttccaggagt tcatggtccg 360 

cggctccagc ttctcactgt aaagttgtca tcctggcaga ggcagccaat gcttttcatt 420 

ctagggggta gagatttatg ctaatgagtg aatattgcac cactagtgac tttctgttta 480 

aagttcagct cttagaaaat ggaatcttac ctgaccccta gtgaattatg tacataagca 540 

gggaatgttt ccaactagat ctccttcaga agagtccctg tgctggaata ggtcactgaa 600 

tcttatttgg ntttgtnaaa caaaagcttt tgggtctcgt ggggtgtgtg tgtgntttgg 660 

ngtgtgttgc cccntntgcc gtttcaaata aaaggtttgg taccaccttt tcaaaaaaaa 720 

aaaatantnt anntnanant nntntancnt ttnttnncnt tanant 766 



<210> 594 
<211> 754 
<212> DNA 

<213> Homo sapiens 



<400> 594 

ttgnttagga tcccatcgat tcgaattcgg cacgagggaa ggcagtggga ggagaggacc 60 

aagtctcaaa ctccagaagc cccacctccc tgagctcagc tcctctgcca agccccctca 120 

gcgcgaagtc ctcgtccaga gaaggcaacg gcgagaaaca aatccaacat cctgggctgc 180 

tttttccttc ccccactttt taaaagtttg gtgtccaagt cacttgacaa acccagaccc 240 

taacaatgat attttgtgta gaattctggg atcaaaatat aatttcaaaa ataatatatt 300 

ttctgacatc ccccaaaaaa aaaaanaaaa aaaactcgag cctctagaac tatagtgagt 360 

cgtattacgt agatccagac atgataagat acattgatga gtttggacaa accacaacta 420 
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gaatgcantg aaaaaaatgc tttatttgtg aaattttgtg atgctattgc tttatttgna 
accattataa agctgcaata aacaagttaa caacaacaat tgcattcatt ttatgtttca 
aggttcangg ggaggtntgg gangtttttt taattcngcn ggcgcngcnc caatgcattg 
gggccccggt nncccanctt ttggntccct ttaagnngan gggtaaantg ncgcncttgg 
cntaatcttt gnncatnggt tggnttnctg nggngnaaat tggttttccn ggnnanaatt 
tccccncatn ttangatccc nggnngntnt aang 



480 
540 
600 
660 
720 
754 



<210> 595 
<211> 767 
<212> DNA 

<213> Homo sapiens 
<400> 595 

ggtttaatgc tgttnnaanc cttcttnanc ctttgtacag catccctcga ttcgaattcg 60 

gcacgaggaa cgcttccatt ttatacctgt gtctagttag tttctgccta tctatccaag 120 

aagcttttat caagggtcca ccatgtgcca gccactgaag tagatataaa tacaaggatg 180 

tgtaaggtat ggatgatggt atacgaactg tcatcttact ggatttgtcc gctctgttaa 240 

agatacggtt ccgaaaactt tttaaagccc tagagagggc tttaaggcaa tgtagcatca 300 

tatatagagg catnaacctg ttcatatctt tctatttaac agaactgtgc acctgggcac 360 

aagggtgtgc acaacaggat gtgtacagca gcactgttaa agtgtancac atccatacta 420 

cangatctta tgcaactgtt ggaaagaatg aagcgatgct gcactgtggt catgcagtga 480 

tctctaagac atattaactc gaaagcaaaa ggtttaacaa tgtatnacaa actgggctgc 540 

aattgactcg cgcctgtaat cccagcnctt tgggaggctt gantaaggcg gatcacctga 600 

ngtcangagt ttgagaccaa acctggccaa tgttggccna aaccnctgct tctactnaaa 660 

ctacnaaaaa ttaacctngg gcntggttgg ctccgtgcct tntaatcccn gcttactcgg 720 

caatgcttga gngaangnan aattngcttt gaacctnggg gaggnng 767 

<210> 596 
<211> 743 
<212> DNA 

<213> Homo sapiens 
<400> 596 

tnttnaatnc tnttttaatn cttgctgcan gatctttcga tgatcccatc gattcnctgg 60 

tctcgaacac ctgacctcag gtgatccatt cgncttggcc tctcgaagtg ttgggattcc 120 

aggcgtgagc cactgcggcc agcacatttc cacttntaga tec tact cca taccacaggt 180 

ttcatttaag angaaaganc tanataaatg tgctcttntg gataccccac cctgacagan 240 

tgcattgtta cacagntanc atgggttgac actgeaanct ggcctgtcag ccatnggagg 300 

ngtttannga aaggcanatn atgtnactct gtgncagggn gecatntget tacccntnac 360 

ctagcatang gggnttctac gggtgacccc nagcatattt ctaggttact tatgggcaga 420 

tttgtaagtg acaaaactcc agetgatget gggaatgggg agagggcect tganggactt 480 

tgtggntttg tgcttctggt ttcctggcca accccagggt cacttgtctg gagcccagct 540 

gggcactaat gtctgccanc gactatntta cagtgtataa atgattcctc tatttgggga 600 

gagatcttcc aatccagaag agcccctntt ggactgcctg ggttaaatct gcatagcana 660 

agtggttgat gagtcatctg aagaaattca gccccaactt nncaacctgc ccttcctgnt 720 

tcctttttaa tggnggcctn tgg 743 

<210> 597 
<211> 786 
<212> DNA 

<213> Homo sapiens 
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<400> 597 

ngtttnnncc ngtttttaat ncnttgctac tngctctttt tgcaggatcc catcgattcg 60 

aattcggcac gaggacanac cgttgagagg acgtggaggc ccnttagggg gtntgcncng 120 

nanaggcaga ngtggccctg ggaacagagt tttatgacnc ttttnaccat anangaangn 180 

gagaatttna aagatatggt gggaatgaca aaatagcagn cataactgaa gacaacatgg 240 

gtggatgtgg agtttggnac ctngggatcg ngnaaagata ccagtgatgt ggagccaact 300 

gctccgatgg aggaacccac agtggtggag gagttccant gcanccngga agaggagtat 360 

ccagcctaag ttnctgactg gatgtcaaga agaaacccaa nttataanag atgactntan 420 

ntgantggnn aaatctttca gatcanncca gaccatancn tgagtttaac atccgnaanc 480 

cacaatccan tgnnccttac taagccgtgg tgattnacaa gtcataaatc cattanatga 540 

tgtggtnaaa gatgcctatn atgaccnatt ctccatngtt ntccngaaac ccgtcaattg 600 

acatcacatn tcctnttgga gattaaattt tnggtnancn tnccttcgtc cttgggcatt 660 

ngaacncata agaatgcacc cccnggntag gcccngtnna aaggttnatg aaggccntta 720 

taanttttgn nnccccaanc attaaantgg ctngattccc ttaatntttt cctcccnaac 780 

ccagnt 786 



<210> 598 

<211> 809 

<212> DNA 

<213> Homo sapiens 



<400> 598 

ngttttnnnn cnnnttttct aatgcttgct tctcgttcct ttgcaggatc ccatcgattc 60 

gaattcggca cgaggacaga ccgttgagag gacgtggagg cccgagaggg ggtatncncg 12 0 

gcagaggcag aggtggccct gggaacagag tttttgacgc ttttgaccag agaggaaagc 180 

gagaatttga aagatatggt gggaatgaca aaatagcagt cagaactgaa gacaacatgg 240 

gtggatgtgg agttcgaacc tgnggatcgg gtaaagatac cagtgatgtg gagccaactg 300 

caccgatgga ggaacccaca gtggtggagg agtcccaggg caccccggaa gaggagtctc 360 

cagccaaagt tcctgagttg gaggtagaag aagaaaccca agttcaagag atgactttag 420 

atgagtggaa aaatcttcaa gaacagacca gaccaaagcc tgagtttaac atccggaaac 480 

cagaatccac tgttcttcca aagccgtggt gattcacaag tcaaaataca tagatgatat 540 

ggtaaaaaga tgactatgag gaccattccc atgttttccg gaaaaccccc cattgacatc 600 

acattccaac ttggagatta aattttgggt aacccttcct ttgtncttgg gccttngaac 660 

ccntgaagga aggcaccccn ggtgaagggc ccngggggaa agggattcan ggnaaggggc 720 

cantaanaaa ccttttggga ccccccttaa nccaataaaa tttggtngaa ttgcnangga 780 

atggtttgnc cccccnaaac cccnaaant 809 

<210> 599 
<211> 759 
<212> DNA 
<213> Homo sapiens 



<400> 599 

tttntaatnc tttttcnaat gctngcttca ggannntntg cangatccct cgattcgaat 60 

tcggcacgag ccaggttagc tgctgaatca aagcttcaaa cagaagttaa agaaggaaaa 120 

gaaacttcaa gcaaattgga aaaagaaact tgtaagaaat cacaccctat tctatatgtg 180 

tcttctaaat ctactccaga gacccagtgc cctcaacagt aaagactttt ctttaataag 240 

agtacggtgc cacttgcctc aaaagttact atggtgctta agattgtctt gatctgacat 300 

atatcacctt ctgggttatt tactcattgt gccaggacct ggcattttca tgtgcctttg 360 

accaagtgtt cagaatttgc ttgactctaa cctggagagc ttcttaagtg atgccccttc 420 

atggagcttc tatgacagtg aataaactat taattgaagg aaaatgttat aattaatgta 480 
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tctatttgct gcattgtata tggattaaat gataaaaaac aagtaatcta ccctcagagc 540 

catgtatttg agaatgcttc aatcatattt tcctatgtac ctttttttta taaacttagt 600 

tttagactat gttgtaaaaa tggggaaagg ttgtaaacta tgtngtaaaa aatngggaaa 660 

tgtggcttta aaatatatnc attatatttg gttcaaggat tttggcaggg gntaaaggaa 720 

ncnatggttc aatctttgna tttatatacc ntgatttaa 759 

<210> 600 
<211> 769 
<212> DNA 

<213> Homo sapiens 



<400> 600 

ttttaatacn tttttnaatn cttgcttncg ntcctttgca ggatcccatc gattcgaatt 60 

cggcacgaga gcaattccac tcctagctcc acccacaggt aattgaaagc aaagacgcaa 120 

acagatgcct gtgcaccaaa gttcacggca gcatccttcg ccatagtggc agcatccgtc 180 

gtcacagcgg natcatcctt catcatagcg gcagcatccg tcgtcacagc ggcagcatcc 24 0 

ttcgccacag cggcagcatc tgtcgtcaca gnggcagcat ccttcgccaa agcggcagca 300 

tccttcgtca tagcggcagc atcctttgcc atagcggcaa ggtggaaacc ctgtccatcc 360 

actgaggcgt gcatagacta aacatggcca gtccaggcac tggaatccag gccgtanaac 420 

ggngcccacn gtcaaaagga atgagaccct gatgcactgg gcgacacaga cgggcgacac 480 

agacttggag acatcatgct aagtgaaaag ccaggcacac ggagcggacg gggtgatcct 540 

gctcacgtga tgtgtcccga atgggcacnt tcagagggga agaanggaga tggcgcttga 600 

cngtgnccgg gacnggggtt gggagcgacc ggttgttggt ttngggtttc tttctngggt 660 

gaaggaaatg tttttgatat tggggccgtt tgggtgatnt ttgcattacc ctttgaatat 720 

gcttanaacc cnctagaaat tgnnacactt tttaaatngn ttggaaatt 769 



<210> 601 

<211> 755 

<212> DNA 

<213> Homo sapiens. 



<400> 601 

ntgtttaata ctattttcta atacttgctt tcgttctntt tgcangatcc catcgattcn 60 

aattcggcac gaggagacag cagcccccag ggaatgaagc tgatgccaga gtcagacccg 120 

aggaggaaga ggagccactg atggagatgc ggctccggga tgcgcctcag cacttntatg 180 

cagcaactgc tgcagctggn cctcaagtac ctctttatcc ttggtattca gattctggcc 240 

tgtgccttgg cannctncat ccttngnagg catctcatgg tctggaaagt gtttgcccct 300 

aagttcatat ttgangctgt gggcttcatt gnnagcancg nnggacttnt nctgggcata 360 

gctttggtga tnagagtgga tggtgctgtn anctnctggt tcangcanct atttctggcc 420 

agcagatgta nnctatatct gtgattactg gcacttggct acagagagtg ctggataaca 480 

gtgtagcctg cctgtacagg tactggatga tctgnaanac aggctcagcn atactcttac 540 

tatcatgcaa ccaggggccg gttgacatct aagacttgnt tattctatag ttcnagganc 600 

acaatggaat atgatccctt aactcctgat ttgggatcat ctgaaggacc aaggnnggca 660 

gtcttcgaag tggaataaaa tagccccggc ngtngtgact tgcacctata ttcccagact 720 

tttgggaggc naannttnga aggattgntt gccct 755 

<210> 602 
<211> 773 
<212> DNA 
<213> Homo sapiens 
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<400> 602 

nttgtaatag ctggtttcta aannntngnt ttcaacccct ttgcatgatn ccatcgattc 60 

gagcaaatca agatcttcag gtacagttgg accaggcact ccagcaagcc ttggatccca 120 

atagttaagg caactctttg tttgcagagg tggaagatcg aagggcagca atggaacgtc 180 
agcttatcag tatgaaagtc aagtatcagt cactaaagaa gcaaaatgta tttaacagag ' 24 0 

aacanatgca gagaatgaag ttacaaattg ccacgttgct acagatgaaa gggtctcaaa 300 

ctgaatttga gcagcaggaa cggttgcttg ccatgttgga gcanaagaat ggtgaaataa 3 60 

aacatctttn aagtgaaatt ngaaatctgg anaaatttaa gaatttatat gacagnatgg 420 

aatctaagcc tttagtcgac tctggtactc tggaanataa cacctattat acagatttac 48 0 

ttcatatgaa gctggataac tnaaaacaat agaaattgaa ngcactaaan gtgaattgtc 540 

atacaagcga aatgaaancn ttatttgana gccngcgggc ttctaacata ttgagcgata 600 

actttttgca aatgaaagat gcccttcngc tttntgaatt gnaaaatatt gaaacctgan 660 

agntnanctt agntgaattg aaacttaaat ttgaaccctg nacnanaccg gttaantgcc 720 

tgttcctgat aaaaanaagc cntnangtgc ttncctgntn gatttanccc ccg 773 



<210> 603 
<211> 784 
<212> DNA 

<213> Homo sapiens 



<400> 603 

tgctttntaa tagctgtttt taaatncntn gctttgcgct cnntttgcag gcatcccatc 60 

gattcgaatt cggcacgagg gggacatcag tgatcgtaag tctcctgggn ccgttattct 120 

canattaggt gacggagcta agacttcgag accatctcgt cctttntgta tcgcggaaac 180 

ctgangaacg agccggcggc ggtgacctgc acgagaagcc aggctaactg ggtgaagtac 240 

catgcaagca tttcttaaag gtacatccat cagnactaaa cccccgctga ccaaggatcg 300 

aggagtagct gccagtgcng gaagtagcgg agagaacaag aaagccaaac ccgttccctg 360 

ggtggaaaaa tatcgcccaa aatgtgtgga tgaagttgct ttccaggaan aagtggttgc 420 

antgcttgaa aaaatcttta gaaggnngca natcttccta atctcttgtc ttacggacca 480 

cctggaactg gaaaaacntc cactattttg gcagcaaact tgagaactct ttgggcctga 540 

acttttccga ttaagaattc ttgagttaaa tgcatctgat gaacctggaa tacaanttag 600 

nttcganaag aaagtgaaaa atttttgctc aattaanctn gtgtcaagga aaatngnttc 660 

anatgggaaa gccgttttcc ncctttttaa gantgggaat tcttngatga ngncnaattc 720 

ntnttganct taactgnntt angcagcttt taaaaaanta ccattggata aangagtccn 780 

aant 784 



<210> 604 
<211> 801 
<212> DNA 
<213> Homo sapiens 

<400> 604 

gttncnnctn aacccttttt tgaaatcnnt ngcttctact ctttggcatn catnccatcg 60 

atncggcccc gtgtggggag acngacagca ccctttttnt ctggcatttg cccttgangc 120 

tatagcgcct cccctctccc ctcagagggc acagctgcag gcctgaccaa ggccacgccc 180 

ggctctcgtg ctctaggacc tgcacgggac ttgtggatgg gcctggactc tccagaaact 240 

acttgggcca gagcaaanga aaacctcttg ttttaaaaaa attttnttca nagtgttttg 300 

nggaggagtt ttagggcttg gggagaggga ggacacatnt ggaggaaatg gccttctttt 360 

taaaagcana naacacataa ccttacaact gcctggcaag cccaatatca cttgtttggg 420 

ccctancggg actccaangn agccacacgc cccttctgga agggtgtgng catgtnaant 480 

gtgtgccanc gcgtgggctg gcgtgtgaan atctatnaaa taagtatana tgggngtnta 540 
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ntatatgtgt ntaaaataaa ngantggaca tatttggncc tctgngnana nncttngaat 600 

ctaagncaag agtnnntctn gaaaaacnaa ananagtnct ntntanannt ttacgtaatn 660 

atcaatactn tntccacntn accctnctnn tanntntncc natatantcg antaattcnt 720 

cactcntnna ttcctngtna acacnaatna atnnaactat naaatatntn tnctnnntan 780 

tngacatann catncnnncc g 801 

<210> 605 
<211> 759 
<212> DNA 
<213> Homo sapiens 



<400> 605 

gnttctaatg tggttcnaan acttgctttn gctcctttgc aggatcccat cgattcgaat 60 

tcggcacgag agcctcgcct gggccggcct gtggctccca ttttcctttc agcgggacaa 120 

aggggacttg ttaccaggcc attttctgga tggcctgtga gatctctgcc cctccaagac 18 0 

cctccaagtc tgagcctgac ccacagctgg gacactgaat tcagccctgg gaaccatggg 240 

ggcttctatc tggcaccagg ctgcagcctc cccaatccca gcccactttg ctgtgtctct 300 

ggcgggctgt cctccttggt gggagctgtc ctgcacactg taggatgctt aaaggtatcc 360 

ctggccttca cccatnccta gccagcagct cccagtcaga caacagccag aaatgtctcc 420 

agactctgcc cagcctccca ggtagccacc ctcgagacat gacctcagag tctctgtgtc 480 

tcctagaagc ctgacagaga cccccanggc agtgggtggg tggcgggcta gagacccttg 540 

cctgtgtccg ggaccctggc gccgntcttc cctcctgtgg atcccttcgc acttacaagt 600 

gttctnaant gggcagacgc ctgggcaccc cttgggccct gcccaancat ggccatngng 660 

cangcttttt naacccgcat nggntttcca ngcctggtga atcttgcttt tccanggacn 720 

nnttggaacc tttcctncgg ggcggggccc ccnagcnct 759 



<210> 606 
<211> 809 
<212> DNA 
<213> Homo 



sapiens 



<400> 606 

tctncgtnaa tcnnnnnttt aaaagccttt gcttttgctc nctttgcttg atcccatcga 60 

ttcgtgactt tgtacctggt ccaagctgat ggggttttgc tgctgttgac ccaggcagga 120 

gtctgactag agaacaaact aaggttgctg caacaaacaa ggacctcttc caagaagggc 180 

tcccaggcct ggcgcagtga ctcatgcctg tgatcccagc acttgggagg ccnaggcggg 240 

tggatcattt gaggccagga gttcgagacc agcttggcca acatgatgag accccgtctc 300 

tattaaaaat acaaaaatta nccaggcgtg gtggcgcctg tagtcccaac tactcaggag 360 

gttgaggcag gagaattgtt gaacccggga ggcggangtt gcaatgagcc aanatagcac 42 0 

cactgcactg catccttggg tgacagaagc gagactccat cttaaaagaa gggctcctgt 480 

gtctacgtca tggtggggct anagagangt cccngcagct gggctgtgtt gagtganngg 54 0 

ctnntctttt naannccagg caatagtttg tcttgactct gtccttttct gngtccacat 600 

gacattttac atntttncnn agtttnctta atttaaagtt gnctaatttt accattatac 66 0 

attttnaatt ggcatttctt ttaccnatnc tttttgtntg aaaatggtan tntttgaaat 720 

cngnatcngt tctaatgngn tntattttna ccnaatgcca atnntacctn ctttgnaana 780 

atntattcgt tttcnaagnt tnaacctct 809 



<210> 607 
<211> 788 
<212> DNA 

<213> Homo sapiens 
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<400> 607 



tntttctaat acnagtttnc 
cgcaaggccc gaggtgccat 
tactgttcgg taaagagctc 
accatcttcc cctgcaagga 
cacatgaaaa ctcacaggca 
tttgcagctg agatggcagc 
ctgctgcccc tggaccagct 
gacgacgtcg tccaacantt 
cttctcttgg atgatcaaga 
gtgtncactt atagacagtg 
ggaatgcctg ctttgttttg 
atttttantt nacgntttan 
aaaaattngn tttttgttgg 
taaaaaat 



aagncttgct ttnnnatccc tttgcaggat cccatcgatt 
cccctctggg aagcagaagc ctggtggcac ccagagtggg 
accctctcac agcaccacca gcggcgagac agaccccacc 
gtgtggcaaa gtcttcttca agatcaaaag ccgaaatgca 
gcaggaggaa caacagaggc aaaaggctca gaaggcggct 
cacgattgag aggactacgg ggcccgtggg ggcgccgggg 
gagtctgatc aaacccatca aggatgtgga catcctcgac 
gggaggtgtc atggaagang ctgaanttgt ggacaccgat 
ttcantcttg cttcatggtg acgcagaact ataaagccct 
aaaacccacg ggtcttcatc tttattaatc nngaaacctt 
taaccccttt ttaaaaccta cctgttttta aaaagtggtc 
aaanaaaaan tcctatttct ttttcctttt nattttaaaa 
ggggntttgg ggggaattaa aataatttgg cccccaactt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
788 



<210> 608 
<211> 796 
<212> DNA 

<213> Homo sapiens 
<400> 608 

tcttttaatg ctttttncaa gccttgtttn aaatcctttg caggatccca tcgattcgaa 60 

ttcggcacga gactaccccg gctacggttc ccccatgcct ggcagcttgg ccatgggccc 120 

ggtcacgaac aaaacgggcc tggacgcctc gcccctggcc gcagatacct cctactacca 180 

gggggtgtac tcccggccca ttatgaactc ctcttaagaa gacgacggct tcaggcccgg 240 

ctaactctgg caccccggat cgaggacaag tgagagagca agtgggggtc gagactttgg 300 

ggagacggtg ttgcaagaga cgcaagggag aagaaatcat aacaccccca cccnaacacc 360 

nncaagacag cagtcttctt cacccgctgc agccgttncg ttccaaacag agggccacac 420 

agaatacccc acgtttttat ataaggagga aaaccggnaa aanaatttaa aagttaaaaa 480 

aatanccttt cngttttaca ctactgntgt agactcctgn tttcttcaan cacctgnaga 540 

ttcttgattt ttttgttgtt gatgntctct ccattgcttg tngtttgcnt gggaantttt 600 

atttaaaaaa aaaaaaaatt cttgtgagtn gactttggnt tttaaaccan tgntagattt 660 

taacngnacc cttaatgggt tgtacntata tgntttnaaa acatgnnaan aaatatttaa 720 

tgtaaaggnn ctgttnntaa atntaaccac ntanagaant tnnaaannnn ttnanccctt 780 

tagaacnatt nntgng 796 

<210> 609 
<211> 790 
<212> DNA 

<213> Homo sapiens 
<400> 609 

gnnntttaaa nacctntttc aatncttggt ttnnaatcnt tttgcaggat cccatcgatt 60 

cgcatccagg gagaacctcg gggctgggac acctcctggc cctcaccctg ggtcatgttt 120 

acagtcctca gtgccccaca ccggtggccc cctgaggaca cctccaccct gaccttgatt 180 

ttcccaaacg ctgcctcttg gtgacagact cagcccaaaa ccccttcctt ctgtctctgg 240 

agacccttga gcttggggaa atatggaggg gtgtgtgtct gcaatcaagg cctctgcagc 3 00 

tcacggctgg cccggtgggc tgggacttcc gtctgaattt taaatactta gggntcattt 360 

tttttctctg gcaacaaagc ttgatgtttt cactgcttta gtttcctgtt tgctggtggg 420 

aggggatacg gtctgtgact ctggacttgc tctgggggaa cagttgtcac tgcccccngg 4 80 
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gagaggggca gcttgggctt ggaagaaagc acaccccnga gaccagagcc ccttcnagag. 
ggatncttgg ctgcttcatt gnctttcccc cagcaagccc tgctcttcca caagcncctt 
ntggggtctt gggtatggtc ccccgntcac cttctttcca nantccctga nntggtgtag 
ggttgtgggt tggcacangg aattttgggg cattggggaa ggggntttca aaacttttnc 
caaanacccc cgtgttcctn ngnaaaattn aanttggtgg gcttngggtg ntnaccccca 
antcttngnc 



540 
600 
660 
720 
780 
790 



<210> 610 
<211> 786 
<212> DNA 

<213> Homo sapiens 
<400> 610 

gatgtttnnn annctggttc taatncttgg aaanctncnn ctttgttann ngcnntttct 60 

gcaggatccc atcgattcga attcggcacg agcccagctg gacctggtgg ccctttccta 120 

gtgcctctgc tgggggagga gaacctctgt ccacgtggag gctaggaggt ctcaggtgct 180 

gccctggcag caccagagtg tgggccgggc ccgagtgtct gcccctcggc cctcagggtg 24 0 

gggcacttag cacccagaag ggaccaaaag cagggcatgg cggtgcagag gagtttggga 300 

ggtgtaaaca gccccatgca cgtggaggag gagctggctt tcagccccag accccacgct 360 

agcactttcc acgctgcttg cccgctgttg atgtgcagtt cccagtgcct gtgtgagccg 420 

acatctgctc agtcctatcc ctcgtcagcg tgtggagacc cagctcctgc aagcccttct 480 
gcttccacgc ccccagacag cttggtggag ggtcctgcat ctgggccaag ctggggtgca ' 540 

cccagccaaa gacaaagctg ccttcacgtg cccaaaggat tcaagatggt gcactggccc 600 

cgggaggagt cttgaccaaa aatgggagcc cgctcttgtg gggaaanccc cgacttcccc 660 

caccnanaaa ccgntcccac ggtgccggan cttccccctt ttcctttgtg ggggcaacaa 720 

nattggcctt gggcnctttc aattnttncg gaagctttcc tgggtgtngg cttttgacct 780 

taaaat 786 

<210> 611 
<211> 938 
<212> DNA 
<213> Homo sapiens 

<400> 611 

tgttttaaag ccctntttng aatncntggc ttncgncccc ttggcaagat cnctctctgc 60 

aggatcccat cgattcgttt gtatttttag tagagacagg gtttcttcat gttggtcagg 120 

ctggtctnaa actcctaacc tcgtgatccg cctgcctcga cctcccaaag tgctgggatt 180 

acaggcatga gccaccatgc ccagccaaag atcatttttt tatatagact tcagnccttt 240 

gtaaatattg taactgggga gtatagagta gaaaaaaagt atagntaaaa catttgttct 300 

acaaattaac ctttaaaaat ataattactg ctaaaaatag agtgctgtta cacttaagga 360 

aaattagtgc cattttggaa atgagatctt gtgccataaa tncagctgaa ctgaatataa 420 

atgttcacaa attaatgctg tnaaaggaat gagttaagca gaaaaacttt taaccagcac 480 

ttctcaaaaa anaaaannna nnaattaaat nntataancn ncatnnanat ntatnntann 540 

tttncntctn nattncanta attttgtntt ncaaatantt nnacctnnan ctntgttntn 600 

nttnnnncna tnnantatcn ntttatcnan tatatnatta nctnattntn nngnanngna 660 

tcntnctcta tncnnnatnn tncatatnnc gtccnntnnn nnaantatgc ctcatnatat 720 

ntacnnnaaa ngtntangta tgnttantgc atnnncatna ctnntgatgt cnnagtnnna 780 

natattttgc cnctcattat tntgctnatn tatntgtttg acacannata ctnnnancna 840 

ttcatcttct cgcaatnngn gnacttttna nttacnnnna tgntannnnt natatatnta 900 

tcattagana ccttttnaat tntnnntncn nanacgcg 938 
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<210> 612 
<211> 771 
<212> DNA 
<213> Homo sapiens 

<400> 612 

tgtttgnaan nncggntntt gaaatncntg gtacrmaaac nctttngnaa ancncccctc 60 

nctgtntgat cccatcgatt cgaattcggc acgagataga aactaggcac tgatttgttt 120 

atatttntcc tgctcgagac acatgatgtt tcatgtatct gtggcttttt atagtttaaa 180 

ataatttctg gaaaagtcat agtcattatc tctttaaccg etccctctct tccattctct 240 

ttgttctctc ttcctcgaac tcctgttagt catttgatcc tccatatctc tgaatatttt 300 

tgtatttctt ttattattta tttcttgtct ctgctacatt ttacattgag taaaagtggg 360 

atgtgacagt gggaaatcat tagtgactta gaaattccag ttggtcattg ggccaatttt 420 

gatgctacct tctctctttt atttctcact tcaaaataaa atttgcaaaa acaaaaaatt 480 

aaatatagta tgagtccagt tactggccta aggagctaaa agcattctgg gtttgtatga 540 

agacagctga gttataacaa atgagagtac tgttgtgtga ctgcattaat tattcccttt 600 

ttaaatgtac aagagcaang cattctacct gactgngtta ttgagctctg cancatacat 660 

ggtgacanag ctaaaacaan acaagccnaa ccnanaagga aaaccccagc tttagggata 720 

ctctgntcat ngaatatagc ctgaaaaatg gntaatcaag aaagtnaacn t 771 

<210> 613 
<211> 774 
<212> DNA 
<213> Homo sapiens 

<400> 613 

tttgaatcct tgctttcaaa tncttggcac tngccctctc tgnaggaatc ccatcgattc 60 

gaattcggca cgaggtaacg tgacacgtat tttacttctt ttantaggcg gacacacttt 120 

cttaaagtaa taatacgtca tggccctgct ataaggtagt agttctagaa gactgtntat 180 

ctaataattc agactaaagc tatttatatt gctgtgacac cacgtggaaa acttttataa 240 

ttccatctta tttctgatgt atatgtttta ttttctctgc cttcataaga actaaaaacc 3 00 

aaagttattt acgtgaaaac aagatttttg tttgagttca tttacttgag atatgtttaa 360 

aaaatccacc ttctgtcaca ctatagaagt atattttgaa ttatcaaaag gtagaattat 420 

aactttcana aaagaaaaaa atggtcaatt tantttaact ctatgtcaaa aatttattta 480 

tagtctcata tattcattcc acaccccccg ttcttctttc cttctttctc cctctgcctt 540 

nttcttaatn atnattttta aattctgacc aaaaataaag tngtggcaag tactttctta 600 

gcataacctg gactggttga agnagtaatt ctgntccttt aaaaaaantc cccaactggg 660 

ncccnggnca ggnacaaaaa nttntaanga acatntggga attangcnaa atggatnttc 720 

cttggaggtc caacccccaa aaatcattag gncnaccaaa attnaaaata atcg 774 

<210> 614 
<211> 754 
<212> DNA 
<213> Homo sapiens 

<400> 614 

ttggantctt ctcngaaacn cttngcnatt gcnctntctg naggatccca 
ttcggcacga ggttcttcaa agccaaccaa gacaggcttn tnagttttag 
caaattgcca aaagccagag ttgtttatgc tagtgcaact ggtgcttctg 
catggcctat atgaaccgtc ttggcatatg gggtgagggt actccattta 
tgattttatt caagcagtag aacggagagg agttggtgcc atggaaatag 



tcgattcgaa 60 

agcttcagaa 120 

aaccacgcaa 180 

gagaattcag 240 

ttgctatgga 300 
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tatgaagctt 
aattgaggaa 

gtgggtcatt 

aatgaagaag 
tgcatagcaa 
ggaaaaatgt 
ctttggaaag 
gtgttgcnct 



agaggaatgt acattgctcg acaactgagc tttactggag tgaccttcaa 
gttcttcttt ctcagagcta cgttaaaatg tataacaaag ctgtcaagct 
gccagagagc ggtttcagca agctgcagat ctgattgatg ctgagcaacg 
tccatgtggg gtcagttctg gtctgctnac cagaggttct tcaaatctta 
tccaaagtta aaagggtttg tgccactagc tcgagaggaa atcaangaat 
gttgtaattg gtctgcantc tacaaggaga agctangaac atttagaaag 
aaggccggng ggagaaattg aatgattttt ggtttcaact nccaaaaggt 
cccttctttg aaaaaacatt ttct 



360 
420 
480 
540 
600 
660 
720 
754 



<210> 615 
<211> 774 
<212> DNA 
<213> Homo sapiens 

<400> 615 

tgtttnaatg ctgttttgaa atcttgtttc aaatcctttg gctacttgct ctntctgnan 60 

gatcccatcg attcgaattc ggcacgaggg attctttcac tgagcacaaa gagttgttgg 120 

ggctttagca tctgactgat tttgttacgg ggttgattct gaccatagga agtatgcaat 180 

gtgaatcact atttacagag aaacctacaa cagatgcttg atgttgtaga aactgggaca 24 0 

tatagatacc aagcaaaatt ataagaaacc tataaggtgt tcaatacgct tgtgtttcca 300 

aaattcactg tncatgatca gtttggtgtt cttgtaccac agtttttaac tgaaggaacc 360 

agttgtaaca gtctcaattt ttaactaaaa cttgaagaac taanacaaca atgcaaacct 420 

ttcagcattg tttggccaaa cttgttaaaa ctgtaatgca agaaccaaat gcactgtgat 480 

gtggcaccaa ctaattagca agcatgaatt tttcacccaa nagtgaaaaa aggaaaatct 540 

accatggctt naagtttaag agcagaactt cctgactncc attctatgac tgatcaaaaa 600 

nactaatagt ttaaaacctn agcangcctt gttcacgata tgcngaaaaa aaaagtgctt 66 0 

gcagtttann atccttatgg aantttttca cantgtnaca nggtnttgta atacnttgga 720 

ngccctacat tttcntanga atntattttn cttggcctaa nttggnntca angc 774 

<210> 616 
<211> 769 
<212> DNA 

<213> Homo sapiens 
<400> 616 

atnncntttt tgnaatcctc tctgaaatcc tttgctactt gctctttntg caggatccca 60 

tcgattcggc cagtcctcac cttccctagt cctcgtgtgt attttaggag atgcgtgggt 120 

gtggaacagc ctcctgcctc cggtccaggt gtactggggt ctgtgtgttg tgtttctgcg 180 

tgttctcggc agaaagtggc atgctgtccc gcctgggtga tttgctcttt tacactattg 240 

ctgaaggaca ggaacgaatc cctatccaca agttcaccac tgcactaaag gccactggac 300 

tgcagacatc agatcctcgg ctccgagact gcatgagcga gatgcaccgc gtggtccaag 360 

agtccagtag tggtggcctc ttggaccgag atctcttccg aaagtgtgtg agcagcaaca 420 

ttgtgctcct gacccaagca ttccgaaaga agtttgtcat tcctgatttt gaggagttca 480 

cgggccatgt ggatcgcatc tttgaggatg tcaaanagct tactggaggc aaagtggcan 54 0 

cctacatccc cttnctggcc aagtcaaacc cagacctgtn gggtgtctnc ctgtgcactg 600 

gtggatngtc aanngcactc ttgtgggcca cacaanagat tccctttttg cctgcaanac 660 

cntgtntgaa acccccttaa cttatngccn atttncntna agcaaccctt aggcnanttg 720 

actnncnttc acaanttttt ggggcnaaag anncnaattg gcctgccct 769 



<210> 617 
<211> 766 
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<212> DNA 

<213> Homo sapiens 



<400> 617 

aganntcttc ctttctaatn nctngctacn ttctctntct gcaggnatcc catcgattcg 60 

cttcctcaaa gcatggttgc tgagnaccca nagttgcgag gngttttttt actgatttag 120 

ccaggtggca atcatgagtg aatggatgaa gaaaggcccc ttagaatggc aagattacat 180 

ttacaaagag gtccgagtga cagccagtga gaagaatgag tataaaggat gggttttaac 240 

tacagaccca gtctctgcca atattgtcct tgtgaacttc cttgaagatg gcagcatgtc 300 

tgtgaccgga attatgggac atgctgtgca nactgttgaa actatgaatg aaggggacca 360 

tagagtgagg gagaagctga tgcatttgtt cacgtctgga gactgcaaag catacagccc 420 

agaggatctg gaagagagaa agaacagcct aaagaaatgg cttgagaaga accacatccc 480 

catcactgaa cagggagacg ctccaaagac tctctgtgtn gctggggtnc tgactataga 540 

cccaccatat gggtccacaa naantgcagc atctctaatg aganttattc ttgcccttng 600 

ttcaangatc ttattgaaag gacatcttac agcttttccc aatgagaang cccangaagt 660 

gttaaacata ctgnnttgaa aaaagcactn tatnttntcc cntnttaana tggtntctaa 720 

aatgtanaaa naaannaaaa naaaanctcg atccctctnn aacnct 766 

<210> 618 
<211> 762 
<212> DNA 
<213> Homo sapiens 



<400> 618 

tttnnagnnt cttcctttct aatggcttgg ctactngttc tttntgcagg atcccatcga 60 

ttcgctcagt gcagcgatca tggctcagtg cagcctcaaa ctcttgggct caagcagtgc 120 

tccaacctca gcctcctgag tagctaggac tataggcaca cagcaccatg ccccggctat 180 

ttttttattt tgtagagatg gggtctcact atgttgccca ggctagtctt gaactcctgg 240 

cctcaagcaa tcctcccacc tcggcctccc aaagtgctgg gattaaaggc gtgagccacc 300 

gtacctggcc cttggtggaa tctttagggt tttctattca tacatataaa atcatatcat 360 

tggcaaacag agataatttt acttcctcct ttccaatttg gatgccttag atttcttttc 420 

cttgcctaac tgctctgtct agaactccca gcactatgct gaatagagtg gcaagagcag 480 

gcatttgcct tgttcctaac cttagagaaa aatccttcag ccttttacca ttgaggatga 54 0 

tgtttgctgt tagtttttca taaatgatct atatcaggct tgaataaatt tctatttcta 600 

aaaanaaaaa atataacnnn ntanttnatn aantnnttaa naaaanaaaa actggnacct 660 

ntaaaactta tagtngagtc gtttnaccgt anatcccana ntttgataan gatacattgg 720 

atnanttttg gacaanccnc aactaggaat ngcnntgnaa at 762 

<210> 619 
<211> 754 
<212> DNA 
<213> Homo sapiens 



<400> 619 

tttggagntc tttctttcta atncttggct actngntctt tntgcaggat cccatcgatt 60 

cgaattcggc acgagcggac ccatcggagc gtaacctgga tctccgcagg cctggcggag 120 

gccggccacc tggaggggca ttgcttggtt cgcgtggtag cagaggagct tgagaatgtt 180 

cgcatcttac cacatacagt tctttacatg gctgattcag aaactttcat tagtctggaa 240 

gagtgtcgtg gccataagag agcaaggaaa agaactagta tggaaacagc acttgccctt 300 

gagaagctat tccccaaaca atgccaagtc cttgggattg tgaccccagg aattgtagtg 360 

actccaatgg gatcaggtag caatcgacct catgaaatag aaattggaga atctggtttt 420 
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gctttattat tccctcaaat tgaaggaatn aaaatacaac cctttcattt tattaaggat 480 

ccaaagaatt taacattaga aagacatcaa cttcactgaa gtaggtcttt tagataaccc 540 

ctgaacttcg tgtggtccct tgtctttggn tataaatgct gtaaggtggn agccantaat 600 

tntctgcaan aagtangnca gcacttttca gtgatttgaa tatcatcttg gcttngangc 660 

cangtggaca accttgtcat aactgacttc tgaaaagaac cctntngata tttgatgcct 720 

cnggtgtngg tggaactgtc atttantngg anna 754 

<210> 620 
<211> 767 
<212> DNA 
<213> Homo sapiens 

<400> 620 

gcgttctttg aaagccctnt tttgaaaggc ttgcttctaa ttacgggaaa cctttgcaac 60 

tgcagatccc atcgattcga attcggcacg aggacccagg tagaccagct caagagttca 120 

tgttctttgt natcctcctg tgagctctct gtaagtcnnt ttcttgccca tcaccacatc 18 0 

cctagtactg ggtatcagtc tggccacttg gctttctggt ttgccccaat gtggtctatt 240 

cttgatgcag ctaccaaagt aatgttttaa aaccattata ccaagttact atccttgtca 300 

aaacccccag taactgccaa tctcacttag aataaaatcc ggactcctgt gaagcacagc 360 

ataaactggc cactgcctat gcagcaacct catctttacc gnttcctgcc ttgctcactc 420 

ccttccagcg ccgttattct tcctgatgcc cctagtacac aacaactcct tcctgctcca 480 

agagtaggaa aattactggt ctctctgcca gngagaancc tcttctggna ttacctttgc 540 

ttcattgcng aatcttctnc aatatcatct tctaaaaaga gccttttaaa aatcaccttt 600 

nctatnatgc cctactcatt tccagtccct gaaanggcca ttcccacttn antannactt 660 

attgctaacn tgaaatacac taaatgnnan ccttcatgaa nggtanggca anttaaatgc 720 

nttngcactg gnnaggcnaa gagaacaagc ancntggntt canaagn 767 

<210> 621 
<211> 828 
<212> DNA 
<213> Homo sapiens 

<400> 621 

tttctaatag cttgctttct aatnctnggn aacgctnggt ctctgnagga tccctcgatt 60 

cgaattcggc ncgaggggtg acagagtgaa actcgtatct ccaancaaac aaacaaaaag 120 

tncttaaaca tatgtgaaca aaaatttngt gatggaagga ttctagttaa tgagtattgc 180 

atcaagattt acatctttct tactaaggaa aagagttaat aaaaatngnt ctttatttta 240 

caggcagnta ctgaggctct tcccanntcn cagtanacag ccactcagcc ttgaaaatgg 300 

agtgttgttg tttctaaaca tatatttatg tcatttattn aggtacagtt cacttaaata 360 

accataagtn gantctctct tgtnagtgat ttgggtagga agaggccatg tctanagttc 420 

natttctctg ttgggtccna ntgaaattgg accttttnag ttgttcanaa aaatnaanat 480 

aaattnctca tattaaatca agannctcnt caanttatag atgtggggta gggttccnng 540 

taaaacccat tatnaatcta gaaaattatc nctatngana angcntttaa tatctnttac 600 

cntgaaattc attactttag tncaaggcct acctttaaan gtttnnncnn gaaccatttt 660 

tannnnttcn ncttttgnnc caananntca ttttaancca ccaaaantcn caattnttnt 720 

tncattnnaa tannggatgn naattatnnn atcnatgtgt catatttnac canganaata 780 

ctgngctncn tgnaataatn ggtacactaa anncnngann tttnntcn 828 



<210> 622 
<211> 784 
<212> DNA 
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<213> Homo sapiens 
<400> 622 

gtctttgaaa cctttttcta atncttgctt tctaatnctt ggcnactcnn ctctcnctgc 60 

agnncccatc gattcgtttg ctttcagtgg ttggctttca ctgaaagaaa gtgtaaanaa 120 

agtcagaatt tatagctttc actatgtcca agactaggac tgggttataa agattttctt 180 

ttgtgaagga aaataaaaga aaatttgcca ctactgcatt tactttacta ttgtaaactt 240 

aagattcatt ccttagtctt tggaattttg atgtctcaaa accagatgag tggaagtgct 300 

gaatttgcaa aataaagcta agaatgctta actctgcact ttaagttcta ctctgaccaa 360 

attgaagatg agcagagcag ccctgaacag catttngttt atacagtctt gtttaagaat 420 

agaatttttt taactcttca tttnttgtct ctgtggaagc tgtgtaactc tttttaaaat 480 

gcaatttaaa acattntggt attctaacaa ttctctcaan aaacagcatt tccaatggna 540 

atnggtattg ntacgctgta ccttatgtat tncctgtacc tgaacacttg atgctgcctn 600 

acangaaaat agaactttat gttaaaaaat aaaagtctgg tncttctttg naaaaacaac 660 

nncnctnctn ctcnaaatcc ncnacannnc tnnnaatntn ctaanntnag tctnnnttnn 720 

ngcanncttn tnnncccnct nanctccctn tntcntnttc atatctanan tnacanccct 780 

ccct 784 

<210> 623 

<211> 1164 

<212> DNA 

<213> Homo sapiens 

<400> 623 
gggacttntt angccntttt 
ntctgcanga tcccatcgat 
ttctntttct ntnatnnaca 
ngctngnttn aacntngnna 
nanancatnn nanntnncca 
tnnnatacnn tctacnccaa 
ancnaancan atatcanann 
canngngacn aanaacnanc 
nntntantcc cgaataacca 
ntgcatcaaa ttantaaatt 
tcnaanctnt anaccnangn 
nnatcngcnc ntanaaatta 
nataannnta naacggngna 
anntnancat aaagtcaccc 
cagaccatat aaatntcgcn 
ntggaccaca cgncntagat 
tctatnaaat ancatcattn 
nacaantaaa nntnanatta 
tanatncnta tttannnntn 
ttgtanatnt annncnanct 

<210> 624 
<211> 798 
<212> DNA 
<213> Homo sapiens 

<400> 624 



cgaaatccnt tncttccnaa tcccttngca actntcnnct 60 

tcgaattcgg cacgnagnga gcnnattcnc gttttnagng 120 

ngngaaantt ccaggnnatc ntgnnnccnt atctgantna 180 

caccnngnct nnnaancaaa tttnanaaaa gggnancncn 240 

atctaccaaa atcanaacac ncantgaaca acacananna 300 

ancnncncat nncacgcacg ataanacanc nnnnaaaaan 360 

caaccntana cnannaatca nacnctnanc tcccncacag 420 

antgataaan cncacctnnn tannacacac ctnannancc 480 

atngccacnn ctannccnat aacanantcn ctnanccctc 540 

cncnancata aagnanatca cagcctcntt cnaccnntga 600 

nanncnntat naaacnctat ancantnnna ctnnaacntt 660 

aanatcnaan actcaatatn ncggaatant nncntctcta 720 

aanacncctc anacataann gncntacnna tcgatctatc 780 

gcatattnac cnacgnncaa cataannnaa atnctactct 840 

tccntanatc agngcnanan tacaaanacg tcgcnnnngt 900 

aaacacnnat aaacantttt tanatgtaac acatttcnna 960 

atgnanacga tnacaacaaa nnctacncna tgntactaaa 1020 

aaaaagttgc aannatncng ngaaanntcc cnanaaacan 1080 

acnncggngt nnccntaaaa anaactctnn nntnnctggn 1140 

cgcg 1164 
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ttgttaagcc tnttttcnaa ntccttcctt tnaaatcttt tgnaaacctt ggtanttgca 60 

ggnatcccat cgattcgagt aaagcatcct gcctcagaat gactttccta tcatgcttta 120 

tgtgtcattc caaggtttct tcatgagtca ttccaagttt tctagtccat accacagtgc 180 

cttgcaaaaa acaccacatg aataaagcaa taaaatttga ttgttaagat acagtagtgg 240 

accctactta ttcagtcaat taagagtaag tttttttatg tggttattaa aacagtatga 300 

acaattagtc taactctgca tagacagggt ctagattttg ttaacccaaa tgtataactg 360 

cagttagctt aaattacaat ttgaagtctt gtggnttnta tatagctngg cactttatta 420 

ctcttttgaa ctgaaagcac actcccttat aggttcatgt aactgtcctg taataaggtg 480 

cttataaatg ggaacaacta cacagcctag ttttgncaca acctttagca tctaaaaaag 540 

ttttaaaagc ttcttaaatg nctaatataa anggagatgc tnatanccac aacatctatt 600 

ttaccaatat tngtttcctt acacttacct tgggantttg cattgagtga ngttttngta 660 

aaccccaaan atncccatta atanaaaaaa nttggtacgt tttnatgact ttaatccann 720 

ttncttgtng gnnttcncct aaaangcttn ccnnnggnnt ggaantnnna ntnatttntg 780 

gggnaaggtt tnngttnt 798 



<210> 625 
<211> 793 
<212> DNA 
<213> Homo sapiens 



<400> 625 

ttcttaagcc ncttttctaa tgcttgcttt naaatctttt gnaancgctc ggctntntgc 60 

aggatcccat ccgattcgaa ttcggcacga ggaaatgcct ctatgtangt gaagtgttct 120 

ctctgcatgc aacagtaaaa attaatataa tattttcccc acaaaagaaa cacttaacag 180 

aggcaagtgc aatttataaa tttatatcta aaggggaatc atgattataa gtccttcagc 240 

ccttggactc taaattgagg ggattaaaaa gaatttaaaa taattttgaa cgaatttatt 300 

ttcccctcag tttttgaggg cattaaaaag gcattaaatc aagacaaatc atgtgcttga 360 

gaaaaataaa attaatgaaa acacagcact tatgttggtt tagctgcagc ctccttggag 420 

gtagaattta tttatttaaa attactggtt gcatcaagaa ccccataggg tgtacaaaag 480 

gttctataaa atctgcatta tagagacaaa gangcaggca aatncatgtc acaagggtna 540 

agcttacagt ttacaaactg gggaacgccc agggtgtang atttnaaaaa cgncactctt 600 

gagaaaacan atgtaatcan ggntgctgaa aactttgcat ggnggctttn aagacattta 660 

gnccttgttc aaaccaaaat ttnttggnat ttgccagatt ccttantntt gccatgggcc 720 

atgacaccat ttttggcctt tatgncnctt taaaattttn aattaaaaat accntttcca 780 

gtaannctaa ttn 793 



<210> 626 
<211> 825 
<212> DNA 
<213> Homo sapiens 



<400> 626 

ntttgaatnc ctttgnaaat ccttntttct aatntntgga tccttggcna ctcgctntnt 60 

ctgnangatc ccatcgattc gaaacggcnc taggaatcat cgaaggttga gaccgtgacn 120 

anttacatag tgatnaatac ccatctatgt actgnngcct nctaaatgtn tntctncnnn 180 

atggannttn cctttaanct ctagatccat tgacancctg ancatntcta aaaggcatta 240 

ngaaactgaa cacatctgat acagaactct gcattnnctt ccnaantntg cccannccna 300 

gcctgntcct nnttcacgct tancacttat natatgatcc cactattcac tnantctctg 360 

aagcttaaaa cctangattc atgcttgact actgnataat nntacaatct actcctaatg 420 

cattagcaat tcttgctagc tctaccttca aaatatattc tgaatagact atntcttgcc 480 

gnttcccttg cctncncatt tcccatctgc accccttctc tnctncccaa aatcaataca 540 
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ctagntgttt ctaaaaaaaa tatnganann tagnnnaaaa ncntaaataa atntaaaana 600 

angnntancn tnacanaana ttnctaatat aggimanntn ntgncaanaa cnntaantnt 660 

tnaatacgnn aaaactctct cnaanngann aanntatnnn agttaaaagn naaatannnn 720 

aanantncca aatntanaag ataangncat aannntatna gncnnaacgc taantgnnga 780 

tganntntaa tnngnatana nnantngtta nnacaaaatn tacnn 825 



<210> 627 

<211> 772 

<212> DNA 

<213> Homo sapiens 



<400> 627 

tttttaatgc ttngtcgnac ttctcccagn aatcgnttng aaactcngcn actcgttctc 60 

tctgcangat cccatcgatt cggaaatttg cactgatggc tcanaaggct tacgttttgg 120 

agagtatgac ctacctcaca gnagggatgc tggaccaacc tggctttccc gactgctcca 180 

tcgaggcagc catggtgaag gtgttcanct ccgaggccgn ctgncagtgt gtgagtgagg 240 

cnctgcagat cctcgggggc tngggctaca caagggacta tccgtacgag cgcatactgc 300 

gtgacacccg catcctactc atcttcnagg gaaccaatga gattctccgg atgtacatcg 360 

ncctgacggg tctgcagcat gccggccgca tcctgactac caggatccat gagcttaaac 42 0 

aggccaaagt gagcacagtc atggataccg ttggccggag gcttcgggac tncctgggcc 480 

naactgtgga cctggggctg acaggcaacc atngagttgt gcaccccagt cttgcngaca 54 0 

gtgccaacaa atttgaggag aacacctact gctttanctc ngaccgtgag acacttgctg 600 

ntnccntttg gcaaagacca tcatgganga ncanntnggt nctnaancng nntggccaac 660 

atnctcatca acctgtattg gcatgnaccg cncttgctgn acnncngngc caaancnctc 720 

nantccgcca ttggggcttc cggnaaccac tnnacaccaa ggttctnttg gc 772 



<210> 628 
<211> 808 
<212> DNA 
<213> Homo 



sapiens 



<400> 628 

tcnctcgnaa cntttnannc ttggctactc gntctctctg caggatccca tcgattcgaa 60 

ttcggcacga gatgacatcc tcattatcca cantgcaaag ccaaccatcc ctatgatggg 120 

ttcattgtgg atcatgactt antgggtcaa gagtttggaa gtggctcagc tgggcggnct 180 

tctgctncat gtggctgcca natggtnccc tgctggtnng cagnctngtc tagagggtcc 24 0 

atgatggctt tactcacatg cctggcatct tgacagggac agctggnang caaagnnnat 300 

ctgggactgt ncacagagct ncttcntgtg gcctttccag catggtggtc taagggtagc 360 

tggacttnct gcatnacagc tcagggctcc cagagctact gtcccaagag atnnaaagtg 420 

gnaactgnca atcttttang ctaangncca gaaaccatta cccctgcacc ncacagtctt 480 

tttntanctg ntgaaataaa cattnnnttt atcaattnta ancattcgca aattggaatt 540 

aaataccttt tactaatttt gncgtgacca tctgcccctn gttcaagatc taaaaaactt 600 

ttatngntca tcntgnngat ntaaaaaact nttgtgttng catttanaac ccntaagcan 660 

nttnggcant tanannnaan annttnnnaa acccttntat anaaccttat taagttgang 720 

catnngnant ttcnccttna aatccnaggt ccttagggct angnnatacc nttcntatng 780 

naactttngg gaacctaaan cctctcct 808 



<210> 629 
<211> 827 
<212> DNA 

<213> Homo sapiens 
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<400> 629 



ggccnncttt 
gattcgctgt 
aagcatnaaa 
acannctanc 
nagactncan 
ctaangagtg 
aggcaanagc 
attannttnc 
ctniuuitcnn 
ttnanntncc 
tctacngggn 
naccaattnt 



tcgatttann 
cctcntggng 



gaaccttntt 
gatccaaggc 
taangcnaaa 
tgnntnctaa 
gcnttgcttg 
gctacnncct 
gggtnnaant 
attnntntna 
anntnnctga 
tnncgntccn 
gnnacnttgg 
nnaanntcta 
ngncnngntt 
cntnttnnaa 



caaatcnttt ggcactcgcc nctctctgnt ngntcccatc 
atgaaaagag tgcaaggtaa ncangnggca gcnttnatng 
gcnnatgctn anctnangga gcangnngct aaagacaacc 
tcatgctntg cttnctnang tgancttata gnaacgcant 
gcncaacaag gnnacctana ntcatnanga agcnnttgaa 
ttnntnctca tgcntgacct gtaatnattc ttctganttg 
natngntnan ntgnaaanac tntnnnatcc gnnnntnctg 
atgatanann ctcatcnngc tcgncctgna ctttganang 
ctttaggagn nnacctncag cganatgtna agnanngaaa 
ccttgcngac tnanngtctt gngnaacntn angtanntan 
nnaattgggg ncttataaan tnttcntnna agaatgantg 
agnttgggga aatctnngtt tcctgnatnn gnacaaaaan 
nnttgggcnt catntgccat tgatgcnatt cnacttatgt 
nnnggnngnn aacatttttt gtgtgcc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
827 



<210> 630 
<211> 793 
<212> DNA 
<213> Homo sapiens 

<400> 630 

ttcnaatgct tggncnngag tccncctttg aacnttttca aatnncttgg caactcgcnc 60 

tctctgcatg atcccatcga ttcgaattcg gcacgaggcg ngttgttcta cactgcnntc 120 

ngaagntttn ntaanaagcc accacttagc ngaggcnnct acangtcttg gggncttagc 180 

gaagagaaat cncgctggca cttgncccgt tcacntaagn actnntgnct gantccnagg 240 

gtannngtnc accttgngnn ccancagaca nacccaannt gncntaaaan gggcaggtct 300 

aagcttacnc tngactncac nggcaagctg nangcctgtn ctgccttccn ctgcnntnac 360 

aaatngacag tnngaccaag agtcanagna aaaactncaa ggatacatnt atcccantct 420 

nttctacacc tntanattcc ntganctatt gctcanaccn atcgtgcggg caaaggcaag 480 

acttgggcaa cattnttnaa tacaatgatg ctgacaanta atttccngct ngttgccagg 540 

natntttacn cgagctnttg tgattccaaa ctaaagaatg gngccnnnan gcccntcntt 600 

antnctggnc ccccanaang ancctaactn gcgaaaggnn agnatggcat tnacccaaac 660 

caacttntng gattacnnca actccanaan atccgacggc atnnaanang caaaacaaca 720 

acttcncnan natnnaanna atngnnccnn aaananaacc cgngcntctn aaacnattgt 780 

ggacccatnc ccc 793 



<210> 631 
<211> 752 
<212> DNA 



<213> Homo sapiens 



<400> 631 



gnagtnncct tngancctct ntnaaatcgc tttngcnant cgcctctttc tgntngatcc 
catcgattcg aattcggcac gagatgttac agacatgaaa tatgaacaga atnctaaaag 
aacataaaag aataagagct ccttaaagat tataaataaa tggtgatgtt aaagtaatag 
caccattgga cgaagctagg gaatcaacac ttgacagaaa gatacatatt ttttttatac 
aaactacata tatttgagca atcaagtagt agacatagag aattttcttt ttatggaagt 
actctaataa gtaaagggct gatagaatta tatcagcatt ttctagctcc tggtgaatta 
tgcattgggc atccatggct gccttagatc acaaaaatac caccagatat atgcctgtgg 
atgaaagatc acaccaccac ctgtgaaata gtcttcccca caaaaaatcc aacccaaatc 



60 
120 
180 
240 
300 
360 
420 
480 
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ctatccagcc tgtagatggt actcgagatc ttctataaga aataaagaga gcangctggt 

cacggtggat tgtgcctgta atcccagcac tntgggaggc caangcaggt ggatcgcctg 

angtaaagaa gttcnagacc agcctgccaa catggtgaaa cccccctctn tacttaaaag 

taccnaggat gagcccggcc gttgtggcaa gcacctgtgg tccccagcta cttgggaagc 

tgagcangaa aaatcgcttg aanctgggga ng 



54 0 
600 
660 
720 
752 



<210> 632 
<211> 751 
<212> DNA 
<213> Homo sapiens 

<400> 632 

gnnnnnnttn nnnnnttcta atgcttggct actcgttctt tntgcaggat cccatcgatt 60 

cgcaactaga gaagattgga cagcaggtcg acagagaacc tggagatgta gctactccac 120 

cacggaagag aaagaagata gtggttgaag ccccagcaaa ggaaatggag aaggtagagg 180 

agatgccaca taaaccacag aaagatgaag atctgacaca ggattatgaa gaatggaaaa 24 0 

gaaaaatttt ggaaaatgct gccagtgctc aaaaggctac agcagagtga tttcagcttc 300 

caaactggta tacattccaa actgatagta cattgccatc tccaggaaga cttgacggct 360 

ttgggatttt gtttaaactt ttataataag gatcctaaga ctgttgcctt taaatagcaa 420 

agcagcctac ctggaggcta agtctgggca gtgggctggc ccctggtgtg agcattagac 480 

cagccacagt gcctgattgg tatagcctta tgtgctttcc tacaaaatgg aattggaggc 540 

cgggcgcant ggctcacgcc tgtaatccca gcactttggg aggccaaggt gggtggatca 600 

cctgaggtca aggagctcga gaccagcctg gccaacatgg tgaaacccca ttcttttctt 660 

aaaaatacca aaaaatttag cccangtgtt gaatggntgc atgcctgtaa ttcccagctt 720 

ctnanntagg ctnanacaag gagcttncnt t 751 

<210> 633 
<211> 806 
<212> DNA 
<213> Homo sapiens 

<400> 633 

ttnnannncn ttttnaaaag gcctnnnntt gannctttcn aatgcttggc tactngntct 60 

ttctgcanga tcccatcgat tcgaattcgg ctntagggaa ggggagggtt ggtgagtccc 120 

agaccttaaa aatacaaggt taagagggac cccaaagcaa aaaattccaa cccttttcct 180 

cccagtcatt gaaacaccaa aactattata ccggagggtg taatagtttt gctgcccagt 240 

tgtggtaggc cagtagtggc ctcccaagat gcccatgtcc taatcccagg aacctgtcaa 300 

aattaccttg tatggccaaa ggggctttgc agatgtaatg aagttaagga tctttcgcca 360 

ggaagattat cccagcttgt cangagggct tgatgtcctc acccgggtct gtataacaga 420 

agagcaggtg acgggagagg aggttggagg tgtancgatg gacangaaac tggagttata 480 

ggagggcagc tnaagccaca gaatccaggc cancttanga gcccaggaaa atgcatttct 540 

ttccacaaaa gcccttggaa ggccccaanc cctgcttccc acccttggac tnggcttcaa 600 

tgaggcttaa tttttataaa ttcntggctt gattttagaa ctcntaaggg gaaataaatt 660 

ttgtgttngn tttaantcan aaaataaatn aattaaaaaa aacttgaanc ctttanaaac 720 

tntantggaa ttcntattan cttaaanccn aancttggat taaaggatnc atttgtttna 780 

anttttggga cnaaccccca anttnt 806 

<210> 634 
<211> 775 
<212> DNA 

<213> Homo sapiens 
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<400> 634 

ngggacttcg cctnacgaac cgctnggaaa tcccntntnt gnaggatccc atcgattcga 60 

attcggcacg agtataaact ttattttatt ctcttctggt tttgtgttac atgacaagaa 120 

attgaattaa nncaatanaa ttttagttcg ggttgcttag gtttttactg ctcccattct 180 

tgcttttact aatttatcca agattagatg tgattactat ttaataataa tttagtcctc 240 

acacttacaa accacttaca ataccagcat gcttctatca ctgtaattct attcaattct 300 

caggcccatg aggcatgcca gccagacgac cagacagcat ttatagagtg ggcactcaat 360 

accagccaca aaagatcctg tgtcagaagg ggaaacaggc ttggaggctt ggagtatgtc 420 

gtgatagcct ccctccagtc cacacaactg gtactgctgg ggctgaaact agaactcang 480 

cctatgcctc tcaagctcaa gggtcggatg tccatgtnct tcgcctctag aactatannn 540 

gagtcgnaat tacgt agate caagacatgg gtaagataca tnggatgagt tnggaccaac 600 

ccaccaacct aagaatgean tggaaaaaaa tgcttaattt ggtgaaaaat ttgtgatggc 660 

tattnngctt aaatttngnn aaccatttna taaagnctng cnantaaaan aaaggtttaa 720 

ccaacccaac caattggcaa ttccatttca anggtttcaa gggtccaang ggggg 775 

<210> 635 
<211> 784 
<212> DNA 
<213> Homo sapiens 



<400> 635 

ttgagngtcc tnetttnace ctttcnaatn gettggenac tegctctntn tgnaggcatc 60 

ccatcgattc gaatteggea cgagatatag ctctggaggt caggacatag gagatattga 120 

ttcaggactt gecagagtat ggtcttgggg tgtgccctga tattacaaac agggatctta 180 

gtggctaggt gatgaggeca tggcaaatgt agatggacca agatcaattt gectttctag 240 

atgaggtttt ctaggtgaaa tgtttttgaa actattttgt agectagtat aatttataaa 300 

agtagagaga aactataaat ataaatttgg aangggttag ctaaaaggag aaaacagcan 360 

aatcttcata tatatanaaa tggatattaa tttgctagaa ttaanagact gcaggtaaag 420 

atagnttttt ttaatacctc tttttgctgt anaaaggaca ggattaaatg atnaagggat 480 

gctggaatga ggaatggtaa ctttaggcaa gatagtcttc tgngaegget gatatgaaca 540 

atngagagta anacatttnn aatacaanaa attgtcctgc tgctcaccca teaagecttt 600 

tcangtttct tcccttgcca aaantngtaa naacttntgg tacttttnna ncttgtatnn 660 

ttccngttna ttggttanaa ccccttcgat naanaanncc atantttnaa tttgggnttg .720 

acccccnagg ttaaaanttn ccntttnctc aatttcccct tttcaaagnt ttaacntaat 780 

taan 784 



<210> 636 
<211> 765 
<212> DNA 

<213> Homo sapiens 



<400> 636 

ttnnannctt tenaatnett ggcnactcgt tetttctgea ggatcccatc gattegtect 60 

gegcaggage cgcagggccg taggcageca tggcgcccag ccggaatggc atggtcttga 120 

agccccactt ccacaaggac tggcagegge gcgtggccac gtggttcaac cagccggccc 180 

ggaagatccg cagaegtaag gcccggcaag ccaaggcgcg ccgcatcgct ccgcgccccg 240 

cgtcgggtcc catccggccc ategtgeget gccccacggt tcggtaccac acgaaggtgc 300 

gcgccggccg cggcttcagc ctggaggagc tcagggtggc cggcattcac aagaaggtgg 360 

cccggaccat eggcatttet gtggatccga ggaggcngga acaagtccac ggagtccctg 420 

caggccaacg tgeagngget tgaaggagta ccgctccaaa ctcatcctct tcccaggaag 4 80 

ccctcngccc ccaagaaggg aagacaagtt ettgetgaan gaacttgaaa cttggcccac 540 



189 



WO 99/33982 



PCT/US98/27610 



ccaactgaac cgggacccgg tcatgcccgt tccnggaaan gtctattata aaggagaaag 
cttcgagtca tcanttgang gaanaagaag aatttcaaaa gccttcgctt atnttcngta 
ttngcccgtg ccaaacnccc cngctttttn ggcttaccgg ccaaaaagaa gccaanggan 
gcccnnanaa cagggatntt gaaaaagaaa naatnaaacc ctcnn 



600 
660 
720 
765 



<210> 637 
<211> 853 
<212> DNA 
<213> Homo sapiens 

<400> 637 

ttttggancc nttctttgan nctttctaat gctgggntac tcgntctctc tgcaggntcc 60 

catcgattcg aattcggcnc gaggatcagc ccacctcggc ctcncaaagt gctgggatta 12 0 

caggcgtgag ccaccttgcc cagcccacat catacagttt gaaatgaaac tttgccacaa 180 

ccagcctttg ctgtagcaca cacatatatc actgaacctg tttgaaataa agtttttttt 240 

ctttntcctc tggtattctg ggttctgaag tctggtattc tggtattctg ggttcaaaag 300 

tatgacttga gagtgttgct ctggtattct gagagttgct ctgtattctg ggttctgaag 360 

attatttgaa aaataactcc tactacattg aaatgcagac ttaaaaattt aaacattgga 420 

ttangcagtc aaaaaaacca agcaagcata aaaggtcaat aagttgtaat cttgatagta 480 

aaggtggaaa acttattata aatggnaang aaagttttat ttcctttttt gtttgaatgg 540 

gcaagtatgc catattatac ccaaaagttc ttttaaaaaa atatttccca ttcaacccat 600 

ttttaattna aaattaaaac cattttgnaa gggaaanttt acccaanggc aanccttttt 660 

tttcctccaa aaaggttnac cntgttnatc cttctttttn ggnaaattta nccaccaatt 720 

tttttaaagg ngggncaatg gggnttaaaa ntanccctgn aagnnatttt ttnanccttc 780 

caggtttaaa antccccttg gatngggtct taacctgggn gggtngnata naaaaaaata 840 

natcctnttt anc 853 

<210> 638 
<211> 740 
<212> DNA 
<213> Homo sapiens 

<400> 638 

anttgntctt tntgcaggat cccatcgatt cgcagcaaag actttatttt tgtacagaag 60 

atggtgaagt ccaagacggt ggctcagtgc gtggagtact actacacgtg gaaaaagatc 120 

atgcggctgg ggcggaaaca ccggacacgc ctggcagaaa tcatcgacga ttgtgtgaca 180 

agtgaagaag aagaagagtt agaggaggag gaggaggagg acccggaaga agataggaaa 240 

tccacaaaag aagaagggag tgaggtgccg aagtccccgg agccaccacc cgtccccgtc 300 

ctggctccca cggaggggcc gcccctgcag gccctgggcc agccctcagg ctccttcatc 360 

tgtgaaatgc ccaactgtgg ggctgtgttc agctcccgac aggcactgaa tggccatgcc 420 

cgcatccacg ggggcaccaa ccaggtgacc aaggcccgag gtgccatccc ctctgggaag 480 

cagaagcctg gtggcaccca gagtgggtac tgttcggtaa agagctcacc ctctcacagc 540 

accaccagcg gcgagacaga ccccaccacc atcttccctg caaggagtgt ggcaaagtct 600 

tcttcaagat caaaagccga aatgcacaca tgaaaactta cangcagcan gaggaacaac 660 

agangcaaaa aggcttaaaa aggcggtttt tcagctgaaa tggcaccnnc aattganagg 720 

actacngggc cccgtggggg 740 

<210> 639 
<211> 774 
<212> DNA 
<213> Homo sapiens 
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<400> 639 

ttttnnctnt taatcaatcc tttgttgact ccttggctac ttgttctttt tgcaggatcc 60 

catcgatncn aattcggcac gangtgatgn cagattgnna ntncactaaa ctgggcannn 12 0 

catcaggatc acctgtgggc cttcannaat cananatnca cccccaggcc atgccctnga 180 

cccagtgcac caggacaaga aatccacccc aggcctctcc cnagacccac tgnaccagna 240 

caagaaatcc acccccangc cangccccnt acncactgcc ctangatntn nnggtgtnaa 300 

ccnggtggtg ctttgtaaag acgtgcangt ggtaacccca cgccgncncn ctcnnnacnt 360 

tggacacatg atcatccacg tgtctgtgat ttgnttcctc ggnttnnttt gtgaatngaa 420 

aataantgtn ncgtttgact agggtttaag agcagcaggc agnccctcag ctcagcaagc 480 

ngccctctca gctcagcang cagcccaagt ctcctgtang acttctatgg accatnctgg 540 

cgggaatgaa gaaactggtc aagctggatt cgggactgaa agtgtaccnt ggtgacaccg 600 

tatgactnan ctgactnana aagatcactn atctttccac acttgnnggg naggagccnn 660 

tannangttc aatatgcnnt ggtngantcc catngctaca atttcatgga cacantttga 720 

ttacttnnga taannnaggc ccttggaggc cccttntccc cttttaacng gaat 774 

<210> 640 
<211> 743 
<212> DNA 
<213> Homo sapiens 

<400> 640 

ctnnncctcc ttgatccntt cctnctttga anncatnngc tacttgttct ttttgcagga 60 

tcccatcgat tcgaattcgg cacgaggctg acctacatca gaagctgctg gatgcagnaa 12 0 

agtgaaaaca gaccaaaaca acacngggcg aatcttnaca ccattntggg tgccnnatnt 180 

nnccnnngat atttgcttgc tnagctctac tcctccaaga nannangnnt caaacnctnc 240 

agcangntag agcanntnaa gaccgcntnt nctnacctnc tnaagannct ctgngaggan 300 

cgcaatcctt tngtggaana tagaatcaac agaccacact gcnctctgga ccatgngctc 360 

tcaaangngc tagaaggtgc tgaccttttn agactcttgc agaagaggcg angtggtgng 420 

anaccctnna ggaanacttt cccgaactag accncrmctt ncngaacnng ntcaactgtt 480 

ggggnngaaa ncntgtgann tgtngncctt cngagagacg gcatattcta tgatggcnga 540 

cttnatnctt ctgcggaacc anactngacn tactgaaaga aanctganac caagcgtctt 600 

ccttaaggac ccttatatcc agacnatcct ttggataata ccnctnggcc aaaacctnnt 660 

aactntgcat acaatcngga tggcaacatt tgaactggng gccttnanna ccnttaccgg 720 

cttttcncat tatgnaagag ntn 743 

<210> 641 
<211> 740 
<212> DNA 
<213> Homo sapiens 

<400> 641 

ctttcctttg antcttcttc tannaaacgt tngaacgaan tcngcacgag accactaaca 60 

gcatctactt gactactgat actttgatca tggagtttgg gcatgccact tgatagaaat 12 0 

ttgaagagca attatatttt tcaaaaagag ttttgaataa tgttaagata gattgcaaca 180 

tgactatcaa ttcttccctt cccatcaaag gagagagtcc gtttatccag cctttgaatc 240 

ttgattattc aagtgacttg cttcacccaa tgtaacatta ataagcacaa tacaagcaga 300 

ggcttgccaa gaacttggtt tgtttctaat gcttagaaga agaatggtgt atgccatatt 360 

tctgcattta gaactcacgt ggagacatgt gtggcccaat tgctcctctt tcatctcagg 420 

caataaccag acacgggact gaggccatcc atgaccagcc agccctagtc aacacacaac 480 

acacaagctg atcacagatg catgagtaag cctaactgag accagccaag accagcctag 540 

aatagaactg ctcagcagca ataaaaacta aataaattgt taccttaagc tacttttaga 600 
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gctatttgga agtgtatttt tgtgcagcta acatttacta tcagataaaa tggtgattgn 660 

ttatctctgn tttaatgatg ntttaaggaa atggttctat taaaaggaaa tatctggggc 72 0 

tttgtcaccg ttaaaaaaat 740 

<210> 642 
<211> 737 
<212> DNA 

<213> Homo sapiens 
<400> 642 

tancctttga nnctttctcn ncntgncntn nnngnaacga cctcggcacg aggacacccc 60 

agatgcagcc accaccagca gaagcgatca nctgacccca caaggttttc gtggctgtgg 120 

ccgtgggctc aggtggcagc tatggagccg aggatgaggt ggaggaggag agtgacaagg 180 

ccgcgctcct gcaggagcag cagcagcagc agcagccggg attctggacc ttcagctact 24 0 

atcagagctt ctttgacgtg gacacctcac aggtcctgga ccggatcaaa ggctcactgc 300 

tgccccggcc tggccacaac tttgtgcggc accatctgcg gaatcggccg gatctgtatg 360 

gccccttctg gatctgtgcc acgttggcct ttgtcctggc cgtcactggc aacctgacgc 420 

tggtgctggc ccagaggagg gacccctcca tccactacag cccccagttc cacaaggtga 48 0 

ccgtggcagg catcagcatc tactgctatg cgtggctggt gcccctggcc ctgtggggct 540 

tctgcggtgg cgcaagggtg ttcaggagcg catggggccc tacaccttcc tggagactgt 600 

gtgcatctac ngntacttcc tctttgcttc atccccatgg tggtcctgtg gctcatccct 660 

gtgccttggc ttgaatggct ttttggggcc tggncctggg ctgttaaacc gccgggctgg 720 

natttaacct ntnggcn 737 

<210> 643 
<211> 748 
<212> DNA 
<213> Homo sapiens 

<400> 643 

cttttaaccn tttganccnt ccctcnaaac cttngatncg anttcggcac gaggaaggca 60 

gaagtgtaaa tgaacataca ntttaaggag aaagcctgct gtgtt timet tgttcagcag 120 

ggtattatga attagcacaa gtattgcttg etatgeatge taatgttgaa gatcgaggga 180 

ataaaggaga cataactccc ctgatggcag cttccagtgg aggttactta gatattgtga 240 

aattattact tcttcatgat gctgatgtca actcccagtc tgcaacagga aacactgcgc 300 

taacttatgc atgtgctgga ggatttgtat gacattgtta aagtgctcct taatgaaggt 360 

gcaaatatag aagatcataa tgaaaatgga catactccct taatggaagc agecagngea 420 

ggtcatgtgg aagttgcaag agttctttta gatcatggng caagcatcan cactcattct 480 

aatgaattca aagaaangtg ctctaacact ngcttgetae aaangecatt tggatatggg 540 

gegctttcta cntgaagctg gtgeagatea agagencaaa acagatgana tgcacactgc 600 

cttaatggan gectgeatgg atnggacatg tanaggtggc aegtttgett tttggatant 660 

nggtgctcan gtgaacatgc ctgeataate atnttgaatc tccattgacg ctagctgcct 720 

gtgganggac atgttgaaat tgengect 748 

<210> 644 
<211> 759 
<212> DNA 
<213> Homo sapiens 

<400> 644 

tennncnett ttcgatcttt tgagncttgc ctttgaaccc cttggntacg anttcggcac 60 



192 



WO 99/33982 



PCT/US98/27610 



gagggaacca tgananccna gagctagaat tgctattgga tnncgtctat tctctntttg 120 

cttattgggn cgngntncgt ggttnctggc ctcangggtn nncccgaang anggggtatc 180 

tnngagcnan ttntgcnntt tacnggctag cttgntgggg gcttaanntg ccactnttan 240 

acatgctnta ctantcantg agannntncn ntcgaccatn tannacnatn ctgtgnnntc 300 

cngtacnctn tggccgnatg gagctattag cttcaanatg nntcgnantg ttacatgcan 360 

ncactgannt nactatccan natntaagtn ctcttngctt actgtgaaca nnngctactn 420 

ncttggatat tatagnaagg ntcnttgata cncgatnatc ntncntgtca gatcnataaa 480 

tancanctat accnactgtn naaatnccat ctggnggnct tncnatccan acataattgc 540 

attannncgt cnaattgnga tanagtnttg aaagantctn ggtttagacn ttggatgttg 600 

caatgnttgt gncttanaan ttatgtgctg gctactgant aanctggggg catgacntta 660 

ctggnttgac ctaagnggng aantcnatgg tccgattgct ggnccctanc cttaagnttt 72 0 

gccatgaata ggncttttgc cctaaaataa naccccttt 759 



<210> 645 

<211> 766 

<212> DNA 

<213> Homo sapiens 



<400> 645 

tnnnnnnntt tcaatntttn ancgtccctt aggatccntc gattcgatcc agatgggata 60 

cctctaaaca cgaaaagaaa gaagattcca ttantgaatt tttaagtttg gtttnatcaa 12 0 

aagccgagcc acctangcaa cagtccaccc ccttagtaaa caaagaggaa nagcatgcac 180 

cagaatcatc cgcaaatnag acagtcaaca aagatgtgga cgcacaggct gaangagaag 240 

gganccgcca tccatggact tattcatggc catctttgcc agttcctcat atgaaaagtc 300 

ctnatcctgc gangatganc acggtgacag tnaanatgat caggcacgct ctggngagga 360 

caacttccaa agctggnaag acactgactt ggnggaaaca tcatctgtgg ctcacgctnt 420 

tgtgccagng ccctaggagc cgtcaccttc cttcccgata caaangatgc agatagatna 480 

naganaagag ntcggccngn ngctgcctcc cgtcttatgt nccaatgctc gtcagacact 540 

tgaagttnct canaaagaga aacattccaa gaacaaagac nagcacaang gcaatanaga 600 

acacaggccn gaaagaattg anangaaatt ggaaacactn gaagcacnaa acacctaang 660 

naatccaaaa naattggcaa accaggggaa aagtaggtnc ctncgngaag tttcgacagc 720 

cngcggacaa gccanaattg acnatgaaac cgcatacgtg tcttnc 766 

<210> 646 
<211> 752 
<212> DNA 
<213> Homo sapiens 



<400> 646 

ttnnnnnntt tttatcctnt natncttnct ctttggatcc atcgattcgc tccaaggaaa 60 

atccacctcg cagcttgtaa atctacagcc tgattacatc aaccccagag ccgtgcagct 12 0 

gggctccctt ctcgtccgcg gcctcaccac tctggtttta gtcaacagcg catgtggctt 180 

cccctggaag acgagtgatt tcatgccctg gaatgtattt gacgggaagc tttttcatca 240 

gaagtacttg caatctgaaa agggttatgc tgtggaggtt cttttagaac aaaatagatc 300 

tcggctcacc aaattccaca acctgaaggc agtcgtctgc aaggcctgca tgaaggagaa 360 

cagacgcatc actggccgag cccactgggg ctcacaccac gcagggaggt ggggaagaca 420 

gggctccagc taccacagga cgggctctgg gtatagccgt tccagtcagg gacagccgtg 480 

gagagaccag ggaccaggaa gcagacagta tgagcatgac cagtggagaa ggtactagtc 540 

aaccttcaga aagagtatgg agagaaaaag aggcacacct ggacgcagag ccctgccagc 600 

gccctctctg ctgttgcagc tgcaaggaga ccatgcctgt gggagccagg cctcgcttgc 660 

atgaanaagg aacgatgcct ttttcaatgg tgtcttcctt ccattgtgca naanaacctt 720 
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ttggtggctt ctcttccgac ttgtgcctga tt 

<210> 647 
<211> 743 
<212> DNA 
<213> Homo sapiens 



752 



<400> 647 

ttaatccttt caattcgttc ntctttggat ccatcgattc gaattcggca cgagccctcc 60 

ccggcttccc ccggagtggg tcaccacact gttttttatc atcatgggaa tcatttcatt 120 

gactgtcaca tgtggtttgc tggtggcttc ccactggcga agagaagcta caaaatatgc 180 

tcgatggata gcattcactg gaaccactat gagaagatta taggaaaaac accaagacta 24 0 

gaggactctg ggttcctttt atgcaaagtc aactcttctg ggtcacagtt acccagcaac 300 

aaaaataaag agaggaccag gacgatgcca gcaccccgtt tatcctgagt gaactctccg 360 

gaggcctctt caagcttgtg ggttctctgc tgtcttgaag ccatccatcc atttgatagg 420 

ttttgcaaag acttggtcct gccaagatgg ttttaatcat ttctgctaaa aggaatggac 480 

tcgaggattt gatctcattt tagatgcagt tgtcctcact tggccatttt acagcacttt 540 

agtaaatatg gccagtgtat ttggtcacta ttaaatcaat ccccattcat tatctgtcan 600 

ggcaactcag tgaactaaat actatgttct gacctctggc actctttctc atgttggtta 660 

aatatttaat attgnctaag gcaattcaag tatttttctt aaataaaaaa tatgaaaact 720 

caaaaaaaaa aaaaaaaaan ana 743 

<210> 648 
<211> 759 
<212> DNA 
<213> Homo sapiens 

<400> 648 

ttttaatccc tttcatttcn ttccttngta ggatcccatc gattcgtttt tttttttttt 60 

ggtgattgga ttaacaattt tattctgnnt ccactacaaa ngggctggtg ttttgttcca 120 

aatgtttagc tgggagggct gtagggaccc ctgttacccc cattaaacac agtaaagcat 180 

ggatccagtc agccccctgc tggcaggtgt gggcctggca actacacaga tccaacccca 240 

ccctcctggg tgcggccaga ggccaaggca gtcgcccgag ctcctgaatc ccaagaatgg 300 

ttctggcaag tactgctgtt tgtttgtagg ggcaaagagt taaaataaaa cgaggttctg 360 

ccatggctaa gccttgtgga aaccagaccc caaagcccct gccatgccan gggtctcaac 420 

nccagacgct tgttatggag gcaccancng gtantggccc ctgtaagcan ggccagagtc 480 

gggacaaaga gcaagantga aacanccaag agacanagga ccatgctgga ccattgggca 540 

cncangaacc tgcctgggaa aaaccggggg gcaangctgg catgggaatg aacacctgct 600 

tgntgacacc tatntgagct tcanttncct taacttgaaa aattgaacan gcccggtncg 660 

gtggctcata ccctgtaatc ccancacttt tgggangctt tangccgntt ggatcattga 72 0 

ngttaggaag attaaagaac cancctgggc cnacattgg 759 

<210> 649 
<211> 746 
<212> DNA 
<213> Homo sapiens 

<400> 649 

tnancctttg aatccttgaa ngnngatccc tcgattcgcc ggaacctcat ccagtgccac 60 

ccatcttgac accttctccc tcttcagctt ttccaacagt cactactgtg tggcaggaca 120 

atgatagata ccatccaaag ccagtgttgc atatggtttc atcagaacaa cattcagcag 180 
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acctcaacag aaactatagt aaatcaacag aacttccagg gaaaaatgaa tcaacaattg 24 0 

aacagataga taaaaaattg gaacgaaatt taagttttga gattaagaag gtccctctcc 300 

aagagggacc aaaaagtttt gatgggaaca cacttttgaa taggggacat gcaattaaaa 360 

ttaaatctgc ttcaccttgt atagctgata aaatctctaa gccacaggaa ttaagttcag 420 

atctaaatgt cggtgatact tcccagaatt cttgtgtgga ctgcagtgta acacaatcaa 480 

acaaagtttc agttactcca ccagaagaat cccagaattc agacacacct tcaaggccag 540 

accgcttgcc tcttgatgag aaaggacatg taacgtggca tttcatggac ctgaaaatcc 600 

atacccatac ctgatttatc tgaangcaat tcctcagatt tcaactatca aaactaggga 660 

aaactgngag tttaacacca agtnctacaa cacaaggttg gaaacacctg aacttggngg 720 

atcatgatac cacttnacca ctccnt 746 



<210> 650 
<211> 789 
<212> DNA 
<213> Homo sapiens 



<400> 650 

tgaccctttt gaaantcctt gcatntttca naccntttgg tacnnncant ttnngntgga 60 

tccctcgttc gctgnacaaa agatgttttt caattaaaag acttggagaa nnttgctccc 120 

aaagagaaan gcattactgn tgtgtcagtn aaaggaancc ttcaaagctt tattngatga 180 

tgggtttggt tggactgtga gaggatcgga acttctaatt attattgggc ttttccaagt 240 

naagctcttc atgcaaggga aacataagtt ggaggttctg gaatctcagt tgtctgaagg 3 00 

gaagtcaaaa gcatgcaagc ctacagaaaa gcattgagaa agctaaaatt ggcccgatgt 360 

gaaacggaag agcgaaccag gctagcaaaa gagctttctt cacttcgaga ccaaagggaa 420 

cagctaaagg cagaaagtag anaaatacaa agactgtgat ccgcaagttg tggaagaaat 480 

ccccaagcaa attaagtagc caaaagaagc tgctaacagg atggactgat taccatattc 540 

gcaataaaat cttgggccaa aagaaaattt gggttttgaa agaaaataaa aattgatngg 600 

aactttttgg aattccagaa gactttgact acatagactt aaaatattcc atggttggtg 660 

aaaggatgta ccaagctttg tgaaatattg taaattttta aacctattat ctactaaagt 720 

ngtactggaa ttgtccnttt gcctgttnac ttgnggtnta ntcatttnta tttaatgntn 780 

aaattaang 789 

<210> 651 
<211> 757 
<212> DNA 

<213> Homo sapiens 



<400> 651 

tnnnnnctaa ncctttgaaa tcgtccntgc atgatccctc gattcgaatt cggcacgagc 60 

agatatttac tgaaggaatc taggttgttt tttcagtgga caatgggaat aanncatttc 120 

taaagcaccg actggagagg aaggcaacag agacaaggag agaagccgag agacatgtct 180 

gcgtgctgcc acgcatctga gcgattgctc tgtgaagagt tgtacactga acattttcag 240 

gggaggctgt ttacccaggc aatgtcctca aacaagcctg tgccggggtg tcctggaatc 300 

tgtgccagga ctgtgttttt agcccttcac ctctcagctt tagcaggaca tgaaccagtt 360 

ataacaagat ggccctgcag ctggttacag gaatgtgaca tggcaggatc tatggaacca 420 

aatggaaggt tttnaggtga tgtaggtctt tcacagttag ctttggggaa tacagaatac 480 

tcaaataaag tgctttgtta ttatttcaga gggaatggcg attgaaatgt tacaacagag 540 

atttcttggt ggtagctatt tgggtaaang tatatggata tttntctgta catgtgaaat 600 

tatntaaaat aaaagttata taaattacat tgacaaaaaa aanangtana aaaaaaactc 660 

gaacctttaa aaactatngt ggagtccgta ttacgttaga tccagacctt gataaganac 720 

cattgatgaa ttttggacaa accccactng aatgcnn 757 



195 



WO 99/33982 



PCT/US98/27610 



<210> 652 
<211> 759 
<212> DNA 
<213> Homo sapiens 



<400> 652 

tcnnnccttt aatgctttga actcgttgca ctgcangatc catcgattcg aattcggcac 60 

gaggctgncc aggcagttnn atggcctnct ggttgtgtgc cttcacaccc gcctacagcc 120 

ccacctcacc atcaagcgct gagccaatgc ggntgtggct ggccctgagt tcctgagtca 180 

gctccttgcc agggccagag ctggtnacag cggggcanca nggtgggtag cctctaccag 240 

ncagggcagt ccctgagggg ccagcanggg ggctgactgc ctagtggctn aacctactga 300 

acccacccac tcccagcgat gctacccaga accccaacgg cntgaatcct gcacantgcc 360 

gggcantgcc agactcnaaa gggctcgctg tggggacagc cccgtcatgg ccacanactc 42 0 

tgtcctcacc tttgattgtc aggatgacag nccccaccac catgatgagc gtctgcaggg 48 0 

cgtccgtgta gattacagca gccaggcccc ctgccaatgg aagcaaggtt gctggaaggg 54 0 

gccctggtcc agggaggaag gacaccggga ggaacttctg ggcttctgct ggggccactt 600 

cctgggctgn tnctcnggnc tgtatgggga agtggccttn tgaccccttt acacgttccc 660 

tgggtggacc ttccctgntt gcangcaccc ataccttgcg atggtgtnca nggctnttga 720 

tgcccnaaac tttaggattg ttggatangt nnaatctnc 759 



<210> 653 
<211> 820 
<212> DNA 
<213> Homo sapiens 



<400> 653 

tgcaatcccn cngnnaatcg ctttgaaanc nccntcnctg tatgatccca tcgattcgca 60 

acagtccagg ctctgcagac agcatcccac ctgtcccagn tngctgacct gaggagcatc 120 

gtggnggaga ttgaggacct tgtngctcgc ctggatgaac tcgngggcnt gtatctccag 180 

ncanaanaan gacngcatac aacagaccat tangangntg tcatctacan tntnanngat 240 

catntgngna cngacccatc cattaatgag gatcanggcn tccanctgat gaacgctgat 300 

cttctgcaan aagaacgttc tagntctanc nnanngccnt cancctncgn ctcttgagct 360 

cagtgngtca ngctcntaan atcttnncac ntgccaanct gtgnggnctg ccttnagnct 420 

tccggatagg cactntnatn ngacntgccc tatanttgcc ngcngnnant naaccaantg 480 

naccatngtc actctgttga catcanggcn atritgnntaa actaatnnct tngcngcact 540 

ctagtnngcg ttgncactgc ccncgtnnnc tancntacca nttcncattn cccntttaat 600 

gggnaaagan atnatcccta cnatcatatt ncccntnnaa tggattcgag ncgnaantct 660 

tnnntantna tctnaancct aaatgntcac atnnaaactt tanangncat cnnnnatgna 720 

accnancnat ggctaaangg cctcattaan gccngntttt tcaaacttga aaantgcatn 780 

ccnccattga naaagganta cacgggcccc cntgngnggg 820 



<210> 654 
<211> 768 
<212> DNA 
<213> Homo sapiens 



<400> 654 

tttnnccccn ttttgtncct nttgattcnc ttgctacntn ttcaaatcng tnggatccca 60 

tcgattcgcc acatttaagt gagatatggg aaggaggagc agattgtttt tgaagggagg 120 

aagagcagtt acttagggtc aaattaagtt gtaaaatccc ccccgggatt ttgtatgtaa 180 

gtcaaagtga attgtatttg gaagaagaac tggggagccc acctctggta ttttttttat 240 
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gtccctcata tggacaaata aacctctggt attaaatgaa 
tatattcggg atttcaacca ccaacctatc tggtttttcc 
gaatcaggag agcagatttg gagactcttt atattttata 
aaccgtttga tttgaaaaag ttttctaggt tccctcaggt 
aacagtttat tcaaggtaca tagcctacta gtttcccatt 
acgacgtgta ctgcttctct acgcagaatg aagtataaaa 
aatttgcagg tgctaaactt tttacatgct tnatctcatt 
ttttaccaag taaantgtct ggaccaacca tntgcaggtc 
naggtttgga ctcctacata gcctnttttn taagtnncnt 



ttttcttttg ggggattcta 
cgctgaaatg ttgggtgatg 
attgagagag acaaagagaa 
agatggaaat tttcatcaaa 
tgagagtacc gcagaatgat 
ttagcaccna atagtacttt 
taattcttag aagaaactaa 
caaaannctg gaaaaaccgt 
nntaaatn 



300 
360 
420 
480 
540 
600 
660 
720 
768 



<210> 655 
<211> 752 
<212> DNA 
<213> Homo sapiens 

<400> 655 

tntncctntt gatccttgca ctannaaatc cgtggatccc tcgattcgaa ttcggcacga 60 

gggtaaacct atttatataa tagaaggatg attataaaca tttaataaat tatatcaaat 120 

agatattata tattaaatgg gcagataata gaaatctgtc caagcaaaac tctggataat 180 

ttttatgttg ccttattttt tgttttctgt gaactccaag aaaaatgaga taccagtttg 240 

gaacagatgt aatattgctg atttaacagt ttagggatac tccccaagtt caataatttt 300 

gccaagatac aaatttaaat ggaacctttt atgaagcttc atagtgtgtg aagaacttac 360 

cttgtttata tgtttgaaga catacatatt tcacatttca gaagagtcta tacatagctc 42 0 

accaaatatc aaaaccacct tgttagaaaa cattaaggtc tgtcttattt atttgttcat 480 

ttgnttatga gacacantct cactctgtaa tctcactctg ttgtagaggt tgagtgcagt 54 0 

ggcacgatca cggctcactg caacctncat ctccctgact caaggaatcc ttccacctca 600 

gccttccaag tagcanggac caccaggtgc accccactat gcccagctta attttttgna 660 

ttttattgga cagattgggg ttttgcccat gttattcagg ctggatcctt nnggcctcaa 720 

actcctgggg cttcaagcca atctggcctg cc 752 

<210> 656 
<211> 754 
<212> DNA 
<213> Homo sapiens 

<400> 656 

ttttcctttt natcttgctc nanaanccnt ggatccctcg attcgcagag gctggttcag 60 

aaaaggagga agaggcccgg ctggcagccc tggaagagca gangatggag gggaagaagc 120 

ccagggtgat ggcaggcacc ttgaagctgg aggataagca gcggctggcc cangaggagg 180 

agagtgaggc caagcgcctg gccattatga tgatgaagaa gcgggagaag tacctgtacc 240 

agaagatcat gtttggcaan aggcgaaaaa tccgagaggc caacaagctg gcngagaagc 300 

ggaaagccca cgatgaggcg gtgaggtctg agaagaaggc caagaaggca aggccggagt 360 

gagtgcctgc ggcccctcac agggctgang ccagccccta tcagctggat gtggcagagg 420 

catgccanag gacctaagtg tgatggacca gantcacttc tnctcctcct ttctncacca 48 0 

gccctgaccc ctcatgctct ctggctgggc cantgggcaa ccctcgcttc cttggatgga 54 0 

ctgcctgctg gtgcctggtc agagaanagc ctnttttccc agnctgattc tntgctccca 600 

ggaaccaatt gaccatnaag gtgcaaangc cnanccaatc cccttacnta ctggccccca 660 

ttnattcctg gctttttcan aagcccccnt gccaaacann ttgggacccc ctgattnntt 720 

aagggtgcct tttnatnggg gttaaaggtt aant 754 



<210> 657 
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<211> 734 
<212> DNA 
<213> Homo sapiens 



<400> 657 

tntgttccnc natgaacgnt ngaancnnna tnccnttgga tcccatcgat tcgctgcggc ,. 60 

cgcaggagct gtggcggttt tcctaatcct gcgnttatgg gtagtgcttc nttccatgga 120 

cgttacgccc cgggagtctc tcagtatctt ggtagtggct gggtccggtg ggcataccac 180 

tgagatcctg aggctgcttg ggagcttgtc caatgcctac tcacctagac attatgtcat 240 

tgctgacact gatgaaatga ntgccantna aatnaantcn tnngaactan ancgagctga 300 

ttganaccct agtaacatgt ataccaaata ctacattcac cgaattccaa gaagccggga 360 

ggttcagcag tcctggncct ncaccgnttt caccaccttg cactccatgt ggctctcctt 42 0 

tnccctaatt cacagggnga agccngattt ggtgatgngt tacngaccac gaacatgtgt 48 0 

tcctatctgn gtatctgncc ttatccantg ggatactagg aataaagaaa gtgatcattg 540 

ntactttcaa agcatctgcc gggttgaaac gatntncatg tcccnaaaga tttgttgatn 600 

tgcagctnct cantgctann gtcggttttg aanaaagttt nccaaatnnn tgtaccttgg 66 0 

gccaatttnt ngacaantng aactgacttg tnagaatctt gcagntaacn gtcttgtntc 720 

ntccaattng ggng 734 

<210> 658 
<211> 783 
<212> DNA 
<213> Homo sapiens 



<400> 658 

ttctcctgaa acgcttngca cttccctcnc tgcaggatcc catcgattcg aattcggcac 60 

gagacactgt cccactccat cacccaggct ggagtccagt ggtgtgatca tagctcgctg 120 

catcctccag ttcctgggtt caagccatcc ctcctgcctc agcctcccca gtagctggaa 180 

ctacaggtgt gtgccatcac acctggcttt acatttttct gtggggtctt actatgttgc 240 

ccaggccggt ctcaaactcc tgagctcaag tgatcctctg nctcagcctc cagagtatct 300 

gggattacat atgtcggcta ccgtgtctgg ccgttcacat ctttggccac tattngcttg 360 

tgaaaaggta tnatgaggtg gtacttatca tngttactgt gtctcatgtt nngtatattt 420 

ttgcttcatc aactaagatg cactgtaaca tctgtgaaat ctggatatat tatcaaangg 480 

tttatcatag ttttgttaac aatacactgt cgttttactn ggtgcctaan ataatggtat 540 

agttgngagg tgatcttaga tttgatgaag cacagtatgc aangtaggcc taatggnggg 600 

aaagaatggg naattttcan angcnnggaa gtatttgntn ttttgtaaat ggacttgaaa 660 

agcttgttct gnnggattgg acccaacccc tttccctttn aaaccccgaa ttctnatnga 720 

ctnttccaac ttngaaaact ttgctcnaac ttaaatacct ttnaaaaatt aaccntgacc 780 

ccg 783 



<210> 659 
<211> 741 
<212> DNA 
<213> Homo sapiens 



<400> 659 

tcttcctttg tatacctgct nttgctcttt ntgcaggatc cctcgattcg ctttgagcta 60 

ggataaaaat tgggtaaagg acatttgctt acctgcaaat gaatcactgt ggaaatgtga 120 

tcttcccata tcatcaagaa acttgttttc tggatgaata ctgggagaat aaaatgagaa 180 

ctctggagtg agctaaattg atcccaatta agtttttctg cttagcagac agaaggtata 240 

attttttgac accctttccc acctggtgcc tatgctaggc ttgtcctgag aacatccctc 300 
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agtaacttga tattcacatg acctacagga tgtcccatct gcagggctga gtcagttggg 360 

gaacaccaga ggctacacag tagctcttcc tgctactcgg ttaatgagct tggcaggttc 420 

tttgtctcac tgaattctta tcatggaaac agcagcagca gccgctagga aatcttcaag 480 

tgtagtgtct gtgctaaccc agtggtaaat cccttagatc ccctgctggt ctctggcagt 54 0 

ctccttgatt ttgggtacca tgtatatttt ccgctttgac tttaacgctt tctaggatag 600 

ggtaagcacc cttaattcan gcactgtcca ttagcttcct ttgcaaaagc tacttatggn 660 

cggtcacaat ncaacactna nacagagcca aggcaatatc ctcttgccca tggctatgat 720 

gtcagacagt ggatggctcn t 741 

<210> 660 
<211> 734 
<212> DNA 
<213> Homo sapiens 



<400> 660 

tctgnnctnt gtntccttgc tcgtgttctt ttgcaggatc cctcgattcg aattcggcac 60 

gaggactgga gaagtcagaa gtagaaaagc agattgctag gagagacagg atgacagatt 120 

ttggtcagaa aatgggatat tggagtttaa agtatcaaat acagaatagt tccagatgtt 180 

cagagatcca gcatgggatt aggtactgaa atggattaga actaaaagtc actagaattt 240 

agaaattgag aaccatgaga gtggatgcaa tgacttgttg cttgattgaa aaataaatta 300 

ataataataa aggaccatga gactagcctg ttataggggt tatctccatg aacattgaat 360 

tttcccagga tcatagcagg aattgggtag agaaaaagat tatgagaagg tgccagagtc 42 0 

ttcagtgaat gtcaggaaat taccaggaag tcagcatatg acagagaaaa ggacagtatg 480 

ttatctgcat caaaggaaaa tgtgcttttg ttgaaaagta cagaaaaagc caatactaca 540 

atactgtgct aagcccctac ctgtactcct ctcccacagc tgcattccag ccctgtggta 600 

taaaaggtgt gagaatgagc ttttccacca gaatcagcag gtttagttaa agcatgagca 660 

gaacaagcat nctatgaaga gactgaggat gtaggtgagt ggtctaaatc tcatnnaagg 720 

acattgcagt ngat 734 



<210> 661 
<211> 762 
<212> DNA 
<213> Homo sapiens 



<400> 661 

ttnnnnnnct ccnaatcctc cngatnanat cnctttgnan ctncctgcag gatcccatcg 6 0 

attcgaattc ggcacgaggt ccatacatgg agctccctgg agcccgtgtg ntntcgtgtg 120 

actgaacgtt ttgtgatgaa aggaggagag gctgtctgcc tttatgagga gccagtgtct 180 

gaattgctga ggagatgtgg gaattgcaca cgggaaagct gtgtggtttc cttttacctt 240 

tcagctgacc atgaactcct gagcccgacc aactaccact tcctgtcctc accgaaggan 300 

gccntngggc tctgcaaggc gcanatcact gccatcatct ntcagcaagg ngacntatat 360 

gtnnntgacc tgnagacctc agctgacnct nccttngtan ggttngatnt nggaagcatc 420 

ccaaggngat ttagngacnn tggantcctn atnactgata anacncnaac tatantnttt 480 

tacccttggn agcccaccag caagaatgag ttggagcaat cttttcatgt gacctnctta 540 

acanatatac tctgaatgaa tctacgttgt atttatcagg nggacaatgg gaataaagcn 600 

tttntaaagc accnantgga catgaaagca acagacacna ggagnnaagc cttgagacat 660 

gtctgnnntc tgaccgcatn ttgatccant gntctgtgan ganttnttca ctgaacattt 720 

tcaagaggag ggtgnatacc cctggcaatn gcccnaanaa ag 762 



<210> 662 
<211> 745 
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<212> DNA 

<213> Homo sapiens 



<400> 662 

nanatccnnc nantncttnt tgttcntgtc cgnangatcc catcgattcg aattcggcac 60 

gaggtttcat ttaagaagaa tganctagat anatgtgctc ttctggttac cccaccctga 120 

cagagtgcat ttttacacgg ctagcagggg ttgagactgc agcctggcct gccagccatt 180 

ggaggtgttt aaggaagggc agataatgtg actctttgcg gggtgccatc tgcttaccca 240 

ttagcgagca nagggggttt ctgcgggtga cccccagcat atttctaggt tacttatggg 300 

cagatttgta agtgacaaaa ctccagctga tgctgggaat ggggagaggg cccttgaggg 360 

actttgtggt tttgtgcttc tggtttcctg gccaacccca gggtcacttg tctggaggcc 420 

cagctgggca ctaatgtctg ccaccgacta tgttaaagtg tataaatgat tcctctattt 480 

gggagagatc ttccaatcca gaggagcccn tcttggactg cctgggttaa atctgcatan 54 0 

cagangtggt tgatgaagtt catctgaaga aattcagccc cacctnccca ccctgccntt 600 

cctgctccct tttgatagtg gcttctgggt actcgggcnn gtncttggga caccancctt 660 

ntctgggggt ctnaagccat cccgttgggg ctgtcggcca agcctaagtt aatcgtgtgc 720 

ctntattggg aggatngctn ntcct 745 



<210> 663 

<211> 748 

<212> DNA 

<213> Homo sapiens 



<400> 663 

taatcctntt gataanaatc cttgtncttg ctnntgancc ntcgattcga attcggcacg 60 

agggcaagtt tccaaagatc agtgtggagt gctacagaaa taattatagg agaggaaatc 120 

ataatcacag aaggtataat gcttgtttga ggctccggaa taagaactaa aaaaaaacaa 180 

aaaacactgg tttcatgctt acggggtaca cactttggtg catcccgtga acacaaattt 240 

taataccaaa caatccttga tgcttcacct ggggctgcca agcagtttgt aaaacagagg 300 

aaaacattta gtgcagtctg tattatcctt ttccaacttt tctgtttgtg caagtttttg 360 

aagattcatt ggccaaacaa tgaacaacaa aggttttctg agagaagaca aggtggactt 420 

ttcattttgt tagtaaatac cagtggcact gttgaacgaa acaaatactt ttatctcagt 480 

ctttcaaatc agtattaatg tctgtgtttc cttccactga cagctcttct tctagtttca 540 

ctgaaaaaag ggtgttagta tttttatctt ggacactctc ttccaaatcc ttcagcagct 600 

cctcttcttt atattctgcc acatcgacct ctaaaccgga attgtccttc agtttgccgt 660 

ggtgcttgag atantacccg ctggttctga aagaacttga tgatggtgta ctttgggaag 720 

gtcnaactgg gcanacagag tctggatt 748 

<210> ^64 
<211> 785 
<212> DNA 
<213> Homo sapiens 



<400> 664 

gtnnnnccnc nnaccctnnt gaatntaatc cttgttcttg ctgcatgatc ccatcgattc 60 

ggtcaagctg gccctggatg tggagatcgc cacctnccgc aagctgctgt agggcnagga 120 

gtgcaggctg aatggcgaag gcatatggac aagtcaacat cnntgnagng cagtccaccg 180 

ncttcagtgg ctatggcgnt gccagcgntg taggcagcng cttaggcctg ggngnnggaa 24 0 

gcagntactc ctatggcant ngncttgncn ttggatgcng cnntagtncc agcagcggna 300 

nagccactgg gggtggcctn agctctgtng gaggcggcag ttccaccatc aagtacacca 360 

ccacctcctt ctccagcatg aagagctaca ngcactgaan tgctgccgcc agctctnagt 42 0 
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cccacagctt tcaggcccct ctctggcagc atagccctct cctnangttg cttgtcctnc 

cctgncctcc antctcccct gccctaccgn gnagagctgg gatgccctca ctttntnctc 

atnaatacct gtttcactga actcctgttg cttaccatca tgtcncagtt atcagcactn 

aaancatgct aatgnccttt tataagnccc ngtatttatt acaagnatct tgaantctgc 

cattaaattc ttgaggaang aaaatgacct attatccccc ataaagaacc tgaaacttca 

agnctaangt cccagcntnc aacanggaag gagntccntt tttttnattn gctaaaccan 
tcctc 



480 
540 
600 
660 
720 
780 
785 



<210> 665 
<211> 763 
<212> DNA 
<213> Homo sapiens 

<400> 665 

ggnngntgnn nntnntaatt nctnttnaat nncantcctt ggntctngnt ntagganccc 60 

atcgattcgc tgaaccctaa aggaaagcca gcaaaccagc tgcttgctct caggactttt 120 

tgcaattgtt ttgttggcca ggcaggacaa aaactcatga tgtcccagag ggaatcactg 18 0 

atgtcccatg caatagaact gaaatcaggg agcaataaga acattcacat tgctctggct 24 0 

acattggccc tgaactattc tgtttgtttt cataaagacc ataacattga agggaaagcc 300 

caatgtttgt cactaattag cacaatcttg gaagtagtac aagacctaga agccactttt 360 

agacttcttg tggctcttgg aacacttatc agtgatgatt caaatgctgt acaattagcc 420 

aagtctttan gtgttgattc tcaaataaaa aagtattcct cagtatcaga accagctaaa 480 

gtaagtgaat gctgtagatt tatcctaaat ttgctgtagc agtggggaag agggacggat 540 

ntttttaatt gattagtgtt tttttcctca catttgacat gactgataac agataattaa 600 

aaaaagagaa tacngtggat taaagtaaaa attttacatc ttgtaaagtg gtggggaggg 660 

gaaacagaaa taaaattttt gcactgctna aannnaaann actttccagc naanctaaaa 720 

aactnnancc tttaaactat antgagttcg nanaccnggn ccn 763 

<210> 666 
<211> 759 
<212> DNA 
<213> Homo sapiens 

<400> 666 

nnttnnatan nngctcttgt tctttttgca ggatccctcg attcgtctag acctctgaca 60 

tcatggtgtt ttcttaatgc ctcacattgc tggcacgggg atgtgccctg cctgccagca 120 

cctaggactt cgagttgggt tgcagcttat gacatgcatg ataggttttg gaaggtaact 180 

tttaactgca aacctataaa gtactatttt ttattttata aatgaacagg gttttaacgt 240 

gctcaacttt aatttttttc aattgtatga aggccttaaa aaagctacat taagcgtagc 300 

taaaattatt tattggacta aaaactaaca gaacttcatt tccagaattt ttttttttgg 360 

caaatgttta cattcaatta aggggaaaaa gtagaaccag cacaaatgag tggcagttgc 420 

tggagcataa ctgcttcaat aaatcttcat cttggggtaa ttacaggcaa gtcattttca 480 

catcctcttg aggttcagag catcagaatg aactctatga atacatgtgt aagtgccaga 540 

cagctgaatc tttatcaggt attgnaaaga tacacatatg atatgnttat taaaattgaa 600 

ataatgtaaa acacatgaat aaatttgcaa aaccaagatc acagtccacc atatgcactc 660 

tggtacctta aatttttttt ataaataatt naaaagggaa tattggaagc ttcttaaaaa 720 

aaaaaaaaan aaaaaactcg agcctntana acttttgng 759 



<210> 667 
<211> 760 
<212> DNA 
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<213> Homo sapiens 
<400> 667 

ggnnttnaaa ctnctaatnc tgtnttgcag gatcccatct atncntatan angctctagg 60 

cggngcggnt cccactctcg gaaccttgtc ctgtttgtcc cccagctcgg caagcgccat 120 

atgagcctgg cggcgccaga tgcgaatcct gttctgggct ttttggccta ttcccgcccc 180 

tcagtcttgc cgggatggca ccgcccgcat aggacttcca gggttgggct gantgggagt 240 

tcgactgctg ggcctcgtaa ttctcgcttt ggggctgctc cttccaggct gggacacact 300 

ggggcccgct gtcggtctcc cgtcctccga catcttgtct ggaacttccg cctggcagtc 360 

tccagtagga gtggagctct gtgcggcgta ntttggtgga aaaacnggcc ttgcgtcggc 420 

ctcaccccca gtgtttgtgt ttcagaatga agactattct cagcaatcag actgtcgaca 480 

ttccagaaaa tgtcgacatt actctgaagg gacgcacagt tatcgtgaag ggcccagagg 540 

aaccctgcgg agggacttna atcacatcaa tgtagaactc anccttcttg gaaaagaaaa 600 

aaaagaggct tccggtttga cnaaatggtg gggtaacaga aaggaactgg ctacccgttc 660 

cggactattt gtaagtcntg tncagaacat gatcaaaggg tgttacactg ggctttccgt 720 

tacaaagatg aangtctgng natgcttaat ttccatnaan 760 

<210> 668 
<211> 763 
<212> DNA 

<213> Homo sapiens 
<400> 668 

gntctatgtg gctctngttn ttttgcggat cccatttgac gccttggcac gagaagaaaa 60 

cccatggaaa gtagcagtgt tgtgagttgc agagacagga aagatagaag acgttccatg 120 

tgttattctg atggtcgaag tttacatttg gaaaaaaatg gaaatcacac accatcctcc 180 

agtgtgggca gctctgtaga aattagttta gaaaattctg aactgtttaa agatttgtct 240 

gatgccattg agcaaacctt tcagaggaga aatagtgaaa ccaaagtgcg acgtagcacg 300 

aggctacaga aggatttaga aaacgaaggt cttgtatgga tttcacttcc acttccttcc 360 

acttcccaaa aagccaaaag aagaacaata tgtacatttg acagcagtgg atttgaaagt 420 

atgtctccca taaaagaaac tgtgtcctcc agacaaaaac cgcagatggc acctcccgtc 480 

tcagatccag aaaacagcca gggccctgct gctggttctt ccgatgaacc tggtnagagg 540 

aggaagagct tttgtatatc tacacttgca aatactaaag ccactttcca gttnaaaggc 600 

tnccggagaa gatcctctct ttaatgggga aagggagaga gctctcttga ctggccttgg 660 

gaaagggatt ggaacataat ggggagaaaa gaaagccgta attgacattt tctggcanan 720 

tcttgtnanc aagaggggna aagtnaccct tntntgcttg aaa 763 

<210> 669 
<211> 754 
<212> DNA 

<213> Homo sapiens 
<400> 669 

tgnttctaat gctngctctc gttctttctg caggatccca tctattcgaa ttgatgagcc 60 

ttattaacta tcttttcatt atgagacaaa ggttctgatt atgcctactg gttgaaattt 120 

tttaatctag tcaagaagga aaatttgatg aggaaggaag gaatggatat cttcagaagg 180 

gcttcgccta agctggaaca tggatagatt ccattctaac ataaagatct ttaagttcaa 240 

atatagatga gttgactggt agatttggtg gtagttgctt tctcgggata taagaagcaa 300 

aatcaactgc tacaagtaaa gaggggatgg ggaaggtgtt gcacatttaa agagagaaag 360 

tgtgaaaaag cctaattgtg ggaatgcaca ggtttcacca gatcagatga tgtctggtta 420 

ttctgtaaat tatagtttct tatcccagaa attactgcct tcaccatccc taatatcttc 480 
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taattggtat catataatga cccactcttt cttatgttat ccaaacagtt atgtggcatt 
tagtaatggg aatgtacatg ggaatttccc actgacttac ctttctgtcc ttgggaagct 
taaactctga atcttctcat ctgttnaaat gtgnattaaa gtatctacct aactgagtng 
tgantgtant gaaagaaagg ncatatntta aacnttgaat ttancaagcc cacnctcgna 
ttttatgncc tttcttttgc ctngggattg aanc 



540 
600 
660 
720 
754 



<210> 670 
<211> 752 
<212> DNA 

<213> Homo sapiens 
<400> 670 

tgnttctaat anttgctact tgttcttttt gcaggatccc ttttgacgnc tttggcacga 60 

gaaagaaagg gctcgtgaca gagaaagaag aaagagaagt cgttcacgaa gtagacactc 120 

aagccgaaca tcagacagaa gatgcagcag gtctcgggac cacaaaaggt cacgaagtag 180 

agaaagaagg cggagcagaa gtagagatcg acgaagaagc agaagccatg atcgatcaga 240 

aagaaaacac agatctcgaa gtcgggatcg aagaagatca aaaagccggg atcgaaagtc 300 

atataagcac aggagcaaaa gtcgggacag agaacaagat agaaaatcca aggagaaaga 360 

aaagagggga tctgatgata aaaaaagtag tgtgaagtcc ggtagtcgag aaaagcagag 420 

tgaagacaca aacactgaat cgaaggaaag tgatactaag aatgaggtca atgggaccag 48 0 

tgaagacatt aaatctgaag gtgacactca gtccaattaa aactgatctg ataagacctc 540 

agatcagaca gaggactact gttcgaagat ttttggaaga atactgagaa cggcataaag 600 

tgaagatcga catttaaaaa atgaggtgaa agaaagctnt tgtggcatag aaaaagtntt 660 

aagctcaant agttttttta ttattattat tattaaaagt tattcaggac tgatgtgact 720 

ncngatttna gaacatgtgg taatagtnta nt 752 

<210> 671 
<211> 752 
<212> DNA 

<213> Homo sapiens 
<400> 671 

tgnttctaat gttgctactc gttcttttgc ggatcccatn ttattcgaat tcggcacgag 60 

gatattcaca cagtatgtat tatattaacc atatcacact taagttatta aattcagact 12 0 

atttgtaact tattgttata gggcctgccg tatggcttag gatatttgag taatcatata 180 

tttaaagtaa aaactttggg ctgggcacag tggctcacac ctgtaatccc agcacttggg 24 0 

gaagctgagg tgggcagatc agttgaggtc aggagttcta gaccagcctg gtcaacatgg 300 

cgaaacccca tctctactaa aaatacaaaa attagctggg cgtggtggca cacacctgta 360 

atcccagtta cttgggaggc tgaggcacaa gaatcgcttg aacccgggag gcggaggttg 420 

cagttagcca agatcgccct gctgcactcc agcctgggca acagagggag actctgtctc 480 

caaaaacaaa aacaaaaact gttagtgaag gttccctggg acttttgata ttttaaaaat 540 

tggtcttatg actagtagat aaattcattg ccataatgag gctagctccc agataaacag 600 

tgtattttct tctttttttt ttttggtgag tggtccaaac tttaagctac tttttccagt 660 

antttgccac tttctccgan gtaantttgg ctggtctttn agtaatgcta attgngtgtc 720 

aaaatttgtc tacaacagtt nggcaacaga tn 752 

<210> 672 

<211> 792 

<212> DNA 

<213> Homo sapiens 



203 



WO 99/33982 



PCT/US98/27610 



<400> 672 



tgnttctaat actngctact 
ggctgcttct ggctgggggg 
agagccctgg aggagcagga 
ctcaggtgtt gggaaatggg 
aaccagatgg caaggatgag 
tgttgcctcc acccccagca 
tctcacggcg ggcagtttat 
ccttctgccc ccgacctgac 
ctggacnaca ctggagtgga 
aaccggcaca ctggacttct 
atnnttgggg gatcccnnaa 
canccctttc cccttggccc 
gtangaaaan anaatantnt 
antnatcntn nt 



ngttctttct gcaggatccc tctattcgaa ttcggcacga 
tccttggcct tcatcctgct gagggtgagg aggaggagga 
ggaggagcca gtggcgacgg gggattctac gatccgaaag 
gaccccgtct tctggacacc agtagtccct ggtcccatgg 
gaggaggagg aggaggaaga gaaggcagag aaaggcctca 
ctcgaggatg acatggagtc ccagctggac ggctccctca 
gtgtgacctg gacacagaca gagacagagc caggcccggn 
cacgccggcc tagggttcca gactggttgg acttgttcgt 
acactgnctc ccactttctt gggactttgg agggangtgg 
tccgtctcta nggctgcatg gggagccctg gggagcttna 
aangaccccc tgtcccccat anacttgggt ttttngcttt 
cnnttgacca cttcatggag tttaattaaa atngcccttg 
tcctcntttt antgntnttt tnntataatt tnatnatcct 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
792 



<210> 673 
<211> 755 
<212> DNA 
<213> Homo sapiens 

<400> 673 

nttctaatnc tngctacttg ttctttntgc aggatccctc gattcgaatt cggcacgagg 60 

cagcttcgag ccaatggtga gctccttctg gatcagctcc ttcagctcct tcttgctcag 120 

gatgctgaaa ttgcaaggct gatggaagac ttggaccgga acaaggacca ggaggtgaac 180 

ttccaggagt atgtcacctt cctgggggcc ttggctttga tctacaatga agccctcaag 240 

ggctgaaaat aaatagggaa gatggagaca ccctctgggg gtcctctctg agtcaaatcc 300 

agtggtgggt aattgtacaa taaatttttt ttggtcaaat ttaaaaaaaa aaaaaaaaaa 360 

ctcgagcctc tagaactata gtgagtcgta ttacgtagat ccagacatga taagatacat 420 

tgatgagttt ggacaaacca caactagaat gcagtgaaaa aaatgcttta tttgtgaaat 480 

ttgtgatgct attgctttat ttgtaaccat tataagctgc aataaacaag ttaacaacaa 540 

caattgcatt cattttatgt ttcaggttca gggggaggtg tgggaagttt tttaattcgc 600 

ggccccggnn gccaatgcat tgggccccgg tacccaactt ttgttccctt tantgagggt 660 

taattgcncc ccttggccgt aatcatggta atagctgttt cctggtgnga aattgtttcc 720 

cgtnacaatt ncacacactt ttcancccgg ggacn 755 

<210> 674 
<211> 753 
<212> DNA 
<213> Homo sapiens 

<400> 674 

tgcttctaat gcttgctact cgttctttnt gcaggatccc tcgattcgca gatttttgac 60 

aaggaaggct aattctaaac ctgaaagcat ccttgaaatc atgcttgaat attgctttga 120 

tagctgctat catgacccct ttttaaggca attctaatct ttcataacta catctcaatt 180 

agtggctgga aagtacatgg taaaacaaag taaatttttt tatgttcttt tttttggtca 24 0 

caggagtaga cagtgaattc aggtttaact tcaccttagt tatggtgctc accaaacgaa 300 

gggtatcagc tatttttttt taaattcaaa aagaatatcc cttttatagt ttgtgccttc 360 

tgtgagcaaa actttttagt acgcgtatat atccctctag taatcacaac attttaggat 420 

ttagggatac ctgcttcctc tttttcttgc aagttttaaa tttccaacct taagtgaatt 480 

tgtggaccaa atttcaaagg aactttttgt gtagtcagtt cttgcacaat gtgtttggta 540 
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aacaaactca aaatggattc ttaggagcat tttaatgttt attaaataac tgaccatttg 
ctgtanaaag atnanaaaac ttaagctttg ttttactaca acttgtacaa agttgtatga 
cagggcatat tctttgcttn caanattttg ggttgggggc actangggtt caaaaccctg 
gcanaattgt cnactttagn ctgaccataa tnc 



600 
660 
720 
753 



<210> 675 
<211> 760 
<212> DNA 

<213> Homo sapiens 
<400> 675 

tgntttctaa acnttgctct cgttntttnt gcaggatccc atctattcga attcggcacg 60 

aggttccctc accttattcc tccaagttcc cccttgggaa cctctgagat taacttgata 120 

agctccttgg gcaagctctt tatcctaaga ttcctcagtg agccttatag agttgctgcg 180 

agaattacat ttgttcatga tgtcaagtgt ctggtatgta gctaatgctt attgaacaca 240 

tagtaattta ttgaataatt gtcatgatca ctggatgaga tatagccact gtggaggtag 300 

gcacaccagg gttttagagg cttgggatct tgcaacagga ttttcctctt gcctctccaa 360 

actgcccttt gcccagatgg cttcagcatc tttttgcatc cctgtttcct tgtttggtga 420 

acacctgtct caacctgtct gcaaggcgtg gtgagattct gcatccttgg taagcactca 480 

tgtcactcca aaacagctgt ttgatgctaa tagcacacat gaggtcttgc aaatttgtct 540 

gaggaactac aggacattgg agagatattt atcaaacacc cactacatgc ctgatactta 600 

actanggaac tatnaaagtg ggtggtgaag acaagtgnga agtaaantgc aaacctattt 660 

ccatatatgt ttgnncgcta gattgntncc ancaattngc ntcttggaat tgttgaattn 720 

ggccctgtgt gtgtgcctgt ggtaantgga nntgngtttc 760 

<210> 676 
<211> 751 
<212> DNA 
<213> Homo sapiens 

<400> 676 

ntttgaaact tnctactngt tctttttgcg gatccctcna ttcgaattcg gcacgaggca 60 

gaaccttttc ccctctactc ttgtctaaaa gttctgtgtg gcacacagag atgcgaccta 12 0 

ctcaatctga cttagtaaaa ccatgctgta gaatttttgt cttaaaaaga ccacataccc 180 

agcacccatg aaataaaaga ttcatctgta attgggattc aaagtgatta aattcctttg 24 0 

ttcatactca taaatagcac taaagtgtta taacattttc atttacctat ttttagttcc 300 

ttcattttaa cttaataaaa atcttggatt gatattcttt tttttttttt ttgggacgga 360 

gtctcgctct gtcacccagg ctggagtaca gtggctctat cttggctcac tgcgagctcc 420 

gcctnccggg ttcacgccat tctcctgcct cggcctgccg agtagctggg actgcaggcg 480 

cccgccacca cacccggcta atttttttgt atttttagta gagacggggt ttcaccctgt 540 

tagccaggat ggtctcgatc tcctgacctc gtgatccacc tgcctnggcc tccaaagtgc 600 

tggaattnca ggcgtgagcc accgcgcccg ggnctaaatt ggatattctt taaccattaa 660 

aaggtttact gggtgnccna tttgccatat tattggaaac ttggaaaggg taatttgaaa 720 

caaagntttg aagttaactg aaatttgggg a 751 

<210> 677 
<211> 756 
<212> DNA 

<213> Homo sapiens 
<400> 677 
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tgctttgaat cctttgtaan cgccctntnt gcatgatccc tcnattcgaa ttcggcacga 60 

ggataaactc ttcagtgacg aatattagaa ttagttagtt atacatttga ggaaaactat 120 

aaaagtacca ataatgagta ggaaatcact tctgcagtat ttttggagca ttttccttaa 180 

gcatgacata aaagccaaag gtcacaaggg aaaaaactga tagatttgtc tgtgatattg 240 

agagatgtat gcacatatac atacaacagt catagtaaga caccgttaga caaaaggtga 300 

tgtatgaaaa agaggcaaaa caacaagaag aaaagattga aaaaatgaga gctgaagacg 360 

gtgaaaatta tgacattaaa aagcaggcag agatcctaca agaatccagg atgatgatcc 420 

cagattgcca gcgcaggttg gaagccgcat atttggatct tcaacggata ctagaaaatg 480 

aaaaagactt ggaagaagct gaggaatata aagaagcacg tttagtactg gattcagtga 540 

agtttagaag cctgaaactt ttctcgtatg gggtggtttt tgcattaaat nctggggtcc 6 00 

attttacaat ccattatttt tgaccactgc tatgtgttca agtagtatga gaatgtgatt 660 

gntnttatct ggntcatata tatttctttg gctaatttaa tatgtcaaat aaatgagttc 720 

atttaaaaaa aaaaaaaaaa acccggactg ttttnt 756 

<210> 678 
<211> 756 
<212> DNA 
<213> Homo sapiens 



<400> 678 

gnnnnnnnnn nnnttnnaat agnnagctac ttgttctttt tgcaggatcc catcgattcg 60 

aattcggcac gaggggtgtt ggagcagatt gtagttgatc cacagcaaag agcatcacca 120 

aagccattcc aggaggaact agatccacca cttcctctgc tgggcatgct ccaaaaatgg 180 

ttgtggcttc cagagaggac tccaaaagaa agcacaaaaa ctagacagtg ggagggcata 240 

cccaaaagcc ctgagtttct gaaaaaatat tgaaagtttc tatggtgaaa taggaagtta 300 

atgtgcttag gaagaaaaaa gtggtaatga ttcaaggaaa cataatcaca cacggtttta 360 

gttttaatgg acatgggagg agccataaaa gtagtctatc tatcatcagt tacatatcta 420 

atgaactgtc tatctgggat accctatcct gttttaatct gagtgactct ctctcagctg 480 

agagagctgg acagactcca ttttagcctc ttcacttgca gtccccttat ccccctccct 540 

taagggaata actagtgcaa gctgacttca agcacattca ggaatgcact tactgataag 600 

atattgaggc aagctgtacc agcagcttct gggggacctg ctcantggat ggtcccaacc 660 

cctgcattta tctctttggg atagtttaag cccctgnacc tggaactgng tatttttctg 720 

tactatctct gtancattaa tttttttact ttttgg 756 

<210> 679 
<211> 747 
<212> DNA 

<213> Homo sapiens 



<400> 679 

tctaatnctt ggctctcgtt ctttctgctt gatccctcga ttcgaattcg gcacgagaaa 60 

tgactccctg caaaacccaa cccatgctgc tggctgtggg atttttggtg taagcctatc 12 0 

tatgcactct atcagccaga atttggcatt tagctcttag ttaaatctag taaaggacag 180 

tctattgttt aaagagaagg tgcatttgtt cctcaatcaa gcaagagcac ctgtgttgta 240 

ctgctttata tctcatgtat atttatagta atgaaaagac tttttaaatt gtacacgttt 300 

cagtgccttt cttgtgttat gaaaggcagg tagatattat agccataggt aaaaatccat 360 

agttaaattg cacactgacc ttaaatctct ctgtgtatgc ccttgtatct tgcatgttaa 420 

aagttggatt attgggcatg tgtggcagcc tgccctgcta catgctagac aagtgtgctt 480 

tagtacatag ccacaagttc ttcattcttt aaaatgtttt gacagatcat ctcataataa 54 0 

aaataattca ngaaaactat ggggaaatag ttacatttca caaaagatat tttaaactct 600 

ttgtaaaact tagataatag agcctancaa gttactttgn atctaattgg atacatttta 660 
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tgnttaattt taccaccata cattttatta atcaaaattg gttagcatgt gactcttttt 720 

ggcttcanaa gttntcaaaa aaattat 74 7 

<210> 680 
<211> 750 
<212> DNA 
<213> Homo sapiens 

<400> 680 

ttctaatnct tggctctcgt tctttctgca ngatcccatc gattcgaatt cggcacgagg 60 

accggctggg cctacaaaaa gatcgagctg gaggatctca ggtttcctct ggtctgtggg 120 

gagggcaaaa aggctcgggt gatggccacc attggggtga cccgaggctt gggagaccac 180 

agccttaagg tctgcagttc caccctgccc atcaagccct ttctctcctg cttccctgag 240 

gtacgagtgt atgacctgac acaatatgag cactgcccag atgatgtgct agtcctggga 300 

acagatggcc tgtgggatgt cactactgac tgtgaggtag ctgccactgt ggacagggtg 360 

ctgtcggcct atgagcctaa tgaccacagc aggtatacaa gctctggccc aagctctggt 420 

cctgggggcc cggggtaccc cccgagaccg tggctggcgt ntccccaaca acaagctggg 480 

ttccggggat gacatctctg tcttcgtcat ccccctggga nggccaggca gttactcctg 540 

aggggcttga acaccatccc tnccactagc ctctccatac ttactcctct nacagcccaa 600 

attcttgaaa gttgtctccc ttgacccttc tttaatggca acttaactga anaaagggat 660 

gtncncttat atccaaaatt cagctatttg gcaaataaac canatggatt aaaaaaaata 720 

attntntctt aananaaana actccggcct 750 

<210> 681 
<211> 748 
<212> DNA 
<213> Homo sapiens 

<400> 681 

ctaatncttg gctctcgttc tttctgctng atccctcgat tcgaattcgg cacgagccca 60 

gctgctcagg aggctgaggc aggagaattg cttgaaccca agaggcggag gttgtggtga 120 

gccgagattg cacctttgta ctccagcctg ggcaacgagc aaaaaactct gtctcaaaaa 180 

aaaaaaaaaa aaagaaaaag aaaaatggct tccaggacag agcatgctca tttgctggcg 240 

gacagttcca gaaacagacc ctgttagtcc ttctacttac ctgctggatt tttcaagcac 3 00 

taaatttata actttttgaa acaaaataat gtgtaatttt ccatttgggg gcaaactcta 360 

ttcttgtgag cattattaaa atcttgtttg taaatatatt gtctttctct taatatttgc 420 

tctgggtcan gaagaagctg ttcacggtgt gataatactc tttanattgt gctttcatta 480 

ttatagatgc atcatgtctt ctgctttcac gtgtctggga tggggtcaga aatgcatnct 540 

ccagntgaca naaaaatccn agnatgagat caanaaggat actggtgttt tctgactttt 600 

acaaaaatta ctttgntgtt ttcattaaaa aaaaagcttt aacctantgn ttncntantc 660 

cttttagaaa ntattaaatt tnaaaatgaa ttcnatanaa atanaannac naaaaaactt 720 

nntnccttta naactttagt gangcgtn 74 8 

<210> 682 
<211> 755 
<212> DNA 
<213> Homo sapiens 

<400> 682 

ctaatgctng gctttcgttc tttctgcagg atccctcgat tcgaattcgg cacgagcagg 60 

agcaatcaat tcctgtcgaa gtgaatacca tgcagctttt aacagtatga tgatggaacg 120 
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catgaccaca gatatcaatg cactgaagcg gcagtactct cgaattaaaa agaagcaaca 180 

gcagcaggtt catcaggtgt acatcagggc agacaaaggg ccagtgacca gcattctccc 24 0 

gtctcaggta aacagttctc cagttataaa ccaccttctt ttaggaaaga agatgaaaat 300 

gactaacaga gctgccaaga atgctgtcat ccacatccct ggtcacacag gagggaaaat 360 

atctcctgtc ccctaccgaa gaccttaaga cgaagctcaa ctncccgtgg cgaactnaca 420 

tccgagtcca caaaaagaac atgccaagga ccaagagtca tncgggctgt ggggacaccg 480 

tanggctgat agatgagcag aacgaggcca gcaagaccaa tgggctgggg gcagcagagg 540 

cattcccctt tggntgtcan gcgacagctg ggagagaang caagnaagcc ctgaangcna 600 

gtccaggagg accnncnaag ggcagtttcc ggagcccgnt gttccggaga tgctgatgtg 660 

ggntgtgtct gcanttcang gccaaanttg gggacccctg ggaactgtac cctangggnt 720 

ncttgnagnt taaaacttga ccttaanggn ngcct 755 

<210> 683 
<211> 755 
<212> DNA 
<213> Homo sapiens 

<400> 683 

ggntttnnnt ctttctaatg cttggctctc gcctntctgc ttgatcccat cnattcgaat 60 

tcggcacgag aattagtatc aacttacaat ccaagtccaa gtatcatctt ataatcactt 120 

ttttctacta tattaagatc taatgaattt gatttctttt ttgaagtttt ttcttgtaac 180 

atctgagatt agaagtttaa gatcacttga ccccaaacct ttgtttatgt aagaattttt 24 0 

aaacataaaa gtgtttgttt ctgttatgtt accataattt gatgtatata gtgtccagat 300 

ccatttagaa atttaatatt tattaataac tgaaactgtt tgtcttcctt tggtatatag 360 

tctcgcatat tatattatag caggccaaga taaaattttg acagctcttt aagcccacat 420 

gcagcagtgg gtcagataac cctgtggcag tgacacgggc aaattggcat ttgaataaag 480 

ccctgggacc acctcaacat gcgtagcctc ttgtcttaaa tgtactcccc atggcagcat 540 

ggaggaggca agacctgtgg gtcaattttg aactggnctt actttgattt taaaacaaga 600 

gactcagggg aaagtactaa accaaaaact ctgattntac tttgcgtttt ctggaagtnn 660 

ttggtttact gagatgcttt tgtaaaggaa aataatgctt gngacanttt agtaatttct 720 

acanaattcn ttaatatttc ttcctcntgg gcttn 755 

<210> 684 
<211> 774 
<212> DNA 
<213> Homo sapiens 

<400> 684 

ggntttnann ctttnnaatn cctttgctnc tcgntctttn tgctggatcc catcgattcg 60 

caagatctgg aggaatgcag agaggaactt gatacagatg aatatgaaga aaccaaaaag 12 0 

gaaactctgg agcaactaag tgaatttaat gattcactaa agaaaattat gtctggaaat 180 

atgactttgg tagatgaact aagtggaatg cagctggcta ttcaggcagc tatcagccag 240 

gcctttaaaa ccccagaggt catcagattg tttgcaaaga aacaaccagg tcagcttcgg 300 

acaaggttag cagagatgga tagagatctg atggtaggaa agctggaaag agacctgtac 360 

actcaacaga aagtggagat actaacagct cttaggaaac ttggagagaa gctgactgca 420 

gatgatgagg ccttcttgtc agcaaatgca ggtgctatac tcagccagtt tgagaaagtc 480 

tctacagacc ttggctctgg agacaaaatt cttgctctgg caagttttna ggttgaaaaa 540 

acaaaaaaaa tgacatgggt gcagaagctt gtaacattga tcacattctt aatgtaaatg 600 

gtgtctttct tctggggttt cagtatttgc aaagaaantg aagaagaatt ctggaaatgc 660 

cattcaatta accctnagga aaaaagccga ccttanaaat ttaccttant gcnttgnnnn 720 

ttaaaaanaa aaaaaantna aaaaactttn accctttana ccttttgtgg ggnc 774 



208 



WO 99/33982 



PCT/US98/27610 



<210> 685 
<211> 759 
<212> DNA 
<213> Homo sapiens 

<400> 685 

ggntttnnan nctttctaat ncttggcttn agttcttttg caggatccca tcgattcgaa 60 

ttcggcacga gagtacccag agttgcgagg agttttttaa ctgatttagc cnnntggcaa 120 

tcatgagtga atggatgaag aaaggcccct tagaatggca agattacatt tacaaagagg 180 

tccgagtgac agccagtgag aagaatgagt ataaaggatg ggttttaact acagacccag 240 

tctctgccaa tattgtcctt gtgaacttcc ttgaagatgg cagcatgtct gtgaccggaa 300 

ttatgggaca tgctgtgcag actgttgaaa ctatgaatga aggggaccat agagtgaggg 360 

agaagctgat gcatttgttc acgtctggag actgcaaagc atacagccca gaggatctgg 420 

aagagagaaa gaacagccta aagaaatggc ttgagaagaa ccacatcccc atnactgaac 480 

agggagacgc tccaaggact ctctgtgtgg ctggggtcct gactatagac ccaccatatg 540 

gtccagaaaa ttgcagcagc tctaatgaga atattctgtc ncgtgttcaa ggatcttatt 600 

ggaaggacat cttacagctt ccaatgagaa gccaagaagt tgtgaacata ctgattgaaa 660 

aaagacttta ttttaatccc tcattaaaan ggttttaaat gttaaaaaaa aaaaaaaaaa 720 

acttcgagct tttaaactat ngtgagtcga ttcntataa 759 

<210> 686 
<211> 749 
<212> DNA 
<213> Homo sapiens 

<400> 686 

ggnnttnnnn nctttgaaat cccttngctn ctagcncttt ttgcaggatc ccatcgattc 60 

gaattcggca cgagggcaat tagcctcgct taagttgcct tttttacaca ccaaaacttt 120 

ttacatgaag ggctggtttc acatgaatac tatactgaaa tctgtgctct caagatctag 180 

cagtgaccag ggctgcccgg cgggggctct cctggcaagt caggaaggtt tctgttgcta 240 

atataacata gaaacacatt agtgcactgg gcctctctga ggtcagcata tttgtactct 300 

tggaatattt gtttttttct tcagtaacaa cagaaacccc agttgggagt ttaacaaata 360 

actgactacc actcactcat gcatttttat ttccaattaa agcaaagcac tgtgctgtgc 420 

tcagataata atagtttgta agtaaaagtt tttagttttc agtgttcagg ttatagaata 480 

taactgacca taaaaattac ctgcaggtat tttcttttta tgaacttgtt tttaaattac 540 

caagtaatta ctggtgtcat tttgttttat gacagacaca cgtatctaac aaacaaacaa 600 

acagtgacct tctccatggg tcaaggactt ccttacaatt tctnctgagt taacttttgt 660 

gaaaataatc ctaaggtttt ctggcttatt gaggaaattn ctacaaacaa caaaccaaca 720 

acngaagaga agatcatcaa ccactgttt 74 g 

<210> 687 
<211> 760 
<212> DNA 
<213> Homo sapiens 

<400> 687 

ggnnttctaa tgctttctaa taccttggct ctngctcttt ctgcaggatc ccatcgattc 60 

gaattcggca cgaggaaatg tgtatttcag tgacaatttc gtggtctttt tagaggtata 120 

ttccaaaatt tccttgtatt tttaggttat gcaactaata aaaactacct tacattaatt 180 

aattacagtt ttctacacat ggtaatacag gatatgctac tgatttagga agtttttaag 240 

ttcatggtat tctcttgatt ccaacaaagt ttgattttct cttgtattac attttttatt 300 
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tttcaaattg gatgataatt tcttggaaac attttttatg ttttagtaaa cagtattttt 360 

ttgttgtttc aaactgaagt ttactgagag atccatcaaa ttgaacaatc tgttgtaatt 420 

taaaattttg gccacttttt tcagatttta catcattctt gctgaacttc aacttgaaat 480 

tgtntttttt tttctttttg gatgtgaagg tgaacattcc tgatttttng tctgatgtga 540 

aaaagccttg gtattttaca ttttgaaaat tcaaanaagc ttaatataaa agtttgcatt 600 

ctactcanga aaaagcatct tcttggatat gtcttaaaat gtatttctgt cctctataca 660 

naaaagttct taaattgatt tttacagtct ggaatgcttg gatgntttaa aatantaaca 720 

ttttatattt tttaaaagac aaancttata ttnatcctng 760 

<210> 688 
<211> 752 
<212> DNA 
<213> Homo sapiens 



<400> 688 

tgntttctaa tgcttctaat agcttggctc tngttctttc tgcaggatcc catcgattcg 60 

aattcggcac gagacaaaac ctacagatgg agataaaaat tactactgtt attcaacatg 120 

tgttccagaa ccttattttg gggagtaaag tcaattgggc agaggatcct gcccttaagg 180 

aaattgttct gcagcttgag aagaatgttg acatgatgta ataagaattc atttctgaca 24 0 

tattttacat ttctggcaat ctcaactctt atttggaata cttctgtgca tttgtctgtc 300 

caccgtaatt ttagaaaagc atatccataa cgtttacagt tgtagtacag ttgtggttag 360 

ttatttgtag tgggattgaa agtaattttt ttctttttat atttctatat ttagtttgtt 42 0 

tttttgttgt tgttgttttt tgagatggag tctcgctttg ttgcccagac tggagggcag 480 

tggcgcgatc tcggctcact gcaacctctg cctcccgggt tcaagcagtt ctgcctcagc 540 

ctnccaagta gctgtgacta aaggtgcacg ccgccatgcc canctaattt tttggatttt 600 

aagtagaaac cgggtttcac ccgtgttgcc caagctgctc tnaaaactcc tgagctcaag 660 

cagtccaccc gncttngcta ccggantgct aggattcaga cgtaagcccc cgaancctgg 720 

ctagtttgcn ttnttttctn tcattttata ag " 752 

<210> 689 
<211> 806 
<212> DNA 
<213> Homo sapiens 



<400> 689 

gtgntttcta atgcttctaa tngcttggct actcgttctt tntgcaggat cccatcgatt 60 

cgaattcggc acgaggannt ctntgctatn gaacagnggc tggtnnacac tnnggantta 120 

nnnntgnacn ntannnattg nancanntan tactggnnnt ccntaatncn nttaatgtna 180 

cntnttgcaa gnngnnctga tnaaatacac gacaggaggg aaanctantg cgtcataggc 24 0 

acaggcagac ctaccgnnta aggagatnat ntnccnnang gntggctgtt gagnncatgc 300 

aactctggna tgtatttccc tttataggac caccttgtnc atngtggata aagcccctaa 360 

agnaggatgn naaagatgat cngatccaat acgttacnct gacannaaan nntgtnatac 420 

ntcngctgan caatctntcc ancnnntnta atatcgtgna tcacctaggg tgtatgatcn 480 

taggaactct gcncctncan tcnggactgt ccatcacnga ctnntgggct nctactgtac 540 

antangcgna gaanancnnt cannctacan ntaaccagat tggtgctgnn anatggtant 600 

gcnntttnan cncccacgac ncaataaagn ncnnctntnc cccanancct ntnnagggaa 660 

gaaaggaatt ttncatagtg ggctcaatga anggggtacc cttggncttt ntaaaaaacg 720 

ttncatggnn cctaccttaa acctgngtna actnanancn nttngncata angggtctaa 780 

cgnctatang gggnacnnat ttttnc 806 

<210> 690 
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<211> 772 
<212> DNA 
<213> Homo sapiens 

<400> 690 

ntntttgaat ctttgaaata cctttgctat ngttctttnt gcaggatccc atcgattcga 60 

attcggcacg agaggttgct cacctgaagg agcacaggag ggttttccag gccatgtggc 120 

tcagcttcct caagcacaag ctgcccctca gcctctacaa gaaggtgctg ctgattgtgc 180 

atgacgccat cctgccgcag ctggcgcagc ccacgctcat gatcgacttc ctcacccgcg 240 

cctgcgacct cgggggggcc ctcagcctct tggccttgaa cgggctgttc atcttgattc 300 

acaaacacaa cctggagtac cctgacttct accggaagct ctacggcctc ttggacccct 360 

ctgtctttca cgtcaagtac cgcgcccgct tcttccacct ggctgacctc ttcctgtcct 420 

cctcccactn cccgcctacc tggtggccgc cttcgccaag cggctggccc gcctggccct 480 

gacggctccc cctgaggccc tgctcatggt cctgcctttc atctgtaacc tgctgcgccg 540 

gcaccctgcc tgccgggtcc ttgtgcaccg tccacacggg cctgagtttg gacgccgacc 600 

cctacgaccc tggagaggag gacccagccc aagacccggg cctttggaaa acttccctgt 660 

gggaagcttt aagnnccttc nanangccac ttacccaacc ttgaggggnt ccaaangccc 720 

gccanccggt nattaaccaa ggccctggnc aatgcctgaa ggtcaaacaa tn 772 

<210> 691 
<211> 755 
<212> DNA 
<213> Homo sapiens 

<400> 691 

ntgctttcna atctttntaa atgcctttgg cttctcgntc tttctgcagg atcccatcga 60 

ttcgaattcg gcacgagaaa aagtaaagct tttcatgagc acaaatncct tgcattgttt 120 

gatgttactg atattcgtaa aatgaatatt ttttgttttg ttttgtttta tttttttgag 180 

acaagtcttg ctttgttgcc caggctggag tgcaatggca tgatcttggc tcactgcaac 240 

ccctgccttg cgagttcaag tgattcttct gcctcagcct cctgagtagc tgggattaca 300 

ggcgctcacc accacaccca gctaatttct gtatttttag tagacacagg gttttaccat 360 

gttggccagg ctggtctcaa actcctgacc tcaaactcct cacacctgta atctcagcac 420 

tttgggaggc tgaggtggaa ggatcacttg aagccagagt ttgagaccag cctgtgcaac 480 

acagcaagac cccgtctcta caaaaactta aaaaattagc tggctgtggt gttgctcacc 540 

catagttcca gctactcggg aagctgagca ntaagatcac ttgagcccan gaggccnatg 600 

cttncantga actgtgattg tttccantac agnccacctg ggtgacanag taaanaaaan 660 

gaaacattac ataatttggc tagagcataa taaattgatt tctgggttnt gaaattnnag 720 

ttgccataaa aggnntttna atgngcnant tcant 755 

<210> 692 
<211> 748 
<212> DNA 
<213> Homo sapiens 

<400> 692 

tgnttttaat cnttctaatn cttggctctt gttctttttg caggatccct cgattcgaat 60 

tcggcacgag gtccgaagaa aaagactgtg gtggcggaga tgctctctcc aatggcatca 120 

agaaacacag aacaagtttg ccttctccta tgttttccag aaatgacttc agtatctgga 180 

gcatcctcag aaaatgtatt ggaatggaac tatccaagat cacgatgcca gttatattta 24 0 

atgagcctct gagcttccta cagcgcctaa ctgaatacat ggagcatact tacctcatcc 300 

acaaggccag ttcactctct gatcctgtgg aaaggatgca gtgtgtagct gcgtttgctg 360 



211 



WO 99/33982 



PCT/US98/27610 



tatctgctgt tgcttctcag tgggaacgga ctggaaaacc tttcaaccca ctgctgggag 
agacttatga attagtgcga gatgaccttg gatttagact catctccgaa caggtcagcc 
atcacccacc aatcagtgca tttcatgctg aaggattaaa caatgacttc atctttcatg 
gctctatcta tcccaaactg aaattctggg ggaagagtgt agaacagaac ccaaaggaac 
catcaccttg gagctncttg aacacaatga ggcatataca tggacaaatc cacctgctgt 
gtgcataata tcattgnggg taaactgtgg atcgaacagt ntggcaatgt ggaaattnta 
accncagact ggggacaaat ntgtgttg 



420 
480 
540 
600 
660 
720 
748 



<210> 693 
<2il> 881 
<212> DNA 
<213> Homo sapiens 

<400> 693 

tgnnnngtna accagggaaa agctnngttt gaactccttg ggcatgatcc catcgattcg 60 

aattcggcac gaggcggtga cccacgtgtc cttttgattg ccctactgct gtggagacct 120 

cgtgctgacc atctggcagt gntcttcgta ttctctggcc tgtggggcgt ggcaagatgc 180 

ccgtctggca gacacaaaac aatgctctct acggcgttct gtttganaag agcaaggaag 240 

ctgccttcgc caattaccgc ctgtgggagg ccctgggctt cgtcattgcc ttcnggtaca 300 

gcacgttttn gtgcntgcac gtcaagctct acattctgct gggggtccng agcctgacca 360 

tggtggcgta tgggcttgtg gantgcgtgg agtcccaaga accccgaatc anacccnact 420 

ctttcaggac aggtcaanca agtcagagga tgaagaanat tcanacaaan atgtgatanc 480 

cngngaggcc naangaggan naantnataa aagcaccagc cagaagaatt ttcttanaan 540 

atgcctnagg gacatatcan ccggggttct cattacccat cttaancncc anatttngnc 600 

ccattcttga aataagantc nttgnttnaa ttntcaactt ctttttatgg tnatttcnat 660 

ntatctantt antaaaacca caaatntgtt nncnatnacc accanttctt ttaaaccatn 720 

tagnaattca aangntgtgt nnttacnaat ntntaanggg ttattcaaan ttcnaaattt 780 

taaanattnt tatgcantnc ncacaatnta tataanangg tcctnaaaac gngnnccaat 840 

atnncannnc nataatntag nanatntntn nnccntgtan n 881 

<210> 694 
<211> 742 
<212> DNA 
<213> Homo sapiens 

<400> 694 

atngcttggc tctngttctt tctgcaggat cccatcgatt cgaaaattta tagtaatgac 60 

aaatgactta tcagtgttca tcatctgaaa gctaagtggt tcgttcaatc actttttcaa 120 

agttgatagt agattgcatg gtttcatgtt tcctcatatt ggtttattaa ttctatttaa 180 

tcaaggaaaa taacttcaga ttccataaag tttcagttta tttttagttt actactaggt 240 

gagatagcac attacatact tttactatca aatattattt tagcagcttc ccatagtacc 300 

aaatgatttg attccctact ctcatttttt aaagcatata aatatttatg ggcttaaaaa 360 

gggggttttt aaaaactgag gatatcanta ataaattgca gaatattttg caaagctttc 420 

ttttggaaag caaacttttg tgcctgccta tatgcnaagt attttatcag ggacttgaac 480 

aaagacctca ctctttttca cttgtcttat gtcgagagaa aaggttattg gcagncacat 540 

tcctaanact ggggaatggt gtgtnctttt naaatttgaa gataactttt agggtaatta 600 

tggaaactcc tcaaangagg ganaaagtna tttttttcca gacatttttc ctcaattctg 660 

ggtctttcac acactanntt tccatagtnc nagaatttct gnntttttac catttgggct 720 

gtgaaatgtt cacaatntcn ng 742 



<210> 695 
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<211> 745 
<212> DNA 

<213> Homo sapiens 
<400> 695 

tttcaaatng cttggctact tgttcttttt gcagggatcc catcgattcg aattcggcac 60 

gaggctagac gaagtggtga agcccaaaga cttatttttg agctcgctgt aagactgaga 120 

aatcacgtag tccttcctga aaccactaag aggaaaaatg tctgtgacac tgcatacaga 180 

tgtaggtgat attaaaattg aagtcttctg tgagaggaca cccaaaacat gtgagatgga 24 0 

gtctcgctgt gtcccccagg ctggagtaca atggcgcgat ctcggctcac tgcaacctcc 300 

gcctcctggg ttcaagcaag tcttctgcct cagcctcccg agaactggaa gaggaggcaa 360 

cagtatttgg ggcaagaagt ttgaggatga atacagtgaa tatcttaagc acaatgttag 42 0 

aggtgttgta tctatggcta ataatggccc gaacaccaat ggatctcagt tcttcatcac 480 

ctatggcaaa cagccacatt tggacatgaa atacaccgta tttggaaagg taatagatgg 540 

tctggaaact ctagatgagt tggagaaagt tgccagtaaa tgagaaagac ataccgacct 600 

cttaatgatg tacacattaa gggccntaac tattcatgcc aacccatttg ctcagtagct 660 

attgatngan ctggacaaat tactttgncc aaattgctng aacacacttt attggggggt 720 
taccccgntt ttaattatgt canaa — 

<210> 696 
<211> 795 
<212> DNA 

<213> Homo sapiens 
<400> 696 

tttcaaatng cttggctant ngttcttttt gcaggatccc atcgattcga attcggcacg 60 

aggctggcca aagccaaatc tcctaagtcc accgcccagg agggaaccct gaagcctgaa 120 

ggagttacgg aggccaaaca tccagctgca gttcgcctcc aagaaggggt ccatggccct 180 

agtcgagtcc atgtgggctc tggggaccat gactattgtg tccggagcag gaccccccca 240 

aaaaagatgc ctgccctagt cattccagag gtgggctccc gatggaatgt caagcgccat 300 

caggacatca ccatcaaacc tgtcttgtcc ttgggcccag ctgcccctcc gcccccatgc 360 

atanctgcct cccgggagcc gcttgatcac aggactagca gtgagcaggc agatccctca 420 

gcaccctgcc ttgccccatc cagcttgctg tcccctgagg cctnaccctg ccggaatgac 480 

atnaacacta ggactncccc tgaaccctca gccaagcanc ggtcaatgcg ctgttacccg 540 

aaaaagcctg caggtcaagc cagcccctta agccagggct tggcangggc ccgccnaagg 600 

ccgnaacaag accgntctgt naactcttgg gttccaaacc cggaactttg cccgaaagca 660 

tttntttccc ttaattcctt caattcaatc cggnctttcc ttaatttccn ggattcttng 720 

ggtccaaggg tccccttttt tcccccccaa naacaaagaa aaggttgggc ccgaaanggt 780 

cccaaccttn ttnnt 795 

<210> 697 
<211> 734 
<212> DNA 

<213> Homo sapiens 
<400> 697 

ctaatagctt ggctactcgt tctttntgca ggatcccatc 
cccctgtcaa gtcacttacc atgcaaacca caggctctaa 
ccatccaagt catctccatg gctcctgggt cccctggtga 
caatcatcat gctggggttg gtgcgagagg ggccacagac 
ctggggcagc tgcccctcag tgtcagaggg gctcgacccc 



gattcgcagc cctcttccct 60 

gagtttgtcc ccagggacat 120 

gcatggagtc aggaggtcat 180 

ctgaaaccaa atggatctga 24 0 

tccggtctct aaggaagtcc 300 
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caaagagaat gctctgtggg tccctagcat ctgaggagga cgggctcctt cagaactcgg 360 

gctgggtggt ccgagcgact catgatttgc atgggactct ggcaatctgt agccccaatg 42 0 

ccttgatgtc ttcctcatta acactgtcac gtctcaccag gaatacagtg acattaaaag 480 

tgtgatatgg tntagctgtg cccccaccca catttcaact tgaactgtat ctatctccca 54 0 

gaattcccac atgttgtggg anggacccag ggggaggtaa ctgaatcatg gnggctggtc 600 

ttttcccgtg ctattctcgt gaatngtgaa ntttnacgag atctgatggg tttatcaggg 660 

gttttccaat ttttggttct tatttttctn ttgcaatctg catttaagna antgccnttn 720 

ggtctctaac antn 734 

<210> 698 
<211> 728 
<212> DNA 
<213> Homo sapiens 



<400> 698 

ttcnaatngc tnggcttttn gttctctttg caggatccca tcgattcgaa ttcggcacga 60 

ggtttaattt aaacctctca tcttttttta agcactcact gantttgacc gagacagcca 120 

gtcgccgttg aggaatcctc tgttgtcaac atcgagaccc ctggttttcg ggaaacccaa X80 

tggtgatgca gttgattatc agaaacagct gaagcagatg attaaggatt tagccaaaga 240 

aaaagataaa actgagaaag aattgcccaa aatgagccag agagaattta tccagttctg 300 

taaaactctg tacagtatgt tccatgaaga tccagaagaa aatgatttgt atcaagccat 360 

cgccacagtc accacactgc tgctgcagat cggggaggtg gggcagcgag gcagcagctc 420 

tggaagctgc tcccaggagt gtggggagga gctgcgggct tcagctcctt ctcctgagga 480 

ctcggttttt gcagacactg ggaagacgcc ccaggactcc caggcatttc cagaggcggc 540 

agaaggggac tggactgtct cccttgaaca tattttagct tcacttctga ctgaacagtc 600 

attagtcaac tttttttgaa aagccactgg acatgaaatc caaacttgaa aatgccaaga 660 

tcaatcagtn caatctcaaa cttttgaaat gaccncaatc caatctggac ntaagctgag 720 

tacttgtn ~" ~ ~ ~ 728 

<210> 699 
<211> 746 
<212> DNA 
<213> Homo sapiens 



<400> 699 

tttcaaatcn cttggctntt ngttcttttt gcaggatccc atcgattcga attcggcacg 60 

agggaaaaac aacaggtttg agtcctataa agccataatt taactccagt agctgatgtc 120 

agacaagctt gtcctatgtc ctatttgagt ggcagcagcg ccagcccagc aagaaggctg 180 

ggggttgtca aggttgtccc cagaccttgc ttgcagtggt tggagaaccc agggggctgc 240 

cttgggccct ctggccagag ggaagcgggc agctctagcc ctggagattg tggtcacatt 300 

ggggcttgtt taggattgga gggccaggtc acctccccag ccaccctccc ttctctcctc 360 

tggggtcccc actttagggc gactttgccc gagcccacgc atccatccac tcctttagtg 420 

ccttgaatct cattcacaag cagccccctc ccttcccctc cccttctcac tctgttgatg 480 

taatcctncc acccccagtg tccatcctaa gacaggcatc aaaaagaggc cctaacttta 540 

cttnccaaat ggtgcttttt aaaaaacacc atcactacat tangggcaat tttttcacac 600 

cttcctgtct tcagaatgta aaagggtggg ggaattattg tctctggtta aatntgcacn 660 

cccttgactt gtgggggttt tggggcatgt tcanntattt angaatgaat tncaattnga 720 

caaaaggggg tttantnaat tgttnt 746 

<210> 700 
<211> 759 
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<212> DNA 

<213> Homo sapiens 



<400> 700 



gntttgaaat ccctttgctt 
gattcgaatt cggcacgaga 
aagcagagga agagagattt 
ccgtctgcaa gccaagaaga 
gacctccagc ctccagaatt 
tggtattttg ttatggcagc 
aataagaaat tactacagag 
tttacctaag atttacagaa 
atgaaaaatg ttcatataaa 
gctccctgct gttcagatgc 
taagctggta gaagccagaa 
tgtgggtgtt tgacctttta 
nttttggcca naaacccaca 



tnaaatcctt tgctanttgn tctttttgca ggatcccatc 
taagggtggg gccttaattc agtagaattg gtggcctcct 
ttctttctct ctctgccatg tgaagacagt gaggagtcgg 
gcccttatca ggaacagact tggctagcac cttcatcgtg 
gcaagaaaat acatttccgt cgttgaaacc acccagtctg 
ccaggcagac taatacgtga agcctgctct aaatagataa 
ggctctttag aaattgtatt taaaaacaag acaatccata 
tgtatgtcta taaaaggagg gatttctgga ctagatgatg 
ggcaccttca gcttcgagtt gccaacacag gaggaagaat 
tgatatgtgt cctgtgcttt ctggatggcc agtgggatca 
ctttcatcca ctgacttcat attcttncac atnctggaac 
aaaaataaat ttaagcaaat tgaaatgntt tcctttgaga 
tnganatttt ncgtctncc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
759 



<210> 701 
<211> 751 
<212> DNA 

<213> Homo sapiens 
<400> 701 

gcttnnaatt ccnttccaaa gnaaaccctt tgnaaattnc cctttctgnt tggatcccat 60 

ccgattcgaa ttcggcacga gggtaagtca ggtgattgaa tcccggaant nttcattgtc 120 

ttcaagctca caatactatt ttgggacaaa cagttgtcta gtgtttggac tcatgaaccc 180 

tgattcttga gggtggtatt ttactgcttt tgtgatttgg tttcaacata tatagtcttt 240 

tctccggagt taccttaggt cagtggccag tgtttcagcc cctggaaagg gcatgggctg 300 

ccactgaggt tggtcacagg cctctcagct catggtggga gtgggttcag gagttggtaa 360 

gtagggttca gttctgttgt tgccaccgat ggcaacaggg gtttgtaata atccctagtt 420 

gtgtcaatta tgtcacttaa ttttcacaac aggtctctga agtgtttctc atctcatttt 480 

tacagatgag gcctgcctgt gttaatacac ctagtgagga gtggagctga atttgaatgc 540 

aagccttggc accttaattg agcaagtttg aaacctcgct tgttgccctt ctggaaggag 600 

tcangaattt ncagttctgg gcctgggctg tgggtctggc agacagacct ctggccctaa 660 

ggtttgggtn ccangttctc tgcttccaga atgagaagct ttgctgtgca ccaagnanct 720 

gggcccctct ggnatctcnt gaatnaaaan n 751 

<210> 702 
<211> 748 
<212> DNA 
<213> Homo sapiens 

<400> 702 

gntttgaanc ccctttnntt naaatccttt gctacttgnt ctttttgcag gatcccatcg 60 

attcgaattc ggcacgagcc tgaatataaa gaggaggagg aagaccaaga catacaggga 120 

gaaatcagtc atcctgatgg aaaggtggaa aaggtttata agaatgggtg ccgtgttata 180 

ctgtttccca atggaactcg aaaggaagtg agtgcagatg ggaagaccat cactgtcact 240 

ttctttaatg gtgacgtgaa gcaggtcatg ccagaccaaa gagtgatcta ctactatgca 300 

gctgcccaga ccactcacac gacatacccg gagggactgg aagtcttaca tttctcaagt 360 

ggacaaatag aaaaacatta cccagatgga agaaaagaaa tcacgtttcc tgaccagact 42 0 
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gttaaaaact tatttcctga tggacaagaa gaaagcattt tcccagatgg tacaattgtc 480 

agagtacaac gtgatggcaa caaactcata gagtttaata atggccaaag agaactacat 540 

actgcccagt tcaagagacg ggaatcccag atggcactgt taaaaccgta tatgcaaacg 600 

gtcatcaaga aacgaagtac agatccngtc ggataagagt taanggcaag gagggtaatg 660 

tgctaatgga cccgaactgt gacgatcctc atgtgatcat gaagtaccag tactgacttt 720 

ttatgttaaa aaatgtccat ttactgng 748 

<210> 703 
<211> 769 
<212> DNA 
<213> Homo sapiens 



<400> 703 

ggnnntnnna gnntttgaan tccctttnnt tctaatncta ggcttctngt tctttttgca 60 

ggatcccatc gattcgctca gctgaggcaa ttaaactgga aaagaaatag attgaaaaga 120 

tactacagaa gaagcagtac agaagttggg ggactgaagg agagggagcc actgcaggtg 180 

ctagctgctt aaggggatac cagtcctttt acagatataa tagatacagc ttctgaggtg 24 0 

gagggtgata ggagtgtgta gagaaattgc agttcagaac tggagcatgc agttaggcaa 300 

gaggcatccc atgtgaagat gtcaagcaag tactggaaaa tgctgaacta aaactcaggg 360 

atggatatgt agatttagag aacttcattg tagaggcagt cattgaaagc taaaagggct 420 

gataataaaa ttgccaagga tggaaatagt aagagggagt cagtgttatt aggattagaa 480 

ttctgttttg ttttttcttt aaacagattc tcgctctgtc accctggctg gagtgaagtg 540 

gtgtgatctc ggctcactgc ggcctcgacc tcccaggctc aagttatcct cccaactctc 600 

agccttccaa gtagctggga ccacagccat tcaaacacat gcctgcctta tgtttggatt 660 

tttttgtana aaccaaggtt ttgccatgtt tnccaggctg gnctnngaac ttctgggctt 720 

aagccattcc cccacccttg ggtctcccaa aatgctngcc attatangg 769 

<210> 704 
<211> 759 
<212> DNA 
<213> Homo sapiens 



<400> 704 

cnaannncnn ggnttcnaat annaggctac ttgttctttt tgcaggatcc catcgattcg 60 

aattcggcac gagacccgtc cggggccggc caatttgcat atttggaatg cgccgctata 120 

aacccggctg gggttttgca gcgatttctt agatgtaaaa atgagatctc aatagcagcg 180 

ggctgggcac atcctctcct ctctccttct ctctctgccc ggagctggtt tccgtctctc 240 

ggctcggggc tggaactccg gcccaaccta ggcgcgcagc cgccacgaga tggcgcactt 300 

ccgatcaatg tcaaagccgc cggggagccg ggaaccccag catgattctt ggcctttgtt 360 

cgcttctgat actaagagca gcacggtaca ttatttcact tgtcccgctc cccttcataa 420 

cagaaaaagg ggactcaccc tcaagaagtg attggtatgg taatttaaag caacgcgcat 480 

tcgctaggcc tcgcgagcgt cgccgcgcgg agaagccagc tgtcccttgg cagtgatttc 540 

ggaaatgtgt caaggcaatt ccaaaggtga aaacgcagcc aactggctca cggcaaaaga 600 

gtggtcngaa aaaagcgctt gccccttaca cgaagcacca gacactggag ctggaagaan 660 

ggagtttctg ttcaatatgt acccttactc gaaaagcggn gcctagagaa taacccgcan 720 

cgttccacct taacggacag gacaagtgga aaaatcttg 759 

<210> 705 
<211> 777 
<212> DNA 
<213> Homo sapiens 
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<400> 705 

tttgaaatcc cntttnttna aatcctttgc tncttgttct ttttgcagga tcccatcgat 60 

tcgtcctgaa gctcgggggg ctgcaggtcc tgaggaccct ggtgcaggag aagggcacgg 120 

aggtgctcgc cgtgcgcgtg gtcacactgc tctacgacct ggtcacggag aagatgttcg 180 

ccgaggagga ggctgagctg acccaggaga tgtccccaga gaagctgcag cagtatcgcc 240 

aggtacacct cctgccaggc ctgtgggaac agggctggtg cgagatcacg gcccacctcc 3 00 

tggcgctgcc cgagcatgat gcccgtgaga aggtgctgca gacactgggc gtcctcctga 360 

ccacctgccg ggaccgctac cgtcaggacc cccagctcgg caggacactg gccagcctgc 420 

aggctgagta ccaggtgctg gccagcctgg agctgcagga tggtgaggac gagggctact 4 80 

tccaggagct actgggctct gtcaacagct tgctgaagga gctgagatga ggccccacac 540 

cangactgga ctgggatgcc cgctagtgaa gcttgaaggg tgccaaccgt gggttgggct 600 

ttcttaagca tggaggacat ttttggcaat gcttggcttt gggccattta aatgggaaac 660 

cttgaaaggc caaaaaaaaa aaaaaantna tntnaaaaan aaacttnnac cttttaaaac 720 

ttttaantgn ngnccgnttt tacnttanat tccagacttg attaggaatc cattttt 777 

<210> 706 
<211> 760 
<212> DNA 

<213> Homo sapiens 
<400> 706 

gntttgaaat nccnttnntt caaatnctng gctacttgtt ctttttgcag gatcccatcg 60 

attcgaattc ggcacgagna atgcaaaggg ctgcagttct cattcaggct actttcagga 120 

tgcacagaac atatattaca tttcagactt ggaaacatgc ttcaattcta attcagcaac 180 

attatcgaac atatagagct gcaaaattgc aaagagaaaa ttatatcaga caatggcatt 240 

ctgctgtggt tattcaggct gcatataaag gaatgaaagc aagacaactt ttaagggaaa 300 

aacacaaagc ttctattgta atacaaggca cctacagaat gtataggcag tattgtttct 360 

accaaaagct tcagtgggct acaaaaatca tacaagaaaa atatagagca aataaaaaga 420 

aacagaaagt atttcaacac aatgaactta agaaagagac ttgtgttcag gcaggttttc 480 

aggacatgaa cataaaaaaa cagattcagg aacagcacca ggctgccatt attattcaga 540 

agcattgtaa agcctttaaa ataaggaagc attatctcca cattagagca acagtagttt 600 

ctattcaaag aagatacaga aaactaactg cagtgcgtcc ccaacaagtt atttgtatac 660 

agtcttatta cagangcttt aaagttccaa aaggatattc aaaaatatgc caccgggctt 720 

gccacactta attcagncat tctatcnaat gccccagggc 76 0 

<210> 707 
<211> 856 
<212> DNA 

<213> Homo sapiens 



<400> 707 

gttgctttga agcctttgaa atncnttgtt tnaaatnctt ggctttngnt ctntttgcag 60 

gatcccatcg attcgcctcc ctggatgtgc agacatggag gaggacagaa ggcccagctc 120 

agtggccccc gctccccacc ccccacgccc gaacagcagg ggcagagcag tctggaggtg 18 0 

gtgntcccac ttgatgaaga gcaggcgact ggnttggaga gggagatcat gctggctgna 240 

aagaanggac tggacccata caatgtactg gccncaaagg gancttcagg caccagngaa 300 

gacccaaant tantncccta catntccaac aagagaatag naagctgcat ntgtgaanag 360 

gacaatacca gcntcnantg gttttggctn nacaaangcc angnccancn atgcccccnn 420 

tttgnaaccc attacaanct gntgccccan tagctggcac actgancncc tnntctaaat 4 80 

tacttaaaat natgctgtan aagtatantn tttncagaan agactaanca ntncatngnc 540 

tacttctcca aaaaaaantg anaaaaatna taaaantcaa antaaatact aaatnannan 600 
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ataanancan tannaantta tatttcnnan atantanann nancnnttta naannantta 660 

nggnnancan nnattantnn tnnatanntt acattaaant tattnanann anaaannnan 72 0 

nananannat atattannan anantnacnt aaactnnnnt naatnntcca nanacttnaa 780 

naanaataag nnntanatna nnnnttangn ntnatatann ttnanatann nnnnacnata 840 

nnacatnnnn tannga 856 



<210> 708 
<211> 766 
<212> DNA 
<213> Homo sapiens 



<400> 708 

ctaatactgg ctacttgttc tttcnaagcn ctggnntttn annnatnnag ctacttgttc 60 

tttttgcagg acccatcgat tcgcccaaac ttatcggggg tgccagaggc agagtagaca 120 

agccttagtg gccgccattt gttgaatatc tactgtgcgc caagcagtgc gtcacaactt 180 

tatgaagtag gtattattat catccccatt ttacaggtga agaaactgag tctctgagag 240 

accaactttt ccaaggtcac acagaggtgg gatccagccc acttccgtct gaccccaagc 300 

ccctgctgtt aacccctgcc ccattgtggg gaggttccgg cccactctgg agttctctgg 360 

tctgcgtcag tcctcaggag aagaaagaat gggggtgatg ctccaaatat tgaggctccc 420 

atctgtctgt cctgcactag gcagagccag gcttctccat ggggcacagg agagagggca 4 80 

ccagatctga ggagcaaata ggttcttggt ctgagatctc atgggatcag gttgccagcc 54 0 

ctgcaaaccc ccgctcangt ctagaggaca tggagctgcc tttcaaggtg catttgcttc 600 

ctttacagac tcggactctg tnctctggct actttgggcc gtcccggact cgggaatgcg 660 

tnctacactt gtaggggcaa aaccccggtt tgactctttc cgggttccta cccttaacca 720 

agcctttact ttctngggat caccctgttg ggactttttg tccacc 766 

<210> 709 
<211> 743 
<212> DNA 
<213> Homo sapiens 



<400> 709 

gaannccntt nnnttgcaaa tnntnggcta cttgttcttt ttgcaggatc ccatcgattc 60 

gaattcggca cgaggttttt tttttttttt tttggagaat gaatgcaaga tttattgagt 120 

ggtggaagta gctctcagca gatggctggg gagccagaag ggggatagca tgggaaggta 180 

gtcttcctct ggagtctggc tgctcagcag ccgggatctc ctactgtcct tggccgaatt 240 

tcccttggcg tccgaatcgt tccaccatca atggcctgcc agcgtctttc gatgtgttct 300 

tctgccagtg tgttcctctt gacgtccagc cgcttgtgtg tgtgcccgct ggggtctcag 360 

ggtttttata ggcacagaat gggtggcatg gcaggccaga gtggtcttgg aaaatgcaac 420 

atttgggcaa gaagacagga gtccttgttc tcattaggtc catgggcaca agcctgaggg 480 

tggagccctt gccagtgacc ctgcccttct ctacccagca cttccctgtc cccctcccat 540 

atcaccgttg ccatcttgtc cttgatgagg aatacaactc ccaattcagt gnttgcttgt 600 

gggaagatgc aatcctcttt atgacaagtt tctaanaagt tgataagaaa aatggggacc 660 

tgcctaaggg ctagtatctc atttaatact ctatagaata ttatgnggtt ttccctttta 720 

ngttttaaat gttgaananc nan 743 



<210> 710 
<211> 753 
<212> DNA 
<213> Homo sapiens 
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<400> 710 

gnnnnnnnnn nagngtttga antcctcctt ngaaatcctt tggcnactcg ctctttntgc 60 

aggatcccat cgattcgaat tcggcacgag gggcaatgca gttataatac tgtgttaatt 120 

tcagacatct tctggtcctc cgagccttgt atttacatac tagctgaaac tgcaagtgga 180 

aatgaatgga gctgatgata tttgccttat cctaattttt ctgtgaggag gagaaaaaca 240 

cttgtgcttc aaataagcag atgtgaaaac acttctcact aatcaaaatg tttaccacta 300 

ggttatgaga gtctgcctct cataggcagt gaatctgata tgtatactta gtaatataag 36 0 

tctatttagt ttgacaaaac cttagagcag aatttttgca gcttagttca ggatgatcac 420 

tagcaatgcc aaacttcatt ttttattgaa cttggatcca agaaggcctg ctgtgtctat 480 

ttcagtatag actctcatac caatatattt atgctccaag tcactacacc cagaagtgat 54 0 

gcagtggggg aaatgcaaag acaacatcac tgtaagattc acagaatgga tcttttgtaa 600 

aatattttat attgacttaa ggaaaacctt tcattgggaa ttaattaaat taagtctcta 660 

atatcctgga agacagtaaa aantnaagcn ggtgntctca antttgaacc cggcnattng 720 

naatttcatt ataggaattt ctgaaaataa tec 753 

<210> 711 
<211> 718 
<212> DNA 
<213> Homo sapiens 

<400> 711 

naatngctag gctacttgtt etttttgeag gatcccatcg attcgaattc ggcacgagcc 60 

tacttattgg atgttggctc tttggtgtca tggagatggc tttactgtag gtttgttgtg 120 

ttgeattact tttcattggg attgaactga gaaataacaa acaagcttta agtgggaaat 180 

taaaaaaaag aagtaaccta tgtagatcca aacttaaaat gtgagaaatt attgaaattt 240 

cattttctac aaacttgaaa ttagectget aattgtaaag ttgttttaat aatgetgaca 300 

aatgtcagtt acgtttgcaa aggagtgtat ggttctaggt atttgectae tgttaaccgt 360 

tgagaaaaac attgtcaggt tagcaagtct attgaaatag agacctcctt agtttacagc 420 

aaagaataaa tagctgatga ctggagattg ggactaaggt tttatttatt tatattcttt 480 

gaaagaaatc ggacagttaa taagtggttt gtggtagagt tgaaggatgt ctgagagatg 540 

gaaagagagt gacaaaggag gagaaggaat agtatttctt ttttagtatt gntttgaaat 600 

taaaactctg ntattttaat atggtaaaga gcaagaattt gggttgggcc gengtgaetc 660 

aegectataa tcccagcact ttgggaagcc ntggtgggca aatcacctga aattangg 718 

<210> 712 
<211> 783 
<212> DNA 
<213> Homo sapiens 

<400> 712 

agttgaantn ettgetaenn aaaacctttg genactnget ctttntgnag gatcccatcg 60 

attegcaaag atggtcgtat tactaaaggt gaataaccag cgeggnnnge acgtggagtc 120 

actggaacat ttgtgcaatg ctggtgggaa tgtcaacccg tgcggccctc tggaataagc 180 

ctggcagctc ctccaagagt taccgngtga cccancaatt ccactcctag ctccacccac 240 

aggaattgaa agcaaanacg caaacagatg cctgtncacc aaagttcacg geagcatnet 300 

tcgncatagt ggcagcatcc gtcgtcacag cggcatcatc cttcatcata geggcagcat 360 

ccgtcgtcac aageggcage atccttcgcc acagnggcan gcatctgtcg teacaneggn 420 

agcatccttc gaeaaagegg cagcatnett cgtnatagcn gcagcatcct ttgecatanc 480 

eggcaaggtg gaaaccctgt ccatccactg aggegtgeat agactaaaca tgggcagtcc 54 0 

agcactggaa ttccaagccg tacaaeggng nccacngtca aaaangaatg aggaccctga 600 

ngcacctgng cnganaacaa gaaenngega nnccaanact tttnagacat tattgectta 660 
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agtngaaaaa cccagngcac caacgggaaa ccngaccgnc ntgnanccct gnttaacntt 720 
nantnngttn cccgaaaatg ggggcacntt nccaaaaagg ggaataaaag gggagaattn 780 



cct 



<210> 713 
<211> 765 
<212> DNA 

<213> Homo sapiens 



783 



<400> 713 

gttgaantcc ttcctttcaa atngcttggc tactcgntct ntntgcagga tcccatcgat 60 

tcgaattcgg cacgagccca catgtaccag gttgagtttg aagatggatc ccagatagca 120 

atgaagagag aggacatcta cactttagat gaagagttac ccaagagagt gaaagctcga 180 

ttttccacag cctctgacat gcgatttgaa gacacgtttt atggagcaga cattatccaa 240 

ggggagagaa agagacaaag agtgctgagc tccaggttta agaatgaata tgtggccgac 300 

cctgtatacc gcactttttt gaagagctct ttccagaaga agtgccagaa gagacagtag 360 

tctgcataca tcgctgcagg ccacagagca gcttgggttg gaagagagaa gatgaaggga 420 

catccttggg gctgtgccgt gagttttgct ggcatangtg acagggtgtg tctctgacag 480 

tggtaaatcg ggtttccaga gtttggtcac caaaaataca aaatacaccc aatgaattgg 540 

acgcagcaat ctgaaatcat ctctagtctt gctttccttg tgagcagttg tctttctatg 600 

atccccaaag aagtttttct aaagtnaaaa ggaaaattcc tagtggaatt cancccccaa 660 

gggaaaaaag cccacttgnc cacannagga agccnggntn ccccttngtt ccggcttaan 720 

ggccccttgt tcaggaaacc acactggggg ancttntttt ttttn 765 

<210> 714 
<211> 740 
<212> DNA 

<213> Homo sapiens 



<400> 714 

gtttgaannc cttngntttc naatgctngg ctacttgttc tttntgcagg atcccatcga 60 

ttcgccaaaa gcttgtggca aatttgaaat ttctgccatt agggacctta caactggcta 120 

tgatgatagc caacctgata aaaaagctgt tcttcccact agtaaaagca gccaaatgat 180 

caccttcacc tttgctaatg gaggcgtggc caccatgcgc accagtggga cagagcccaa 240 

aatcaagtac tatgcagagc tgtgtgcccc acctgggaac agtgatcctg agcagctgaa 300 

gaaggaactg aatgaactgg tcagtgctat tgaagaacat tttttccagc cacagaagta 360 

caatctgcag ccaaaagcag actaaaatag . tccagccttg ggtatacttg catttaccta 420 

caattaagct gggtttaact tgttaagcaa tatttttaag ggccaaatga ttcaaaacat 480 

cacaggtatt tatgtgtttt acaaagacct acattcctca ttgtttcatg tttgaccttt 540 

aaggtgaaaa aagaaaatgg ccaaacccaa caaactaaca ttcctactaa aaagttgagc 600 

ttggacatat tttgaatttt tgtaagtgaa agatttttaa actgactaac ttaaaaaaat 660 

agattgtaat tgatgtgcct taatttgcat aaatcataaa tgtatgtcct ctctgtaatt 720 

ggtttaatgt gtgcttgaan 740 

<210> 715 
<211> 708 
<212> DNA 

<213> Homo sapiens 
<400> 715 

tttgcaaatn gcttggctac ttgttctttt tgcaggatcc catcgattcg aattcggcac 60 
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gagggaggct agactcaagc tgtctggaga gtgtgaaaca aaagtgtgtg aagagttgta 120 

actgtgtgac tgagcttgat ggccaagttg aaaatcttca tttggatctg tgctgccttg 180 

ctggtaacca ggaagacctt agtaaggact ctctaggtcc taccaaatca agcaaaattg 240 

aaggagctgg taccagtatc tcagagcctc cgtctcctat cagtccgtat gcttcagaaa 300 

gctgtggaac gctacctctt cctttgagac cttgtggaga agggtctgaa atggtaggca 360 

aagagaatag ttccccagag aataaaaact ggttgttggc catggcagcc aaacggaagg 420 

ctgagaatcc atctccacga agtccgtcat cccagacacc caattccagg agacagagcg 480 

gaaagacatt gccaagcccg gtcaccatca cgcccagctc catgaggaaa atctgcacat 540 

acttccatag aaagtcccag gaggacttct gtggtcctga cactcaacag aattatagat 600 

tctaatctga tgagttactg agctttggtc ccttaaaaca agctgacttg gtccctaaac 660 

cagatgaaaa tccagatgct ctatacttgg ctttaagaac tgctttcn 708 

<210> 716 
<211> 730 
<212> DNA 
<213> Homo sapiens 

<400> 716 

ttgcaaatng ctnggctact tgttcttttt gcaggatccc atcgattcgc tcccatggag 60 

gtggtgggaa tggcaccgag aagtttgatg acagttatct aatggactag aggttggcaa 120 

actttctgta aatggccagg tagtaaatag ttctgctttt gaaggcatat ggtctcttgc 180 

acctactcga ggctgaaagc agctatagac aatacataaa tgaatgagcg tgagtgtgtt 240 

ccaataagaa aaaaacatgg ctgtttgctt cggccccagg gttgtagctt accagtcctg 300 

taacagatca cagtttgctc ttttggtcac aaatacttga acccctccct agttcagagc 360 

atttgatacc gtaatattta aagctcactt gtaaaacatc gtttgttgcc tccatccata 420 

gtatctcaaa cagaatgtct ctcccaaata tacctaaatt ccatattctc tgaagcacaa 480 

ccagctattt tcttgacata cttcctaaca caccccacag ttcacaattt gatctgaaaa 540 

cttgttaagg gaggttcttt ggcatgtgat gccataaaaa gagaggtatg ggctctcctt 600 

taaaaaagag acccttttta tgagactcac aataggataa aagagcccat gcctattttt 660 

aaacattttt ttcactatat aagacatgca tgcctgnaaa atggttttta attagtatna 720 

ntgcttaatn 730 

<210> 717 
<211> 728 
<212> DNA 
<213> Homo sapiens 

<400> 717 

naatngctng gctcttgttc tttttgcagg atccctcgat tcgctgcagt gagattctct 60 

gcaatgactg gcctcagcaa gggggcagct taggaccctg acatcccagg tcactaagcc 120 

acataggata agtaatgggt ggacagaagc gggaaaggag aagggcaggg cacatgttta 180 

aaacttgaac tttctgaggc taagactgga aaaggaatgg tttcagctga tatatttgga 240 

taccagttga ctatttttag gaaaaaaaca caaatggctt ttaaacatca cagtgtgata 300 

cagtctaact cagaattaga gacaggcaaa acagaactcc atcttaaaaa ataaataaat 360 

aaaataaaat aaatgacatc actttggttc agagctctaa aatggaggga ggaagccatt 420 

ctaaaaagga ctccctacat gacctgcaac ttgaaaaaaa attaaaagct ccaaaaaaaa 480 

caatncagga gcttaccttg aaccttttga attgggccaa attgccgatg accactgcat 54 0 

cctggaaaat tttatttcac cagcactaca acttctcaac agcaccaacc aatttaacta 600 

tggatttttg tactaanccc agttgcctct ttnaaaacaa cttgtcaact ttgtctaatc 660 

accctcagct tttttttaaa aacccctnct ctacccctnt ctcttcagaa caccaaagtg 720 

gncttttn " ~ 72 o 
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<210> 718 
<211> 730 
<212> DNA 

<213> Homo sapiens 



<400> 718 

gaantccttn nntttnaaat cnttggctac ttgttctttt tgcaggatcc catcgattcg 6 0 

aattcggcac gatctagata ttgcccaatc gctgcccaca gtgcacatac ctttccacca 120 

gtcacatgtg agagggcaga ttttccaaat gctcatcacc acttggcact gtgtggacta 180 

taattttggc cagttaggaa atggcatctc attgttttca tcttaatttg cgtcagcctg 240 

attactcatt gaaacttgtg aggttgagaa acttttctta agcttattgg ccattcaagt 300 

ttcctccttt atgaaatggt tgttcatgtc atttgctcat ttttatatta gattgttttt 360 

cttttttcca gctgacttgt aggaactcta catcttatca atattaatca tttatcgaaa 420 

actatttggg tgccattatc ttctcctagt caatgttttt tgtttgtgat atcttttata 480 

atatataagt ttttaatgtt ggcagaagta aagttaatct ttttggctgt gttgtgtgtc 540 

ttgtttgatg taaagatagt ttctgtaata gttttgcagt ttgattggtc atctttaggt 600 

cttcaattac aacctgcaca ttcatccctc tatcctcttt cttactctgg ttttctccat 660 

agcacttatc atccaataat atggcatgca cttatttaat ctggtttgca tatatatttt 720 

ngctggtacg 730 

<210> 719 
<211> 733 
<212> DNA 
<213> Homo sapiens 



<400> 719 

ttcaaatcgc ttggctactt gttctttntg caggatccct cgattcgctt cagtgcacac 60 

aacaggagag aggagaaaga agaaacgcta gtaattccaa gcactggaat taagttgcct 120 

tcatcagtgt ttgcttcaga gtttgaggaa gatgttggat tgttaaataa agcagctcca 180 

gtttcaggac ctcgactgga ttttgatcct gacattgttg cagctcttga tgatgatttt 240 

gactttgatg atccagataa ttctgcttga ggatgacttt attcttcagg ccaataaggc 300 

aacaggagag gaagagggaa tggatataca gaaatctgag aatgaagatg acagcgagtg 360 

ggaagatgtg gatgatgaga agggagatag caatgatgac tatgactctg caggcctatt 420 

gtcagatgaa gactgtatgt ctgtgcccgg aaaaactcac agagctatag cagatcactt 480 

gttctggagt gaggaaacaa agagtcgctt cacggagtat tcgatgactt nctcagtcat 54 0 

gaggagaaat gaacagcttg accctacatg atgagangtt tgagaaagtt ttatgagcca 600 

tattgatgat gatgaaattg ggagctctgg ataatgccag aatttggaaa ggttctattc 660 

aagtgggaca gcaattcgct ttcnaggaag ttttgaatga ctactattaa agagaangcc 720 

caanaattnt ntt 733 

<210> 720 
<211> 740 
<212> DNA 
<213> Homo sapiens 



<400> 720 

agttnnnttn ntnctnttca aatccttggc tacttgntct ttttgcagga tcccatcgat 60 

tcgaattcgg cacgagaaga gaaggaccta gagattgaga ggcttaagac gaagcaaaaa 120 

gaactggagg ccaagatgtt ggcccagaag gctgaggaaa aggagaacca ttgtcccaca 180 

atgctccggc ccctttcaca tcgcacagtc acaggggcaa agcccctgaa aaaggctgtg 240 

gtgatgcccc tacagctaat tcaggagcag gcagcatccc caaatgccga gatccacatc 300 



222 



WO 99/33982 



PCT/US98/27610 



ctgaagaata aaggccggaa gagaaagctg gagtccctgg atgccctaga gcctgaggag 
aaggctgagg actgctggga gctacagatc agcccggagc tactggctca tgggcgccaa 
aaaatactgg atctgctgaa cgaaggctca gcccgagatc tccgcagtct tcaacgcatt 
ggcccgaaga aggcccagct aatcgtgggc tggcgggagc ttcacggncc cttcaccagg 
tggaggacct ggaacgcntg gagggcataa cngggaaaca gatggagtcc tttctgaagg 
caaacattct gggtctcggc ggccgccanc gctntggcgc cttctgaccg tcgctnctac 
ttncgncttt tcaaattttt ggnataaccc ccgtgtttgn gtaaaatcca gtttttgttc 
cgntaaaaaa aaaaaaaaat 



360 
420 
480 
540 
600 
660 
720 
740 



<210> 721 
<211> 736 
<212> DNA 
<213> Homo sapiens 

<400> 721 

nnttnnnttt tnnaaatccc ttggctactt gttctttttg cagggatccc atcgattcgc 60 

atgagtgata ttttggtctg ggtttcctct taagatttta gtttgtctga attaaggaaa 120 

aatgttttta atatacattc ttattttgtc ccacccctcc agaaataagc tggaaatctt 180 

aactttttgg ggggtctttt ttggtgtttt aatgggccca gaactgtggt ttaaattttt 240 

atgtatgtat tttctttttt gtggagtata aatttaaaaa ctggatttgg gacctaaaat 300 

actcctcagg ttgatgtatt catgaagttt taaaacatct ttagttttca aagtaaactg 360 

gatatgtgga ccttaaagtt attgagttta agctacaaat tgtaacgtca ttactggaca 420 

tgtcagcatc aaccctctca aaatagcttg gtcactttat gaaggggcgt tttaaagttg 480 

ttgtttagca gtgacattta atatggtcca attgcttttc tttttaacgt gacaaaaaga 540 

gaataaggaa caaacactat tgctgccgaa tgccataaca ctgagttgtc aaattgtgat 600 

tgaggaaatg aaaaggttta tactttttaa aaaaaaaaaa cnnaanccaa aaaaccaaaa 660 

cttcaaatgg aataaattat tcatgaagcc cttaaaaaaa aaaaaaaaaa aactcgaacc 720 

tntaaaactn tngngg 736 

<210> 722 
<211> 751 
<212> DNA 
<213> Homo sapiens 

<400> 722 

attnccttgg cttttcaaat ccttggctac tngttctttn tgcaggatcc catcgattcg 60 

aattcggcac gagattatag agattaatct cctttgctcg aagtctattt aaatattagt 120 

cacatctaaa acatactttt acagcaacat ctagactggt gtttgaccaa acaactgggc 180 

atcatagctg acacataaaa ttaaccatca caaccatgtt ctaggcactg ttcctcactg 240 

cctgagaaga caccgttatg tttattaggg tttttgagtt ttatccacag cttttggtta 300 

tctgcaacca tgtctcccac cattaacata gttcacactg agatgaggat tccctattta 360 

acacttggtc ccaacttctt cacagtccat ctggttttgt agagggaaca taactggaca 420 

ttctggtcag gttaggtgag gtcaggcctt caggacgcta ttttcactga gttgctttat 480 

aaggcacatt atgcaaaatt ccatcagctc ttctgttcac tacattcact gttgaaattc 540 

taagagtgag actgctgtct cacaccaaag ccagtgggta ctatcttcag taggcacgca 600 

gcatcatgtt tgtatttgat ccanctagat gacatgtaag agaaaacttt attgnggact 660 

ctgtaaagtg tgacattcgt ttgtgactca atttgctcat gtatttggtc ctggggagtc 720 

attacatagc taactttcag ctgctttcaa t 751 



<210> 723 
<211> 749 
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<212> DNA 

<213> Homo sapiens 

<400> 723 

tttaatncct ttcnntaatc cttngtttcn ngcnctttnt gcaggatccc atcgattcga 60 

tgctagccaa agcctgctgc cagctccata gcctggacct acagcactcc atggtggagt 120 

ccacagctgt ggtgagcttc ttggaggagg cagggtcccg aatgcgcaag ttgtggctga 180 

cctacagctc ccagacgaca gccatcctgg gcgcactgct gggcagctgc tgcccccagc 240 

tccaggtcct ggaggtgagc accggcatca accgtaatag cattcccctt cagctgcctg 300 

tcgaggctct gcanaaaggc tgccctcagc tccagcctgg accttgcccc caggtgctgc 360 

ggctgttgaa cctgatgtgg ctgcccaagc ctccgggacg aggggtggct cccggaccag 420 

gcttcctagc ctagaggagc tctgcctgnc gagctcaacc tgcaactttg tgagcaacga 480 

ggtcctnggc cgnctactcc acggctctcc caacctgcgc ttactggatc ttcgtggctg 540 

tgcncgcatc acgccggctg gccttcagga tctgccatgt cgggagctgg agcagcttca 6 00 

tctgggcctg tatggcacgt cagaccggct gacttttacc aangagggca agnccctttt 660 

gaccagaant ggtgcataca ctgcgaagaa ctggactttg aatggccaag ggttcaattg 720 

agaaagacct ggaacangcc cttgctnct 749 

<210> 724 
<211> 761 
<212> DNA 
<213> Homo sapiens 

<400> 724 

ttnnnnccct ttttaatncc ttctactaat ccttggctct cgntctttct gcaggatccc 60 

atcgattcga attcggcacg agcctcagcc ttctaaaaag ctggggctac acccagctga 120 

agaaattgta actaaagata gattgtttaa agcaaagcaa gaaacttctg aagaaatgga 18 0 

acaaagtgga gaagcctcag gaaagcccaa cagagagtgt gcaccccaga ttccttgtag 240 

tactcctatt gctactgaaa ggacagttgc acatttgaac actctgaagg accgtcaccc 300 

aggtgatttg tgggcccgca tgcacatctc atccctggaa tatgctgcan gagacattac 360 

ccgaaaaggg agaaaaaaag acaaagctcg agtgagtgaa ctgctccaag gcctctcatt 42 0 

ctctggtgac tcagatgtgg aaaaagataa tgagcctgag atccagcctg ctcaaaagaa 48 0 

gttaaaggta tcatgtttcc cagaaaagag ttggaccaaa agagacatta aacccaattt 54 0 

tccaagctgg tcagcactgg attctggact tttgaatctc aagagcgaaa agtttgaacc 600 

cagtagagct ttttgaatta ttttttgatg atgaaacatt caacttaatt gtcaatgaaa 660 

ccnataatta tgcttctcag aaaaatgtca gctttggaag tccagttcag gaaaaaaaan 720 

nnnnannaaa aaactcgagc ctntanaact atngtgagtc c 761 

<210> 725 
<211> 760 
<212> DNA 
<213> Homo sapiens 

<400> 725 

tttcnccccn tttttanccc cttnctctaa tccttggntc tngttctttt tgcaggatcc 60 

catcgattcg aattcggcac gaggcggact ctcaggacga aaagagtcaa acctttttgg 120 

gaanttcaga ggaagtaact ggaaagcaag aagatcatgg tataaaggag aaaggggtcc 180 

cagtcagcgg gcaggaggcg aaagagccag agagttggga tgggggcagg ctgggggcag 240 

tgggaagagc gaggagcagg gaagaggaga atgagcatca tgggccttca atgcccgctc 300 

tgatagcccc tgaggactct cctcactgtg acctgtttcc aggtgcctca tatctcgtga 360 

ctcagattcc cgggactcag acagagtcca gggctgagga actgtccccc gcagctctgt 420 
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ctcccttgct agagcccatc agatgctctc accagcccat ttctctactg ggctcctttt 480 

tgactgagga gtcacctgac aaggaaaaac ttctatcagt actttgatat gtcacagttt 540 

catgtttatc cagttcaatg tatttttaaa tttttccttg agacttcttt gactgataga 600 

ttattgtgaa gtgtgttttt aaatttncaa atgtttangg attttcatat ctttcttatg 660 

ctgatttcca attggattcc ttacaatgat ttttgggttt catctgctct tggatgatta 720 

ctatctcttt taaatttggt gtggccaagt tttagggccn . 760 

<210> 726 
<211> 741 
<212> DNA 
<213> Homo sapiens 



<400> 726 

ttntgccctt tgtntnatcc ttgntcttgc ctttttgcag gatcccatcg attcgaattc 60 

ggcacgagac aagttctatt gagtgctatt cagaatagga acaaggttct aatagaaaaa 120 

gatggcaatt tgaagtagct ataaaattag actaatctac attgcttttc tcctgcagag 180 

tctaatacct tttatgcttt gataattagc agtttgtcta cttggtcact aggaatgaaa 240 

ctacatggta ataggcttaa caggtgtaat agcccactta ctcctgaatc tttaagcatt 300 

tgtgcatttg aaaaatgctt ttcgcgatct tcctgctggg attacaggca tgagccactg 360 

tgcctgacct cccatatgta aaagtgtcta aaggtttttt ttggttataa aaggaaaatt 420 

tttgcttaag tttgaaggat aggtaaaatt aaaggacatg ctttctgttt gtgtgatggt 480 

ttttaaaatt tttttttaag atggagttct tgttgcccag gctagaatgc aatggcaaaa 540 

tctcactgca atctcctcct catgggttca agcaattctc ctacttcagc ctcccaagta 600 

gctgggatta caggcatgtg ctaatttggt gtttttaata gagatgaggg ttttccatgt 660 

tggtcangct ggtctcaaac tcctgcctta ngtgatcgcc tcggcctnct aaagtgctgg 720 

aattcaggca tgaancncca t 741 

<210> 727 
<211> 751 
<212> DNA 

<213> Homo sapiens 



<400> 727 

ccttcttccn aangctntgt tgaacncctt tcnnnatcgc gcttgcgctt tgagctagga 60 

taaaaattgg gtaaagggac atttgcttac ctgnntnatg aatcactntt tgaaatgtga 120 

tcttgccata tcatcaagaa acttgttttc tggatgaata ctgggagaat aaaatgagaa 180 

ctctggagtg agctaaattg atcccaatna agtttttctg cttagcagac agaaggtata 240 

attntttgac accctttccc acctggtgcc tatgctaggc ttgtcctgan aacatncctc 300 

agtaacttga tattcacatg acctacagga tgtcccatct gcagggctga gtcagttggg 360 

gaacaccaga ggctacacag tagctattcc tgctactcgg ttaatgagct tggcaggttc 420 

tttgtctcac tgaattctta tcatggaaac agcagcagca gccgctagga aatcttcaag 480 

tgtagnggcc tgtgctaacc cagtggtaaa tcccttagat cccctgctgg tctctggcaa 540 

aactccttga tnttgggtac catgtatant ttgcctttga cntttaacgc tttctacgat 600 

anggtaanca cncntttaat ttangcnctg gancattaac tttctttgca aaggctactt 660 

atngccngnc acaantgcag cctcggacan ancnnangnn atatcctgtt ggccatggct 720 

ntgatgtttg acanccgata ngccttctnc g " " 751 

<210> 728 
<211> 765 
<212> DNA 
<213> Homo sapiens 
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<400> 728 



tngnntttnt ttaacnttgt 
cctgcagatt gcagagcggc 
ggacatgctg gagcagatgg 
cctgaactct ggaggagaag 
tacctgtcac aatganatta 
nccttcttnt tcctncacct 
gtnccncnnt gttcagtccg 
acacactnac agagangcca 
atgggcgtng tntggccaca 
aagccaggtt nnttnataga 
aacttgtgng aacantttaa 
natnnagggc taaacaagct 
ttctnaaaat nttcaatntn 



ttgacgcctt tctgcaggat ccctcgattc gcactggcta 
gagagcccat aggcagcatg tcatccatgg aagtgaacgt 
acctgatgga catatcggac cangaggccc tggacgtctt 
agaacactgt gctgtccccc gccttagggc ctgaatccag 
ccctccaggt tccaaatncc tcagaattaa gagccaancc 
gcaccgactn ggncacccng nacatcanng agggtgggga 
atgaggagga anttcangtg gacactgncc tgnccacatn 
ctcnngatgg tgntnangac agcaactntt aaattgggac 
ctggaatcca untttggctg tatgcggaat ttcacctgcn 
cgttcttgat tattacataa ttgccaatca tgtggtgagn 
caattaantg tgaagaccgt acaangaatt agttaaangc 
attacttntg annnaantta angnatntaa nntttnctgn 
nngggaacan ttgtaanttt nncnt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
765 



<210> 729 
<211> 743 
<212> DNA 
<213> Homo sapiens 

<400> 729 

tannnnttnc tntannnttt ctgangccct tntgcaggat cccatcgatt cgaattcggc 60 

acgaggagat ctctgggatg tcagtgaggc tggttgaaga ccagaggtaa actgcagagg 120 

tcaccacccc caccatgtcc caggtgatgt ccagcccact gctggcagga ggccatgctg 180 

tcagcttggc gccttgtgat gagcccagga ggaccctgca cccagcaccc agccccagcc 240 

tgccacccca gtgttcttac tacaccacgg aaggctgggg agcccaggcc ctgatggccc 300 

ccgtgccctg catggggccc cctggccgac tccagcaagc cccacaggtg gaggccaaag 360 

ccacctgctt cctgccgtcc cctggtgaga aggccttggg gaccccagag gaccttgact 420 

cctacattga cttctcactg gagagcctca atcagatgat cctggaactg gaccccacct 480 

tccaactgct tcccccangg actgggggct cccangctga nctggcccag agcaccatgt 540 

caatgagaaa gaaggaggaa tctgaacctt gggtaaggat ttggggcaca gtaccaggaa 600 

gggggcttgg tgccagacct tatgaggaag aaggattttc ctatgtacag agaangggac 660 

cctgtnctgt tgggaagtgc ttgtgcaaac ctaaccaagt tactaacccc tctgntttct 720 

gtgctacaca aaggggataa att 743 

<210> 730 
<211> 744 
<212> DNA 
<213> Homo sapiens 

<400> 730 

tttnttcctt cctctaatcc ttttancgcc tttctgcagg atcccatcga ttcgaattcg 60 

gcacgagggg tcctccaaga gtttggggcg cggacnnnag taccttgcgt gcagttatgt 120 

cggcgtntgt agtgtntgtc atttcgcggt tcttacaaca gtacttgagc tccactccgc 18 0 

agcgtctgaa gttgctggac gcgtacctgc tgtatatact gctgaccggg gcgctgcagc 24 0 

acggttactg tctcctcgtg gggaccttcc ccttcaactn ttttctctng ggcttnatct 300 

cttgtgtggn tgagtttnat cctagcggtt tgcctgataa tacngatcaa cccacngaac 360 

aaagcngatt tccaaggcnt ctgcccagag cnagcctttg ntgannttct ctttgccagc 420 

accatcctgc accttgttgt natnancnta ggtgnctgaa tcattctcan ttncntaatt 480 

gangagtang anactaaaag aatgttgact ctttgaatct gctggataag agactngaga 540 

tggcagctta ttggacacat ggattttctt cngatntgca cttactgcta gctntgctan 600 
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ctatgcagga gaaaagccca tagttactgc gtgtnacaac aactntctaa cnaacattca 
ttaatccann ngannccttt caangaatgg taancctatg ccnttcaana tactgaactt 
nntgccactt ntggcaaaaa aaat 



660 
720 
744 



<210> 731 
<211> 746 
<212> DNA 
<213> Homo sapiens 

<400> 731 

cttattccct ttgnaactna ctctttntca tccctttgtg caggatccca tcgattcgaa 60 

ttcggcacga gtgtccttat ctgaaattca gcgatcttnt tgaataagca tttctctgat 120 

tgtggtatat gcctttaatt ttatttctag agtgacaaat ttttggtttt gacagttttt 180 

ttctagcttt atagtttctt cttggggaga gaatatgtca acctcactcc atcatgctga 240 

agtaaatctt catctcttaa ttttatctct caaaaatatc ctaaggattc cctctggagc 300 

ctgataagta attgcagtat ctggtttcta tggttggatg attcaggatt ccaggaataa 360 

tagttacttt ttagacctct aaagaagaag taacaaccac gtaaatgaaa agatgcttct 420 

taaatcatgg agaatcaggg cttagtatca ctgtattttc aaactgtttc agccttactt 4 80 

tataactgat ttagtatatt tttcttttaa tttcagactt cagtgaagtt ccttatgact 540 

tcccctgaaa ttgcttcctt atcatggggg caaatgaaag taaaaggctc taatacaacc 600 

tataaggact gcaaagtatg gccagggggt agtcngactt gggattggag agaaacagga 660 

actgagcatt ctcctggtgt gcacctgcag atgtgaagga agttgttgag aanggtgtcc 720 

agactcttgt gattggncna nggata 746 

<210> 732 
<211> 756 
<212> DNA 
<213> Homo sapiens 

<400> 732 

ttnnnnncnn nnatcctttn gatttnattc ctntntcang tcctttgtgc aggatcccat 60 

cgattcgaat tcggcacgag gtggcccata agttttacct tttaaacatc cggctgcctg 120 

tgaatgagaa gaagaaaatc aatgtgggaa ttggggagat aaaggatatc cggttggtgg 180 

ggatccacca aaatggaggc ttcaccaagg tgtggtttgc catgaagacc ttccttacgc 240 

ccagcatctt catcattatg gtgtggtatt ggaggaggat caccatgatg tcccgacccc 300 

cagtgcttct ggaaaaagtc atctttgccc ttgggatttc catgaccttt atcaatatcc 360 

cagtggaatg gttttccatc gggtttgact ggacctggat gctgctgttt ggtgacatcc 420 

gacagggcat cttctatgcg atgcttctgt ccttctggat catcttctgt ggcgagcaca 480 

tgatggatca ncacgagcgg aaccacatcg canggtattg gaagcaagtc ggacccattg 540 

ccgntggctc cttctgcctc ttcatatttg acatgtgtga gaaaggggta caactnacga 600 

atcccttcta cagtatctgg actacagaca ttggaacana gctggccatg gncttcatca 660 

tcgtggctgg aatctgcctc tgcctctact tcctgtttct atgcttnatg gnatttcaag 720 

tgtttcngac atcantggga agcaatccac ctgccn 756 

<210> 733 
<211> 742 
<212> DNA 
<213> Homo sapiens 

<400> 733 

cntatccttt nntttattcc ttnataagnc cttnngcagg atccatcgat tcgaattcgg 60 
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cacgagctca cacctgcttt ggatgcttca agcacctcag ccctctgaac tacaaaacag 
aagagcctgc aagtgacaaa ggaagtgagg cagaggccca catgccccca ccgttcacac 
cctacgtgcc tcggattctg aacggcttgg cctcggagag gacagcactg tctccgcagc 
agcagcagca gcagacctat ggtgccatcc acaacatcag cgggactatc cctggacagt 
gcttggcgca gagcgccacg ggcagtgtgg ctgctgcccc ccaggaggcc tgaggctggg 
tctcactgct ctgaaaagac acaaccagaa tggcctgggg ctcaggccct tggctgagtg 
ggaatgcgtt gggactgccc agctgagcta tcaggtgccc atcttttctg gtcccagcag 
tggtgaggag agcacaggca ggcctcgccc ctcccttgct cacccagttt cccctncggc 
acaagcttcc agctctgcag ctggggtgac atccccagtg gtttgtcgcc aagacatgtg 
gtggactttt cgccccccaa actgatgagt nccggagaat atatggagag agagatgtaa 
aaaaaaaaaa nnnnnnnnnt nntimnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnna annnnananc tc 

<210> 734 
<211> 749 
<212> DNA 
<213> Homo sapiens 

<400> 734 

nntanaatcc ntttnnctnt aatccctcta ncaaatccct tgggcaggat cccatcgatt 60 

cgaaaattta tagtaatgac aaatgactta tcagtgttca tcatctgaaa gctaagtggt 120 

tcgttcaatc actttttcaa agttgatagt agattgcatg gtttcatgtt tcctcatatt 180 

ggtttattaa ttctatttaa tcaaggaaaa taacttcaga ttccataaag tttcagttta 240 

tttttagttt actactaggt gagatagcac attacatact tttactatca aatattattt 300 

tagcagcttc ccatagtacc aaatgatttg attccctact ctcatttttt aaagcatata 360 

aatatttatg ggcttaaaaa gggggttttt aaaaactgag gatatcagta ataaattgca 420 

gaatattttg caaagctttc ttttggaaag caaacttttg tgcctgccta tatgcaaagt 480 

attttatcag ggacttgaac aaagacctca ctctttttca cttgtcttat gtcgagagaa 540 

aagggtattg gcagccacat tcctaagact ggggaatggt gtgtcccttt aaatttgaag 600 

ataactttan gtaattatng gaactcctca aagaggagaa agtaattttt tncagacatt 660 

ttctcaatct gggnctttca cacactantt tncatagtcg agaatctggt tttacccatt 720 

gggctgngaa tgtccaatat cagtcctgg 749 

<210> 735 
<211> 770 
<212> DNA 
<213> Homo sapiens 

<400> 735 

gngntnngnn gttnnnttnt tttnaatnta atccttgtnt naantccttt tgcaggatcc 60 

catcgattcg aattcggcac gagggtgccc atcaccacac ccagctaact tttgtatttt 120 

tagtagagac ggggtttcac catgttggcc aggctggtct tgaactcctg acctcgtgat 180 

ccgcccgcct tggccccgca aagtgctggg attacaagca tgagcccagc gcctggctgt 240 

atctttcatt ttacccaagt cactttaccc aagtaagtaa ttaggggaaa gcctgagtct 3 00 

tgtaccacct gttcatttgg ggaactgtgg gaaacggagc caacggacct aagtgccctt 360 

tgacagtgag tttcatacca tttcagtagt gtatttcttt cttaatctga ataaaccaga 420 

atgatactct cagcacagaa gaataaaggg agcgagtcat taacgttntc tttttaaacc 480 

tttatgatga cttncttatg aattactgaa cgaacactgg aatgggactc acgtatcctg 540 

aggacatctc tcaactctgg ccttantttc ccctctgtaa aattagggtg ccaactaaat 600 

gatctacaag gtccctttnc aagcgcccgn cattctgtaa ttacatcatg tggaactgna 660 

ttaaacatac accagtgaac tggcangcat tgggaatgta actttcccag taaaatgctt 720 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
742 
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tnggtttggt tcaaaataca ctntgaactt cttttcaaag acnggttnng 770 

<210> 736 
<211> 746 
<212> DNA 
<213> Homo sapiens 

<400> 736 

tttnnctttt attcaaatnc ttgcnggatc ccttgattcg aattcggcac gagggatgnc 60 

catcgatgct natcnggcac gaggtgatgn cagcttgcaa actggtctac atnncaaact 120 

gatagtacat tgccatctnc aggaagactt gacggctttg ggattttgtt taaactttta 180 

taataaggat cctaagactg ttgcctttaa atagcaaanc agcctacctg gaggctaagt 24 0 

ctgggcagtg ggctggcccc tggtgtgagc attagaccan ccacagtgcc tgattggtat 300 

agccttatgt gctttcctac aaaatggaat tggaggccgg gcgcagtggc tcacgcctgt 360 

aatcccagca ctttgggagg ccaaggtggg tggatcacct gaggtcagga nctcgagacc 420 

agcctggcca acatggtgaa accccatctc tactaaaaat acaaaaatta gccangtgtg 480 

atggtgcatg cctgtaatcc cagctcctca gtaggctgag acaggagcat cacttgaacg 540 

tgggangcag angttgcagt gagcccgaga ttgcaccacc gtactnnaac ctgggtgaca 600 

gagcgagact tatcttatan ataaatagat ngatcttcac ctgggtgaca naacgagact 660 

tatagataga tagatagata gatggataga tngatngatn gatagataga ttgataaacg 720 

gaattgggcc ttttgcttta atgaaa ■ 746 

<210> 737 
<211> 751 
<212> DNA 

<213> Homo sapiens 
<400> 737 

ntnnnncttt ttgatcantc ctttnttgga tcccnttgct acttgttctt tttgcaggat 60 

cccatcgatt cgaattcggc acgaggctga cctacagcag aagctgctgg atgcagaaag 120 

tgaagacaga ccaaaacaac gctgggagaa tattgccacc attctggaag ccaagtgtgc 180 

cctgaaatat ttgattggag agctggtctc ctccaaaata caggtcagca aacttgaaag 24 0 

cagcctgaaa cagagcaaga ccagctgtgc tgacatgcag aagatgctgt ttgaggaacg 300 

aaatcatttt gccgagatag agacagagtt acaagctgag ctggtcagaa tggagcaaca 36 0 

gcaccaagag aaggtgctgt accttctcag ccagctgcag caaagccaaa tggcagagaa 420 

gcagttagag gaatcagtca gtgaaaagga acagcagctg ctgagcacac tgaagtgtca 480 

ggatgaagaa cttgagaaaa tgcgagaagt gtgtgagcaa aatcagcagc ttctccgaga 540 

gaatgaaatc atcaagcaga aactgaccct tcttcaggta gccagcagac agaaacatct 600 

tcctaaggat acccttctat ctncagactc ttcttttgaa tatgtcccac ctaagccaaa 660 

accttntcgt gttaaagaaa agttnctgga caaaacatgg acatngagga tctaaaattt 720 

ggtcanagca tctgtgaatg agcatganga t 751 

<210> 738 
<211> 795 
<212> DNA 
<213> Homo sapiens 

<400> 738 

aatccctttg ctttaancct tgtttgaacc cctttggaac tncctctntn tgnaggatcc 60 

catcgattcg aagagcncan gcaggaagag agagaccctn actgctgggg anttnctgcc 120 

acactcaagt ccccaaccca ctggaatctc ccctactaca agtgccatgt anaccccttg 180 
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aaaaggggag gggcctaggg agccgacctt gtcatgtacc atcaataaag taccctgtgc 240 

tcaaccaaaa aganaantan anaaaactcn agcctctaga actatagtga gtcttattac 300 

gtagatccag acatgattng anacattgat gagtntngac aaaccacanc tcgaatgcng 360 

tgaaaaaaat gcnttatntn tgaaanntga natgctatat nnntcattnn ttaccattnt 420 

antctgcagt aaacaaantt tacagcancn nttgnntnga tttcatgtnt caagttcaag 480 

gnganntgtt tggcgttnat ntaattcggc ccnacncgng acccttttgc attgggcccn 540 

nnacccanct ntagttccct nttagngagg ggnaattgcg cnctttggcg taataatngg 600 

gcanangctg nttttcccnn tgtnnaaatt ggtttatcca gtttannaat ttcaacacga 660 

tnaatatcaa acccggtaag cnattaaatg gtnaaaaacn ntgnggggng cccttaanga 720 

gttgaactta accnganatt aaattgcnnt tncgcnttna atntcccncn ttttaaatcc 780 

nggaaaacct tcccc 795 

<210> 739 
<211> 763 
<212> DNA 

<213> Homo sapiens 



<400> 739 

ttnnnnncct catnaatccc ttctttgatc cctcnccnca aaacccttgg cnactcgctc 60 

tttntgcagg atcccatcga ttcgaattcg gcacgaggca nccttcgcct cctgggttca 120 

agtgattctc ctccctcaca tcccaagtag ctgggactac aggcacgtgc caccacaccc 180 

agctaattnt tgcattttta gtacaggcag ggcttcatca tgttggccag gctggtctca 240 

aactcctgat ctcaagtnat ctgcccactt tggcctccca aagtgctggc attacaggaa 300 

tggagccacc gcgcccagcc tgatttcttt anntangtct tgtcangaaa natattgant 360 

ctnttgattc ntnaacatgg cnttnggtcg tctttaatnn gnctcatcan tgcctccatg 420 

tgttnttgat gccttngaac tggtattttt aaaatnncaa tttctaattg nnnattatnn 480 

aaacacaatt gggntnnata tattggcatt gtattaatgc aactttccta aactcactag 540 

taattctagt agcntnantt ggtanattct taaggatttn ctgngtnaat agncatgtca 600 

tctgtgaatn aagccattct ttganccttt tcaaattttg agccttgtat ttcttattct 660 

taccatatca cattggcaaa gacctccagt atganattga ataaangtgg tganagaaaa 720 

caccctncta aaantgctng aattacaggc atgaaccacc ntn 763 

<210> 740 
<211> 765 
<212> DNA 

<213> Homo sapiens 



<400> 740 

tnnnnnnnnn tttttnaacc ntttnttgna tncntctntc aaatcgcttg gctacttgtt 60 

ctttttgcag gatcccatcg attcgctagc ctgggcaata tagtacgacc ctgtctttac 120 

taaaaatgca aaaattaacc acgtatggtg gctcacacct gtagtcctgg ctactgagga 180 

ggctgatgca ggagaatcat ttgaacccag gaggtcaagg ctgcagtgag ctatgattgc 240 

accactgcaa tccagcctgg acaacacagt gagaccctgc ctcacaaaaa ttatattctg 300 

attttctgag tccatgaaca cattgtccaa atggattttt ctagctcctc caagttacag 360 

atagttccac gcacacacag aactcaccac tctcaaatat tttccccact agtattacta 420 

ttaaattttt caaacatgca aaagatgaaa gaattgctca gtgaacacca tgtacccacc 480 

acctagattc tacaattaac attttaccct actttcttta tcacatatat gtacctatcc 540 

atctatccat tcttccatga atccatcaat tcatctaatt ttttatatat ttcaagttaa 600 

gttgcagata tgtagcttat gtttcacctt aaatgtttct gcctggctat tattaactgg 660 

agtgcaatat gtttttggnt cttctttatg gtaaaatcta tgttcagtga aatgcacaag 720 

acttangtat gccattaata gggtttgacg aatagacaaa ccttn 765 
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<210> 741 
<211> 753 
<212> DNA 

<213> Homo sapiens 
<400> 741 

ttnganccnt tnnntnnttn nntnaatgaa gccatttgct acttgntctt tttgcaggat 60 

cccatcgatt cgaggaaggt ggaggggcag gnaacaggac ggacaggccc cgggctctgg 12 0 

cacatcctgg ggaacaaggg accacaagga cgggggcagt ctccagactt cccctgggcg 180 

cttgacccca ggccttgcag gggagagagc cagggcctcc ctcaggtctt tgttcatgct 240 

gttttccctg ccgtggacac cctttcccgc tctccgattc tctaaatcct gccccatctc 300 

ccagatcttg ttcatgtcca agcttttcca ggaagtctta gcagctccca caccgcagag 360 

ctcgagatgt ctccctgact tggtcccaga ccccaactat gtgcaagcat ccacttatgt 420 

gcagagagcc cacctgtact ccctgcgctg tgctgcggag gagaagtgtc tggccagcac 480 

agcctatgcc cctgaggcca ccgactacga tgtgcgggtg ctactgcgct tccccancgc 540 

gtgaagaacc agggcacagc agacttctnc ccaaccggca cggcacacct gggagtggca 600 

caactgccac cagcattacc acagcatgga cgagttcanc cactacgacc tactggatgc 660 

aaccacaggc aaanaangtg gcccanggcc acaaaggcca atttctgnct ggaggacanc 720 

acctgtgact tnggcaacct naaacgctat gen 753 

<210> 742 
<211> 767 
<212> DNA 
<213> Homo sapiens 

<400> 742 

tnganccttt cgnttctnen ctcctaagcc tttgetaett gctctttttg caggatccca 60 

tegattcgea ggacatggag cagtacctgt ccactggcta ectgeagatt geagagegge 120 

gagageccat aggcagcatg tcatccatgg aagtgaacgt ggacatgctg gagcagatgg 180 

acctgatgga catateggae caggaggccc tggaegtett cctgaactct ggaggagaag 240 

agaacactgt gctgtccccc gecttangge ctgaatccag tacctgtcag aatgagatta 300 

ccctccaggt tccaaatccc tcagaattaa gagccaagcc ancttcttct tcctncacct 360 

gcaccgactc nggcacccgg gacatcagtn agggtgggga gtcccccgtt gttcaanccg 420 

atnaggagga agttcaggtg gacactgccc tggccacatc acacactgac aganaggeca 480 

etceggatgg tggtgaggac agncactctt aaattgggac atgggcnttg nctggccaca 540 

ctggaatcca ngtttggctg tatgengaat tncacctgga aaagecaagg ttggtntata 600 

ganggtcttg atttttacnt anttgncaat aatgggttga gnaaacttaa agaaccagtt 660 

taacaataaa atngttaggg accegttnan aaaatggang tctnccttcc atntnaacct 720 

ggannccttn aaacntttnt gngtccnaat tttcgttnca tccannn 767 

<210> 743 
<211> 768 
<212> DNA 
<213> Homo sapiens 

<400> 743 

naancctttc nnncttcgen attcnaanng ntnggaaagc tcantcgctc natagngenn 60 

gggcttcgcg agnnntggga natnacanag gctngttanc ataccngttt ttnactgean 120 

aggnnnccac angcagcatg geccatgnna tgnccatgcc antgatggen ggnggccatg 180 

ctgtcagcgg annegcaett gtgaggancc nntntggann cngtanncna canncacccc 240 

cagtctggna cccnagtgtt cttactacac caantgaaac gctggnnagc caagagcccn 300 
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gatggcccac gtnccctgca tgganccccc tgancngact ccaccagcct atacangngg 360 

aagccanaag cagctgtttt cngccntgcc ctgctgataa tgccttgaag accccatacg 420 

acctnnacgg nctacattga cantnngact gtgncancct ngatcagatn atcctggaac 480 

tgggnccnng attccaggan cttnccntca atggacctgg gngcttgtaa tcngttntgg 540 

accatacanc cnttgtanna gataaaagan ngaggaaatc tgaaaccntn gnaataagat 600 
ctgnggcatt agtnnntcaa ggggaggntn ggtnncaaaa cnctatgagg aagaacgatg - 660 

gnactatgtc catgnaaggg gaacatntan tgttgganna tgcnatgcaa ncntnnccnt 72 0 

gatntaacnc tttganaaac tnangcttna caaaggggga aaaanact 768 

<210> 744 
<211> 757 
<212> DNA 
<213> Homo sapiens 



<400> 744 

tnnnnncnnt tnnnnttnat ncntctctca aatcgcttgg ctacttgttc tttttgcagg 60 

gatcccatcg attcgcttga cctctgtact ttaaaggaaa tcactaacca aattttcaaa 120 

gtttcctttt aaatgcgttt agctagaaat ctatgtattt atccctttcc tattttgcat 180 

tcttctccca ctatttttaa aaactcattt acagtagaaa ccattcttct ttctcccaac 240 

agtatccttt gccaagacca tgagaacagt aaggagcatg ttgttggtca gggtttcaga 300 

atacgcgtga tgtcactgag aatgtttgct cacagtcaat aattgtcttt gtggatgtga 360 

taattttgga gatacacttc tggtcagaac tcaggtgaga taatcttgca atactccaaa 420 

tgcagatact ccagccaccc gcaaggttcc aggaaaggac aatgtcctgc gagaaaatca 480 

ggaggcctcc acttcctggg ccacttgaga agttcctggg catgtcacta catgttggtt 540 

gactcagcca tttctcatgc tgntttgttt cttgcggtgg ccacttaacc ccaaagaatg 600 

aanggaggat ccacagtgaa agtgcctgag tttctctatg agaccagatg ctgtcgaaac 660 

caaacatctt ttcctttgct ctatnggaac attttaaggg ttggtttgca caactggttt 720 

tcagactngg aagattacca agtttgggtc ccccctn 757 

<210> 745 
<211> 751 
<212> DNA 
<213> Homo sapiens 



<400> 745 

cttnttnnnt ttnntttgat ncctctacnc aaacccttgg ctactngctc tttntgcagg 60 

atcccatcga ttcgaattcg gcacgaggaa naacagacag gtttcaacat ggatggatct 120 

gaaatgctgt tgaagcatat catttgcata aaaatcaggg acagtttcca aagaattata 180 

tatttttttc agttggctct ctagttagtt tttttgggag taaggacaaa cctggaatag 240 

atagcaaaac tgaaaatcan cagtgctgat ggtggtacat atgtctttcc tttagcttct 300 

cccctgataa ttcccatctg cttttacttc gggtgagcag agggggatgt gtgtgtgcgt 360 

gtgtgtcagt ctgtttgtga gtgtgttaaa ggctacagac cacagttggt ttaaaatgct 420 

tggaacttcc caaactggct ttactttatg tttatacagt gctcagggtt aacgcagtac 480 

atccatgcca ttgctgtggg aggtatcccc ggatgcatgt gttttgagtc tataaatata 540 

gaaaatatat attggtttct ttttccaact taatangttt attaaagcat gaaatgaaag 600 

ggtgcatatc atgcattcaa gntatntcct aatttttggt ctgacagtgc atgtctttgg 660 

agcatgctga aacaanaatn acacaggaat tgantaaccn gaaagaaaca ttgttaaatg 720 

tccaacattt gttatgcatt tntattgggg g 751 

<210> 746 
<211> 760 
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<212> DNA 

<213> Homo sapiens 



<400> 746 



tnmmntntn nnnnntttnn 
gcaggatccc atcgattcgc 
attgctccca aagagaaagg 
tagttgatga tggtatggtt 
ttccaagtaa agctcttcat 
ctgagggaag tcaaaagcat 
gatgtgaaac ggaagagcga 
gggaacagct aaaggcagaa 
aaatacgcca agcaaataaa 
tccaataaaa tcttgggcca 
ttttggaatt ncagaagact 
tttcaagctt gngaatttgt 
ttggctgtac tgggttatca 



nttcntnnnn ctttgaancc ctttgctact tgctcttttt 
tgaaacaaaa gatgtatttc aattaaaaga cttggagaag 
ctttactggn tntgtcangt aaaagaagtc cttcaangct 
gactgtgaga ggatcggaac ttctaattat tattgggctt 
gcaaggaaac ataagttgga ggttctggaa tctnagttgt 
gcaagcctac agaaaagcat tgagaaagct aaaattggcc 
accangctag caaaagagct ttcttcactt cgagaccaaa 
gtagaaaaat acaaagactg tgatccgcaa gttgtggaag 
gtagccaaag aagctgctaa cagatggact gatnacatat 
aaagaaaatt tgggtttgaa gaaaataaaa ttgatagaac 
ttgactacct ngactaaaat attccatggt ggtgaaagat 
aaattttnaa ctattatcta actaatgtnc tgaattgccn 
ttttattaat ggtaaataaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
760 



<210> 747 
<211> 786 
<212> DNA 
<213> Homo sapiens 

<400> 747 

tnngncttta nnccntttnn attgnnnnnn nttgaaaccc ttggcnactn gctctttntg 60 

caggatccca tcgattcgaa ttcggcacga ggaggctgtg tcaaagaatg aatggaacgc . 120 

ctactatgag gaggtgggtg tacgtnctag anggagatcg agtacatgat ccagaagctc 18 0 

cctgagtggg ccncggatga gcccgtggag aagacgcccc anactcanca ggacgagctc 240 

tacatccact cggagccact gggcgtggtc ctcgtcattg gcacctggaa ctaccccttc 300 

aacct caeca tccagcccat ggtgggcgcc atenctgean ggaactcagt ggtcctcaag 360 

ccctcggagc tgagtgagaa catggegage ctgctggcta ccatnatccc ccagtacctg 420 

gacaaggatc tgtacccagt aatcaatggg ggtgtccctg agaccaegga getgetnaag 480 

ganaggttcg accatatcct gtneaeggge ageaeggggg tggggaagat catcatgacc 540 

getgntgeca agcacctgac cctgtnacgc tggaactggg aaggaagagt ccctgctacg 600 

tgggacaaat aactgtgaac tggaccttgg nettnetaac attggncttg gggggaaatt 660 

catnaacaag ttngecaana cctgcgtggg cccctgaaat acattctttt nggaccccct 720 

tgnatccaga accccaattg nnngnngaaa acttnaaana aantnncttt naaaannntt 780 

tttnet 786 

<210> 748 
<211> 722 
<212> DNA 
<213> Homo sapiens 

<400> 748 

tggaactngc tetttntgea ggatcccatc gattcgaatt eggcacgagg aggaagaggc 60 

ctgctccact tgtctgggaa cctgggcagg aggcacagag gaagecaagg cctggagctg 120 

caggtccccc ggcatctctc tctgtcccgg cagcccagga tggcctggtg cccccacctg 180 

ctgeagcagg agccccaagg agtgctagct gagggtggtt gctggggtgg tcctcatgga 240 

cagtgaggtg tgcaagggtg cactgagggt ggtgggaggg gatcacctgg gttccaggcc 300 

atecttgetg agcatctttg agcctgcctt ccggtgggag canaaaaggc cagaccctgc 360 
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tgagttanag gctgctggga tccactgttt ccacacancn ggaaggctgc tgggaacagg 
tggcanagaa gtgccatgtt tgcgtngaac cttgcantct tncanctggg gactggtnct 
tgctgaaacc cacgagctgn acantnanga gctgtccanc ttgcttggct cactgngacc 
aggaaagcct gtctttggtt agctcgtgtc ttctgcagga aaaaaaaaag gatgtgtcat 
ttggccatga tatttgaaaa aggggaagga tngccnaant ttgtttncca tttattccag 
tanttggaaa attttttgac cccctnngct taattctttt gcaanaacta ctggggggtn 

tg 

<210> 749 
<211> 821 
<212> DNA 
<213> Homo sapiens 

<400> 749 

tttnaanncc cttgctactn gttctttttg caggatccca tcgattcgtn gacatagaaa 6 0 

acatacagta agaatatggt attataatct tacggggacc actgtcaaat cgcggtctgt 12 0 

ctttgaaaag ttgtnatggc ggcgcatgac tataaatacc ctagctggtt agcatttaca 180 

ttccttgcca gggagtttga aatttatnct nggcgggctg nctttaggnt ttaggtagag 24 0 

ttaaagaggt aaagcacatg tttgccacaa cccaggaaag tatttttaag aaagatttgg 300 

attttcctac ctttagagat ctaaaaaaaa tttaatataa aaaatcattt tgagntggtg 360 

tttattacta gttcagaatg agtggctgct gaagggggcc cccttgnnat tttcattata 420 

acccaatttt ncactttatt ttgaactctt aagtcataaa tgtataatga ctttatgaat 480 

tagcacaggn taagttgaca ctttgaaact ggccatttct gnattacact atcaaatagg 54 0 

aaacattgga aagatnggga aaaaaaattc ttattttaaa atggcttaga aaagttttca 600 

agattacttt ggaaaattct aaacnttnct ttctgngttc caaaactttg gaaaatatgg 66 0 

tagatnggac ctcattgcca tttaagactg gttttcaaaa gctttccctc aacatttttt 720 

aaaggtgtgg anttttccct ttttaaatat tccataattt aantttcctt ttnaaaggcc 780 

nctnnttttc ccaaacccat ngncttttgg ggnaaatccc c 821 

<210> 750 
. <211> 770 
<212> DNA 
<213> Homo sapiens 

<400> 750 

gntttnnnnn nnctttnttn nntgnctntt tctaagagct tngcnnatgc tnggtcggca 60 

cgaggcaaca tttgtctaca actctactgt aaaattggaa atgcttttcc acagaaaaac 120 

ctctcaaaat gctgaatgca aaagttggga tcacagaaac attgtgccta tttttggtct 180 

gctggaaact gtatttntac aaggtaatcc ctgttctcaa tatagttcct gtcttgccac 240 

tggcggtttt cttgtagcat ttttctagtt ctgagattgc tactacccaa agtattcatt 300 

tctttcttac tggggtgtcc tctgtcttca cagcctgctt ctggattgta ggttttttcc 360 

tttctttctg ttgagatatt tatggcattt gatagagtca aaccagatgt attgcagccg 42 0 

gacatactta tgtggcttca gatgtgtaaa ataagtaact tcctatcttt gtctgtctag 480 

ctcaagagtt gactgtggac gaggaatgcc tgtattgatt cattaatgta ataactattt 54 0 

actgactgcc taccatgtac aaccagaaac acagttccta acctcatgaa cttaccatgt 600 

aacatgggaa gacaagccta agttcttatt tggntggnaa ttgcgataac gctcacagaa 660 

caaattcccg attcctacga acccatgtat aggggggaaa tatttaaggt cccatttaat 720 

actgacattn gcccnccccc ctnntatttt aagctgagaa tctgaaggnn 770 

<210> 751 
<211> 774 



420 
480 
540 
600 
660 
720 
722 
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<212> DNA 

<213> Homo sapiens 



<400> 751 



cgttnnnttt ccncctttga 
tgtcagttct gtgtagactt 
actttttaat tcctttgatc 
agtaagcata gttttatcca 
aaaccaagca caccatagcc 
ttggtgtttg ttttttgact 
cagtaacgct gtttatctca 
aaaaaagatt gataaagtag 
actgagtgaa attgggatac 
catgtctcac ttggaaagct 
taattatccc tgcctgntgt 
tttaccttat ctggaaaaaa 
actnttagng agnccntttt 



agcccttttt gcaggacttt cnaatncttg gtagacttta 
tatgtcagtt tttgtcatta tttgaaaatc tattctgaca 
ttataagtta aagctgtaac aactgaaatt gcatggatca 
gggagacngc tcnnnggaag ccatagaatt gctctggtca 
ttaactgaat atttaggaaa tctgcctaat ctgcttatat 
gttgggcttt gggaagatgt tatttatgac caatatctgc 
cttgctttga aagccaatgg gggaaaaaaa tccatgaaaa 
atgattttgt ttgtatccct acccatctcc tggcagccct 
atttggctgt cagaaattat accgagtcta ctgggtataa 
agtcctttta aatgggtgcc aaaggtcaac tgtnatgaga 
ccatgtcaga cttttgagct gatcctgaat aataaagcct 
aaaacattnt anancaaaaa aaaactnnga gccctttana 
ccgtagaatc ccngacntgg ntaaggaanc nnnc 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
774 



<210> 752 
<211> 778 
<212> DNA 
<213> Homo sapiens 

<400> 752 

gntttgaann ccnttgtttc gnatcctttt tgnaggactc tgaagncctt tgttcggcnc 60 

gagaagaaac tctgcctcag aaaatgttta cagcttccag tggaatcaaa cataccatga 12 0 

ccncaattta tccaagttct aacacattag tagaaatgac tcttggtatg aagaaattaa 180 

aggaagagat ggaaggggtg gttaaagacn ttgctgaaaa taaccacatt ttagaaaggt 240 

ttggctcttt aaccatggat ggtggccttc gcaacgttga ctgtctttag ctttctaata 300 

gaagtttaag aaaagtttcc gtttgcacaa gaaaataacg cttgggcatt aaatgaatgc 360 

ctttatagat agtcacttgt ttctacaatt cagtatttga tgtggtcgtg taaatatgta 420 

caatattgta aatacataaa aaatatacaa atttttggct gctgtgaaga tgtaatttta 480 

tcttttaaca tttataatta tatgaggaaa tttgacctca gtgatcacga gaagaaagcc 540 

atgaccgacc aatatgttga catactgatc ctctactctg agtggggcta aataagttat 600 

tttctctgac cgcctactgg gaaatatttt taagtggaac caaaataggc atcccttacc 660 

aaatcaagga agactgactt ggacaccgtt tggaaaatgg gtaaaaacgg tggnttactg 72 0 

gtganttggg gagcnagaac cggacccact ggtatactgg ggantaacaa tttttttc 778 

<210> 753 
<211> 775 
<212> DNA 

<213> Homo sapiens 
<400> 753 

gcttttgaaa cccttttgtt aacgcctttc tgcatgatct tctcgtcctt gaaagggccc 60 

taaaagagat gaacaatacc gtatcatgtg gtttgaatta gaaacccttg tcagagccca 120 

tatcaacaac tcagagaaac atcaaagagt cttggaatgt ctgatggcat gcaggagcaa 180 

acccccagaa gaggaagaac gaaaganacg cggctgaaag agggaagaca aagaggacaa 240 

gtcagagaaa gcagtgaaag attatgaaca ggaaaagtct tggcaagact cagagagatt 300 

aaaaggaatc ttagaacgtg gaaaagaaga attggctgaa gctgagatta taaaagattc 360 

gcctgattcc ccagaacctn caaacaaaaa accccttgtt gaaatggatg aaactccaca 420 
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agtggaaaaa tcaaaagggc cagtgtcgtt attatccttg tggagtaata gaatcaatac 480 

tgccaattcc agaaaacatc aggaatttgc tggaccgttt gaactctgtt aataacagag 540 

ctgaactata tcaacatctt aaagaggaaa atgggatgga gacaacagaa aatggaaaag 600 

ccagccggca gtgaagagtg acttgangaa ctaaatttta gcatattgca aaaatatttt 660 

gtgcgggaat tcgatatnag tacttttacc agcaagatgg natngttatg tttgcctgga 720 

ctggntttta catttttnaa attttttcag tgnccttttt tggtcctaaa ttatc 775 

<210> 754 
<211> 1032 
<212> DNA 

<213> Homo sapiens 
<400> 754 

ggnntttttc ccaaaaaaaa ggggccccct nggggntttt tncncaanng gncccctttt 60 

tctttgncca gggnaacntt ttttgngaaa aganccccct ttttggatnn accggggccc 120 

cccggaaggt tccnaaattt tnagggttna aacccaaatc cttggggaaa aaaaaaaaac 180 

ccagggccnt ntntggggnc cccctngggg gggtngggaa aaaaaaaaaa gggggaatgg 240 

cccccaaaaa aaaatnnggg gcccctnggg ggaaaaaaaa gggaaagccc aggtngggaa 300 

nggaaagggg gaaggntccc ccggggggaa aggaaatggg tgggtnggna atggcccaat 36 0 

ggttggaaaa ggcccaaacc aatttgggnt ntaaaacaat ttcaacctgg gggggtcctg 420 

gcccanaaaa aatgcngggc acccncgngg ggtctggctt aagaattggt tacaagggca 480 

aagggaaagg gaagagttct agagataaag aactatatgc ttggatgaag tgtgtgaagg 540 

gacagcctca tgatcacaaa catttaatgc caacccaaat tatacctggt tctgttttga 600 

cagatcttct agatgccatg cacactctta gggaaaaata tggtattaaa tcccattgnc 660 

attggactaa caaacagaat ttacaagttg gaaattttcc tacaatgaat ggtgtatctc 720 

aagttttaca gaatgntctt aatcacagna ataaaatttc tctgtgcatg cctgagtctt 780 

cagcagcaaa aatactcctc cgaagtctga gaaaaatggn ggcagcagcc caagaagagt 840 

gatgtaggca cagataacna aggntaacct cctccagaat ccccagtcac cactgcactg 900 

gttaagcaga acttngcagg agcaaaaaag cccngangan ggaaaaaaaa aannaaaaaa 960 

aactcggagc cctcttagaa ctatangggg ggccgnnnta ccgnangatc cccgacctga 1020 

anaggaaccc cc 1032 

<210> 755 
<211> 798 
<212> DNA 

<213> Homo sapiens 
<400> 755 

ngnnnnnttt nncccnacna aatccctttt ttgaagcctt ctantgnctt catcgtnctg 60 

gtaaattggn tgaattattg tattgaagct tgagctgtat tttnaagtaa tttnggttnc 120 

ccctaagatg ttattatgtt agggacataa cacttttggg aggttgttgt gggagatggt 180 

tgatttaggt tttcaaaagc tagaaataaa atttacatnn ccccggntnn cataaaattc 240 

tgctctaatt gggtggaagg tgctgtatct aacttgtgtt cctnctaagg ttatgtccta 300 

ataactattc ttttaggagt atacttctac tttatagaag gttgcttttt ctttttaatt 360 

ttntctaaca aagaaaagaa tnaagtattt attaataaag aaccagaaag cacttgaaac 420 

tgatgttttt aaatgggctc acttanggta gatttattta tctcattaac ttaaaaacag 480 

ctatgtgnat tgaaataagt cacaacagaa cttgaacacc agggtgggtg tctgagcaat 540 

cccctttctt atggggaaaa acaaatggtt cttgtttgaa cangaaggta tcattgcagt 600 

cngcattcac ccgtgtataa ttgnnatata agntgnataa tatgctcgta aaggctnaag 660 

gtnagctgga tctggatgcc ctttnaccaa ttangatttt aacttttaan aataaaattt 720 

naaanctaat tgncnaaata aaaaaaatan naaacttcgg ncctctacaa nttntagatg 780 
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ngtcgattnn cgnncanc 



798 



<210> 756 
<211> 834 
<212> DNA 
<213> Homo sapiens 

<400> 756 

tttgaaaccc ntttnttnaa gcctttttaa tgactttanc gncctttatt cggcacgagg 60 

tccttcagct ggtagcttnc attcgnantt nnanatanta tntgtgcatg cncnnttgaa 120 

tttttgtgga agaacagant gcagaagaag gcnaggaaag ccgaagagan tnntncggca 180 

ncagaagctt aaagnaggcc aaactggtgg tgcnctttcc tcggnacaga agctggatga 240 

ctatggccaa tttggagaaa nagctccagg agatggaggc acggttcgag aaggagtttg 3 00 

nagatggatc ggatgaaaat gaaantggaa gaacatganc tcaaagatga ngatggatgg 360 

taangacagt gatgaggncc gaagacnctg agctctatga tgacctttta ctgnccanca 420 

tgtgacaaat cgtnaanaac agtaaaggcc atgaanaatc acntagaagt caaangaaag 480 

cnnttgggaa aaatggnggn nctttgntaa aaccacnagc tgganggang gaagaannna 540 

aaatttttta agnacctcaa attgattgaa aaatncatta tgatgacaat tcctgnanga 600 

ataaattggn agatgcncta naancaaaan gcnttttttn antnnaaana nacaaannnt 660 

nnagcctntt ngaacntata gtnnannctn cntttanctn tntatcccgg actttnttnt 720 

ggataccntt gactnagctt ttggacaaaa ncncnacttt gtattncatt ngnnaaaaaa 780 

atgcntttat ttttcgnaaa tttggtgaat ncntaattng ntnntattnn nnnc 834 

<210> 757 

<211> 1062 

<212> DNA 

<213> Homo sapiens 



<400> 757 



tttttccaaa aaaatcnccc 
gggccagggg aatnccccca 
gccccttggg gaaagggncn 
tttngggggg aaaaagcccc 
aaaagggggg aacctttngg 
ctggtttggg aaagggaaaa 
nttaaaatgg aaggtttggc 
tttgggancc ctgcttcatt 
tancaggaaa anggagatga 
cnacttaaat gctttttgaa 
atcaactngg tagaatttca 
atttggtaaa atgccttttt 
natttcaaat ggtttaacaa 
tgaccaggaa agattttnaa 
gaagaaaaat tgggatttgg 
ntcnnannat nnttnatnnt 
nannnannaa ataaatantc 
tttnannnta nntatctaan 



ccnttttttg gccttnaana nanngggccc cctttttttt 
atnccggaat tttccggggt ntttgggggg nttggaaagg 
tttccnaagn aaaggggtng gaaaattttn taaatggcct 
ctnggaatnc ccccaaaaaa cccttgggaa aaagggggga 
gnaatccttn cccnttnaat aatttggggn aattaaancc 
gggttggtct tggtcttggg ggaanggaat tgggggccaa 
canaatnggc cncttcgggg gcttnttcaa aagccaagcc 
tttngggccc tttnctgcca aggaanccca acccttaact 
aaggccttct tccaaggaag gtaaggtcct ttggctgccc 
antctcttag atgtggnaaa tattttttcc gaaccttgaa 
attggaagca taatccattg taaaatatat tttagttgat 
tggtggtgtg gttngaatcc tgggtttccc aagaatcttg 
angggaagga aaggganctt ttcccttaac cttccctttt 
aagtaccttt ctttttaagg aaaaaaaaaa attaaatttt 
attttanaaa aaangggaaa aaaaatatna ntattnatan 
ctanntantt nctntnnnta ntnctnntnt ntnnannnna 
nnncatnctt anctacanat nccnntcttn nttntannac 
tctntcccta ttntaccctn nc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1062 



<210> 758 
<211> 845 
<212> DNA 



237 



WO 99/33982 



PCT/US98/27610 



<213> Homo sapiens 



<400> 758 



aaancccttn tttnaaatcc 
tagcgtcggn tccgcntggg 
tntgcttacg ggcaaaaacc 
gaaggtgatt gcctttgggg 
tgntttggac agaantggga 
tgtgttcctg tcantcacgg 
gtccacggca tgcccaattg 
natcaagcct aatgtgggag 
ggattgggtg ggantggang 
aaaaactttt gggtttttaa 
tgggcctatt naaaaaaaan 
aaantcttnn agccttttta 
ntaacnttta attaaaggaa 
tttaaaattc ntntngaaaa 
tgctn 



tttttanang attcatcgat tcgaattcgg nacgaggcgc 
cccttgcggt gcgctgnggg caggcggtga ggcttacgcn 
tgcacacgca ccanttcccg tnnccgttgt ccaacaacca 
aancttctan gncaacnacn tgaacntatg gacagtgcgc 
acnttnaggn tgntgtgcgc ttcnagcatn tgggcacctt 
gtgagcanta tggaagcccc atccgtgggg cagcatgaag 
caacacgcac aaatacttgg aangccatgg aangcatntt 
cccttttgca agtcacgaat taactctnaa nngtntggat 
gttgcaagtt ngggccnttt tgaaaggcca ctttttggna 
ngggttcntc aaaatgccct ttgnnaattn aaagaaatgt 
atnatacttt atntaatctn nataataata nttantaata 
aaanttttta atgaanctct ttattttanc gttanantnc 
taacaatttg ttgaantttt ggtataaana ncccccantt 
aaaatncntt tattttggta aaaatttgng gaatcnnttt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
845 



<210> 759 
<211> 947 
<212> DNA 
<213> Homo sapiens 

<400> 759 

tngggggggg ccccnanttt ggggccccaa acccttnggg gaaacccccc ttnnnnnttt 60 

ttnccntttt gggggggaaa ngcccccccc caaangnaaa aacccntttt nnnnaatttn 120 

ngggnanggg ntntggggnc ccnttaaccc caangggggg gggttttnan cctgggggnn 180 

naaaatnggg ggaanaantn nnnaatgggn antcccttna angggaaaaa naatttnncc 240 

ttaaggnnat gggncattaa tnttnatccc tantggattn caatttcatt cgnattaaag 300 

gcttttactg gnataatcct tnncggcccg cnctggtagt ttaaagtgcc canaanttga 360 

atgggaaatn acgggttttg aaaatcgcac aaagcagtgc cnggcacnga ggngtcacgc 420 

cngtaatncc agcattttgg gaggcctgag gcangcggat . cacganggca anagagtcca 480 

gaccattnct ggctaacacn gggaaacccc gggnctaata aaaaatcaaa aattaggntg 540 

gacatggtgg cacgtgccng taatcncagc tacttangga agctggatgc aggaagaatt 600 

gcgtgnnanc cnggccccng tggaangntg cattgatacg aagaaccgtg ccaaatgaan 660 

ttanannctg ggcngaannn gagcggaaaa agccctnttt aaaaaaaaan gggantggaa 720 

aaantggtgc canagncatn nggggaaaaa attttnnnnt tnnttnancg gttttnanct 780 

tgnggaaggc cntctttaat nttggggaaa aggcactttt gggntnggtt ttggaaaacg 840 

nntggctttt ccctttnaaa agggaaaaan ggnnttaanc ccctgaaaaa ngngcngnnt 900 

tttaaanggg gnnnnaaaca nggggncttt ggaancccca nnaaacc 947 

<210> 760 
<211> 759 
<212> DNA 

<213> Homo sapiens 
<400> 760 

gnntttctaa tgcttgtnnn nngcntttnt gcaggatccc atcgattcga attcggcacg 60 
agaagatatg cagagatatt ccaggatctt ttagctttgg tgcggtctcc tggagacagt 120 
gttattcgcc aacagtgtgt tgaatatgtc acatccattt tgcagtctct ctgtgatcan 180 
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gacattgcac 
ccaattctct 
tagctaactc 
ttgcanagca 
ctctgcatag 
catcttcatt 
gagatgataa 
atgctgcaga 
ccttgaacta 
ttggtngaca 



ttatcttacc ggctcttctg aagggtctat ttctgaactg gagcagctct 
accaaataaa gaattgatga cctcaatctg tgactgtctg ttggctacgc 
tgagagcagt tacaactgtt tactgacatg tgtcagaaca atgatgtttc 
tgattatgga ttatttcatt taaaaagttc tttaaagaaa aacagtagtg 
tttactgaaa cgagtggtca gcacatttag taaggacaca ggagagcttg 
tttagaattt atgagacaaa ttcttaactc tgacacaatt ggatgctgtg 
tggtctcatg gaagtanaag gagctcatac atcacggacg atgagtatta 
gttaaaacag cttctacaaa gccaaagaag aaagtncaga aaaatttgtn 
gagaaacttg ntntggaaca tttcaaaaga tgaatgacaa tctggattcn 
gtgtaatttg gactttaacc ngatgctcg 



240 
300 
360 
420 
480 
540 
600 
660 
720 
759 



<210> 761 
<211> 752 
<212> DNA 
<213> Homo sapiens 

<400> 761 

cctnactaaa cctttgcnaa ngccnttnnt gctgatccca tcgattcgca ggcctggact 60 

tcgcccccag gcctaggacc gcggtgggtn ttaaccctgc tnctgcccca acagggactc 120 

caatcaatcg gagttctccc cttgccggag ctgcccttca cctttggggc ccgagacagt 180 

cataagggat ggacttacnt ttcttgcagg gaaaaaggtg gacagccgtg tttcttaagg 240 

atgctgaggg catggggcca ggaccagggg agaggcacag ctccttcctg agcagcctct 300 

caccactgcc acaaggctcc ctaatgctgg tctctgctcc actccccggc ttcccgtgag 360 

gcangaggca gagccacagc caaggccctg accacttctg tgccagttgt ctaagcagag 420 

cgcctcaggg acgctggaaa tgccttaagg atagaggctg ggcatcacat caaatgggac 480 

tfftggtgttt ggtgaaaacc ttcctgagga tctggattca ggaccctcca tgactggcct 540 

atttactggt tacagctggc cagtgcanan ctgctgctct tttacctttt taggcccctg 600 

taacttncca cctttaaact gcccaanaag catgcctntt ccacaggaag aagggagcag 660 

acagggaaat ctgcctacca anaagggtgt tgtgtgtctt tgtgcccaca cgtggtggct 720 

ggggaatgcc tggatggtgc cgtggntgat ct 752 

<210> 762 

<211> 1032 

<212> DNA 

<213> Homo sapiens 



<400> 762 



ttctaatgct tggaaacgcn ttgatgnang atnccatcga ttcgaattcg gcacgagggc 
aagtggtagt ggcgcttntc gggtgntgtg cttcacgttt tggtctaaag gncgagactg 
ttgtggcnac ngngnaantn tacnggaang gnttaaantn tnnntgnagt nggaanaatt 
cnatcngaan gaanttgggg gggntagnnn nggttanatn attgatgaat ggnttcaana 
tngnaaantt tatnancgan atgnnatant tnnaaangan gaccaactgg gntnanatgg 
agnannnatn aannggntaa ncnatanana tantncattt ggtanganaa tngangaagg 
attntcaaat agncatgtng gangatgaac ntnnaggnnn nagaatattt ggataaaatt 
ggtantatga agatntggnn taataatacc nanaaatnnn nnantttnat nanngangaa 
ntagganttn atgnctatgn ggatannntn nanntatnat agngataaan tatgatactg 
tttannntat ntnganttag tnattnaatg ntcttgtnan aanttatttt ncgntagtta 
gntagnnnta tnnactttgg naancanana tgtaattctc tctanacggg aatntttnta 
tnntnnntat caagaggtnt ntnnattgna aatantatac nnttgnanaa antatatcna 
tanaanaaan ggnnattatt ntatatganc aaanaaaaaa ntattgngga nntanattat 
ctctcatnat ngattatncn gtantgtata atggnnnata antatgtnnn tntaanataa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
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atggatataa gtnttatant atgcncntna aggnggtcng anaantatgt aattatattn 900 

angctanata cnatnnanat gtntnactaa atatngntgt gaaangtntg cgnggnaaaa 960 

tntgttanta ntnaaacang gtataganat atanatgngn ngaatatcta ctatntgtan 1020 

atacttatan ca 1032 

<210> 763 
<211> 817 
<212> DNA 

<213> Homo sapiens 



<400> 763 

aanncccttn tttctaatnc ttggctactc gtnctttctg caggatccca tcgattcgaa 6 0 

tttcggcacg aggggaggga cccttggggn caggttgtgg gtagccagtt gcagtctgtg 12 0 

gcctccctca gaggtttgga gtcgggcgtg gcatgctgct gttggcctct ttccgaggga 180 

gtgccatcca ctccctgtcc caccgctnnc cctngtgagg acagtgaggg cagtgctacg 24 0 

tggtggggag gtgtgtgtga agccacggaa gggcttcaca gggcaaatgc caaggccagt 300 

gggccccgga cagagtnagg ctccctgggc ggncttgtgt cttggtggcc ctgatcatcc 360 

tgccaatgca naaagccagc aggcaagaga cccctactcc ctttaaggac cattagcata 420 

aacaaaccat tgngttgaat gcaatgatcc aggtgcactt tnagggtaca agctggactn 48 0 

gttggaacag gattacatgg aaaannggaa angggggcan gctgtctctt gggacatnag 54 0 

taatgtcttt ttacccantt gncactctng aanttcaaan ttggncatgt tttctggggc 600 

ctnctngnaa aagcagtttt ttcaccncat natgaagaaa aaacttgttg gcttgganng 660 

tanngggatt nttgntnana cttnccctaa anggntncct ttnggggcat ttntgaaggn 720 

taaataatgg gggatacctt tttaannttc cttgcagatt taaaaatgtt ccttaaanga 780 

nncctcaatg nttnggtctt nttccaaaaa acnattc 817 

<210> 764 
<211> 777 
<212> DNA 

<213> Homo sapiens 
<400> 764 

taatgcttgg ntctcgnttt tntgcaggat cccatcgatt cgaattcggc acgaggtcca 60 

cggtgctgaa catcatcatc tttgaagact gtaggaacca gtggtctatg tcccgaccac 12 0 

tacttggctt gatattgctt aatgaaaagt atttttctga cctaagaaac agtattgtga 180 

acagccagcc acccgcgaag cancnggcca tgcacctgtg ttttgagaac ctgatggaag 240 

gcatcgagcg aaatcttctt acgaaaaaca gagacaggtt cacccagaac ctgtcagcat 300 

tccgtcgaga agtcaacgac tcaatgaaga attccactta tggcgtgaat agcaatgaca 360 

tgatgagctg acacctnctt ggactctacc tgtacagagc agcgtccctt tggtttggcc 420 

cagaggggcg aacaattgca agggagaggg cctggctgat cctggctctt ttctccaggg 480 

gtgtggggaa aaatggcaaa ggtcaactag ctgcttcccc aagggaatag gggtgtgagt 540 

acactcacta nggggcaagg cgctgcttgg ttcctggggg gactgggtgg gaaagggtgg 600 

tgnganggag ataaagagat tcaaactgag actccagtct ttccttctgg gggccaccca 660 

aagttgggga gnaaccccct antggtncct gccaacaacc ttgccttggg attaaacatt 720 

ntncattttt ttcantaana tttttgaaca aagggttant attgnctnaa gtttann 777 

<210> 765 
<211> 774 
<212> DNA 
<213> Homo sapiens 
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<400> 765 



ntttctaatg cttggctctc 
aaaacagctt tgtaggtctc 
tatcaataca ccagtggctg 
tgtgcaaaca gctatggann 
ctgcaaaagt acaatagaac 
tggcacacaa agcttgctgc 
atgttgtttg ttacttcccc 
ttcccgaatg agtcaaaatg 
actgatgata catacctttt 
tgagcttctt gctggaagat 
agtaccctct tacttacaga 
catatttgaa aaatcctggg 
ctggnaagcc agattantng 



gntttgatgc angatcccat cgattcggga aatgcaagtc 
agagtttgct tttaagaagt agtacaagaa ggaatagtta 
aaattatcat gaaaccaaat gttggacaag gcagcacaag 
gtgaactcgg agagtctagt gccacaatca ataaaagact 
tttcagaaaa ttctttactt ccagcttctt ctatgttgac 
aacctcattt agagagggtt gccatcgatg ctctacagtt 
caccaaatcg tagaaagctt caacttttaa tgcgtatgat 
ttgatatgcc caaacttcat gatgcaatgg gtacgaggtc 
ctcgatgtgt gttatgctgt gctgaagaag tggatcttga 
tagtttcttt cttaatggat catcatcagg aaattcttca 
ctgcagtgga aaaacatctt gactacttaa aaaaanggga 
agaanggact atttggctnc ttttgccaac ttacttcata 
ctcaaggaag ttttgatgag ccaaaaaagt tttn 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
774 



<210> 766 
<211> 779 
<212> DNA 

<213> Homo sapiens 
<400> 766 

ttnnncgctn ntgaanaccc cttctcctna aatccttttt aantnccttg ctgnntgatc 60 

ccatcgattc gcgaaattcg gtggcgccac gtccgcccgt cttngccttc tgcatngcgg 120 

cttcggcggc ttccacctag acacctaaca gtcgcggagc cggccgcgtc gtgagggggt 180 

cggcacgggg agtcgggcgg tcttgtgcat cttggctacc tgcgggtcga agatgtcgga 24 0 

catcggagac tggttcagga gcatcccggc gatcacgcgc tattggttcg ccgccaccgt 300 

cgccgtgccc ttggtcggca aactcggcct catcagcccg gcctacctct tcctctggcc 360 

cgaagccttc ctttatcgct ttcagatttg gaggccaatc actgccacct tttatttccc 420 

tgtgggtcca ggaactggat ttctttattt ggtcaattta tatttcttat atcagtattc 480 

tacgcgactt gaaacaggag cttttgatgg gaggccagca gactatttat tcatgctcct 540 

ctttaactgg atttgcatcg tgattactgg cttagcaaat ggatatgcaa gttgctgatg 600 

attcctctga tcatgtcagt actttatgtc tgggcccanc tgaacagaga catgattgna 660 

tcatttttgg tttggaacac gaatttaagg cctgctattt accctggggt atccttggat 720 

tcaactatat catcggangc tcngtaatca atgagctaat tggnaaattn ggtggacac 779 

<210> 767 
<211> 799 
<212> DNA 

<213> Homo sapiens 
<400> 767 

gnnnnnnttn cccgcctttn gaaancccct tctttctaat gcttgttcaa cgcctttgct 60 

gcaggatccc atcgattcgt ggatactgac aatggtggca ggcatttcaa gccttttaaa 120 

ttagtacttt ttgtcgnctt gcttattaaa attttgttaa ttttagcaaa gaccaattgt 180 

tgtgataaac tggtgttttt nggatgcttc aagcacacgt taaccaatcn gccaatnccc 240 

ctttnggttc ctcccattgn tctaaaatag gactttcata ttattaaaac ctcaaaagat 300 

gatccaccca ggatgaacaa agatcaccaa ggggaaagaa aacatttttt atctttacag 360 

aaaacatgtt aagattatat atagatgtat tctttacatt ggatattgta ttagagtcct 420 

ccttacaaga aatgaaatag gtttttagca ctcttagcat tagagttcct agattggtgt 480 

tgatagctac agttttaaaa tgtataacct gaaaatgaag gttaattttg cattgtaaag 54 0 

agcacatttg atctatgtaa aaagtgtcca tttggtgtat tttttttaaa aaagagaaag 600 
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cactttcata ttaagtagca tgtgtatgaa tttaagattt tcatatttgn tgngtctggt 660 

attcagtgaa gtaaaattga gcattttaaa agtttggtgg atggcaacca ttaactatta 720 

aattaaaagc caccttatac tctgctgctt aacttgcttg naaattgcac ctttggnacc 780 

ctgcacattt tcatattnc 799 

<210> 768 
<211> 826 
<212> DNA 
<213> Homo sapiens 

<400> 768 

gnnnntnncn ccctttctaa tggcttgttt ctaaatgctt tttcnaatcc ttggtacatg 60 

atcccatcgn ttcgcgctgt gcttgagacc aacctgacgg gtaccttcta catgtgcaaa 120 

gcagtttaca gctcctggat gaaagagcat ggaggatcta tcgtcaatat cattgtccct 180 

actaaagctg gatttccatt agctgtgcat t6tggagctg caagacnggg tgtttacaac 240 

ctcaccaaat ctttagcttt ggaatgggcc tgcagtggaa tacggatcaa ttgtgttgcc 300 

cctggagtta tttattccca gactgctgtg gagaactatg gttcctgggg acaaagcttc 360 

tttgaagggt cttttcagaa aatccccgct aaacgaattg gtgttcctga ggaggtctcc 42 0 

tctgtggtct gcttcctact gtctcctgca gcttccttca tcactggaca agtnggtgga 480 

tgtngatggg ggccnggagt ctctatactc actcgtatga ngtccagatc atgacaactg 540 

gcccaaggga gcangggacc tttctggtgt caaaaaagat gaaaggagac ctttaaggag 600 

aaagctaagc tcttgagctt gangaaaaca aggggtcctt ccatncccca aatgccttta 660 

catttttgga ggatatgcct nnnggnacnt ttttaaaaaa gcttatnagt tngntatggg 720 

naaaacaatt ttttccttan tttttaaagt ggntaataaa tnaaantcct aatggnaaaa 780 

aaactantcc ttggnaanta ttttccaggn cttnantgtn cccncn 826 

<210> 769 
<211> 802 
<212> DNA 
<213> Homo sapiens 

<400> 769 

gnnnttctaa tgctgttcta atgcttgtca atncttgana cgttcatcga ttcgggaagc 60 

caagcctgga gctgcaggtc ccccggcatc tctctctgtc ccggcagccc aggatggcct 120 

ggtgccccca cctgctgcag caggagcccc aaggagtgct agctgagggt ggttgctggg 180 

gtggtcctca tggacagtga ggtgtgcccg ggtgcactga gggtggtggg aggggatcac 240 

ctgggttcca ggccatcctt gctgagcatc tttgagcctg ccttccggtg ggagcagaaa 300 

aggccagacc ctgctgagtt agaggctgct gggatccact gtttncacac agcgggaagg 360 

ctgctgggaa caggtggcag agaagtgcca tgttngcntt gagccttgca gctcttcagc 420 

tggggactgg tgcttgctga aacccaagag ctgaacagtg aggaggctgt ccaccttgct 480 

tggctcactg ggaccaggaa agcctgtctt tggttaggct cgtgtacttc tgcaggaaaa 54 0 

aaaaaaagga tgtgtcattg gtcatgatat ttgaaaaggg ggaaggangc cnaaanttgt 600 

tcccatttta ttcaagtatt ggaaaatatt tggcccccct ttggctgaaa ttctttttgc 66 0 

aanaactaac tgngtggctt gttcncttac cctttttcan gnttaattgg tttnaatttt 720 

ttgcattgaa attaaagacg tttttaaatt tcntttncaa naacaaaggg cttanatncc 780 

ngantcnana nattggnant tc 80 2 

<210> 770 

<211> 1157 

<212> DNA 

<213> Homo sapiens 
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<400> 770 

cccttttttt tttttcccnn aaaaaaanat tggggncccn tttttttggg nttttttttc 60 

ccnaaaaaaa aattgggncc ctttttgggg ggnntnaaaa aaannnnnnn nccccccntt 120 

tttttggggn nnnnnaaann tnnnnnncnn nttnnnnnnn nnnnnnnnnn ggnnttnnng 180 

gggnnnannc criccccccaa tttcccggnn attnttccgg gcccaatttt tgggaccccc 240 

cagggnnnag aataaggccc ggggnttttt tttncnaggg ncccaaaagg gcccttgggc 300 

caaaggnaaa tccnttggga aattttggga atttggccct tggnanntcc caataccggn 360 

aaaaatgggg aaangnaaaa aaggnttncn ccaaattggt tggggggggg ttccaaagat 420 

tttcattggg ggtncntggg ctttcaaccc naaggnaang ggtttncttt caaaaaatta 480 

cctttaattg ccattaagca attcccaang gttannaaag ggtgtttntt ctcanctatg 540 

cttcganagn gaaaatcaac naatggaaaa tgtgttgtaa ttggtctgca ntctacanga 600 

gaagctagaa cattagaagc tttggaanag ggcggnggag aattgaatga tntttgnttc 660 

aactgccaaa gagtgttgtt gcagtcactc atttgaaaaa ctattttcct gctccagaca 720 

ngaaaaaaac tttatangtt tactaggaat cgatttgaca agcnttcang taacaaacag 780 

ttctnccaag agatatcctt gttnaagaan nattanaata ncnngaaagc ggaaanngtg 840 

aataaatnnc ttcnagaagc ccaaaaannc acngaanaag tatggtgggn cttactggtt 900 

agcacgttct tgacnacaga tggaaattga antctngatt ncctctgatt antgaatgaa 960 

aaggtgacta ttnaanagct cttnanatac catgagtntt tggancattg attgaccaat 1020 

ttcaanncca tttttangat ngaattntta tnaatgattn attnanaant gannnccttn 1080 

gtttaaatta nnaaanaanc cntcnaaana cnanagggga tttataaaat ctaataanan 1140 

ttttnnncnt ntnaann 1157 

<210> 771 
<211> 760 
<212> DNA 

<213> Homo sapiens 
<400> 771 

ngncctttna tnccttntga anccntttgn aattnctcnn nnngttgatc ccatcgattc 60 

gaattcggca cgaggtggaa gaaaattttt tgctgcttct ggttnccaga aaagggagcc 120 

attttaacag acacatctgt caaaagaaat gacttgtcga ttatttctgg ctaatttttc 180 

tttatagcag agtttctcac acctggcgag ctgtggcatg cttttaaaca gagttcattt 240 

ccagtaccct ccatcagtgc accctgcttt aagaaaatga acttatgcaa atagacatcc 300 

acagcgtcgg taaattaagg ggtgatcacc aagtttcata atattttccc tttataaaag 360 

gatttgttgg ccaggtgcag tggttcatgc ctgtaatccc agcagtttgg gaggctgagg 420 

tgggtggatc acctgaggtc aggagttcga gaccaacctg accaacatgg tgagaccccc 480 

gtctctacta aaaataaaaa aaaaattagc tgggagtggn ggtgggcacc tgtaatccta 540 

gctacttggg aggctgaacc aggagaatct cttgaacctg ggaggcanag gttgcaagtg 600 

agcccgagat cgtgccattg cactccaacc agggcaacaa gagtgaaact ccatcttaaa 660 

aaanaaaaan gaaaactcga gcctctagaa ctatagtgag tcgtattacg tagatccaga 720 

catgataaga tacattgatg aattttggac aaaccccann 760 

<210> 772 
<211> 777 
<212> DNA 

<213> Homo sapiens 
<400> 772 

gaaancccat ttnnnnnttc cncttcnaat cccttggnta ctcgntcttt ntgcaggatc 60 

ccatcgattc gaattcggca cgagctctac taaaaataca aaaattagct gggcgtggtg 120 

gcacacacct gtaatcccag ttacttggga ggctgaggca caagaatcgc ttgaacccgg 180 
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gaggcggagg ttgcagttag ccaagatcgc cctgctgcnc tccagcctgg gcaacagagg 24 0 

gagactctgt ctccaaaaac aaaaacaaaa actgttagtg aaggttccct gggacttttg 300 

atattttaaa aattgttctt atgactagta gataaattca ttgccataat gaggctagct 360 

cccagataaa cagtgtattt tcttcttttt tttttttggt gagtggtcca gagctttaag 420 

ctacttttcc agtagtttgc cactttctcc gaggtanttt ggctgctctt tcagtaatgc 480 

taattgtgtg tcaaattttg tctacaacag taggcaacag atgaagataa gttggttgaa 540 

tgtctccagc actatgcatc cctattttct atttattggg gtacactcac tttcagtaat 600 

gngtttcaaa ctggtatttt ttaaaaaaca aatcaatgta aggactgaag ttgaaatanc 660 

caatgtaata aagttaatta gggttatttt taaaaaaaan aaaaataana actcnagccc 720 

tctagaaact atangtgagt cgnnttacct tgaatcccag accttgataa gatacnc 777 

<210> 773 
<211> 782 
<212> DNA 

<213> Homo sapiens 
<400> 773 

gnntnnattc ccctttcnaa tncttggcaa acgctctctn tgttggatcc catcgattcg 60 

aattcggcac gagacagtct cgggtttcat attttgctgt ttttgatgga catggaggaa 120 

ttcgagcctc aaaatttgct gcacagaatt tgcatcaaaa cttaatcaga aaatttccta 180 

aaggagatgt aatcagtgta ncncccgccg tgaagagatg ccttttggac actttcaagc 240 

atactgatga agagttcctt aaacaagctt ccagccagaa gcctgcctgg aaagatgggt 300 

ccactgccac gtgtgttctg gctgtagaca acattcttta tattgccaac ctcggagata 360 

gtcgggcaat cttgtgtcgt tataatgagg agagtnaaaa acatgcagcc ttaagcctna 42 0 

gcaaagagca taatccaact cagtatgaag agcggatgaa gatacagaaa gctggaggaa 480 

acgttaaggg atgggcgtgt tttgggcgtg ctagangtgt cacgctacat tggggacngn 540 

cantacaagc gctgcngtgt nacctttgtg ccccgacatc agacgctgcc agctnacccc 600 

caatgacagg ttcattttgn tggccttgtt atnggctctt naaaggnctt tnccccatna 660 

aggaagccng tggaactttc atcttgncct gnantcgang atnaaaaagn atncagaacc 720 

cggggaaggg gaaaatcctn aannctgact tcccggtttc caaaccagtn ttgnaacaaa 780 

nc 782 

<210> 774 
<211> 793 
<212> DNA 

<213> Homo sapiens 
<400> 774 

gnannngccn cgnttttgat tccccttntt caaatccttt gnnaatcgcc ctcnctgttt 60 

tgatcccatc cgattcgaat tcggcacgag atggcagttg cttttgaagt atatgatgnn 120 

ttcctccact acaaaaaggg gatctaccac cacactggtc taagagaccc tttcaacccc 180 

tttgagctga ctaatcatgc tgttctgctt gtgggctatc ngcactgact cagcctctgg 24 0 

gatggattac tggattgtta aaaacagctg gggcaccggc tggggtgaga atggctactt 300 

ccggatccgc agaggaactg atgagtgtgc aattgagagc atagcagtgg cagccacacc 360 

aattcctaaa ttgtagggta tgccttccag tatttcataa tgatctgcat cagttgtaaa 420 

ggggaattgg tatattcaca gactgtagac tttcagcagc aatctcagaa gcttacaaat 480 

agatttccat gaagatattt gtcttcagaa ttaaaactgc ccttaatttt aatatacctt 540 

tcaatcggcc actggccatt tttttctaag tattcaatta agtgggaatt ttctggaaga 600 

tggtcagcta tgaaagtaat agagtnttgc ttaatcattn ggaattcaaa catgctatat 660 

tttttttaaa aatcaatgtg aaaacataga cttattttta aattgntacc aattacaata 720 

aaaataatgg gcaattaatt tttnaaaact ttttaaaata gnatgctcat atttttaaaa 780 
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ataaaanttt tnc 793 

<210> 775 
<211> 1009 
<212> DNA 

<213> Homo sapiens 



<400> 775 

agcntttttt ngaanttccc ctttnnttna aaaatcccct tttttggcaa aaaattnccc 60 

ccntntntna nngtttttnn gatncccaca tncngnaatn tncgggcncg ggnnactgnc 120 

nannggcncc cttcgggggn ccngtgntaa gncnatnctt gtntntanaa agntggnnnt 180 

nttttncgat ngngactatt gncnacnctc ttccntnttg gcagngngtc tgganggttg 24 0 

nggtngctca tntggntaan ccnatcctgg ngaccaanng gccgnggtgn gcntgcaagc 300 

tttgnccacn tgggaaancc gnnagtggtn gtctcanttg cntgntgggn ncntgncccc 360 

atcttgnctg ctgnancctt ggggagcagg nnctnggtng tggtnctgcc tgcttgctgc 42 0 

tngttccccg ggcatgcgtn nncannaagg gncatgcntn gggcaanaag gtgcgtggnc 480 

ancgtnngna tnnnnaggac caccntgggt cgngaatcnn tgggttncct gataggaacc 540 

ntnaannnct gcngntttta ttaaatggga nnanangggt ncanttcaaa gccagtnnaa 600 

tgcccttatg gaanggngtg natnacatan cnnnntatgt gtcntanann angaaatcgt 660 

tnnncaaatt tnnacaanaa tntttntaan aaagggtatt tnantntngg tgaaanaaca 720 

angntttaaa gtnaaatgnt tntancanaa ttaantaaac nggtnttnat gattncttac 780 

naaantaacn atncnnaagc atttacngct tanangtccn cnngatactn ncanaatatg 840 

gnnnnaattn tannanatng cgataatctn gnananactn tcatnnnnna tngtgtaatc 900 

antanntacn tgatttnnnt naaatgaaaa catntgatnc aagattaatn cattanntat 960 

acnaaaatnt tcanatanta natntacata taatggtttc naataaacn 1009 

<210> 776 
<211> 785 
<212> DNA 

<213> Homo sapiens 



<400> 776 

gnnnnnnntt cccctttcta atcncttgga nntcgctctn tntgnangat cccatngatt 60 

cgaattcggc acgagagaaa cacaggtgtc gtgaaaacta cccctaaaag ccaanatggg 120 

aaaggaaaag actcatatca acattgtcgt cattggacac gtanattcng gcaagtccac 180 

cactactggc catctgatct ataaatnngg tggnntcgac aaaagaacca ttgaaaaatt 240 

tganaaggag gctgctgaga tgggaaaggg ctccttcaag tntgcctggg tcttggataa 300 

actgaaagct gagcgtgaac gtggtatcac cattgatatc tccttgtgga aatttgagac 360 

cagcaagtac tatgtgacta tcattgatgc cccaggacac agagacttta tcaaaaacat 420 

gattacaggg acatctcagg ctgactgtgc tgncctgatt gttgctgctg gtgtnggtga 480 

atttgaagct ggtatctnca agaatgggca nacccnaaag catgcncttn tggcntacac 540 

actgggtgtg aaacaactaa ttgtcggngt taacaaaatg gattcacttg accaccctan 600 

aggccngaag agatattgan gaaattgtta aaggaagtca gcacttncat taagaaaatt 660 

ggcctacaaa tccnnganac aataancatt tgtgccaatt tnngggttgg gaatgggtga 720 

ccaacattgc ttggagccca agtgnttaac aatgccttng gttnaaaggg antggaaaag 780 

ttacc 785 

<210> 777 
<211> 1366 
<212> DNA 

<213> Homo sapiens 
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<400> 777 

ananaanann annnnnnnaa ggnnaanana nnnnnannnn naanangnaa anananannn 60 

tnnanaannn aagnngnttc nanncttttc aaagcttgga aaacgcannc aarmnnnggg 120 

aaagcaagaa agaacagcta aagnnngncn cagaganagc ttttangang tntangaaga 180 

aggaatannn gnggncaata nnnnannnnc ngaaantatc atganacnca aatganggan 240 

aaggcagcac aagctgngca aacagctatn gngacggggg ggccgggaga gnctaaangn 300 

cananatnca atatataagg actgcatgcn aagggatacn aaacaagnan actnntctag 360 

gaagaaataa ntnttgacnt ancnnacntt cataacgaat agcaccgtac atcgagncaa 420 

ccaactaana ggnctaagga aatggcaaan nacnttaatn nntgagcnaa ggaagggngt 480 

atngnccnan anngaaatgc ntcntaacca anttttaatn gtaacggnat nangatnaan 540 

ncntnanccc acgcaactca aaaanattac attanntaaa aaaganctat ancaaaacta 600 

gtnttcaaaa tngnacgagn aaatgggnaa nantttntnn ccgggaaaat tggnagagat 660 

ccanaaacac tggntnaggg naatanatgn ccgcccnaaa aaaccntnac cataggnatn 720 

ggctancata gangagatat ancnatnagg ggatcaanan cntaggnatt ngaaaantaa 780 

ncgagttaaa acancnagat nnggnantac gaganatagc ttggacgngt atcaaatcgg 84 0 

accctnggat gggcntangg aaaaanaaaa aggntngagn gaanttcctc anaggaanng 900 

tganagagcn aaanaanatn aagggccttg gngaaaangg aaaaacagat agngtcatnc 960 

natatatncn natgananan tggggnaatn taatctacnn tanatnnggg ggaaaaaaat 1020 

cnnncatgac nnnaaaanga gntaatgnna nnatgagaga ttaaacnnat aaaacnagag 1080 

aantttgngn aaanctgnga gataaaaaat aaataaattc tntntggaac atntanaccn 1140 

tctatnnaaa aaaaagaggg gaaaccatct ngattatgca cananaaatn tnacntngng 1200 

gaaataaatn gggnacaata acatatatgn ggatgtacan tnntggncng aaaaactata 1260 

caacntgaga nnnnacnang atataaagcn nnaggnagtn tatangggca tcatcaangg 1320 

gaagntataa agcaactgna nnctcatata naaaactgnn cnncaa 1366 

<210> 778 
<211> 775 
<212> DNA 

<213> Homo sapiens 
<400> 778 

gntttnnatn cctctttcta atnncttggc tactcgntct ntctgnanga tcccatcgat 60 

tcgaattcgg cacgagagat tatgagcatg tagaagatga aacttttcct cctttcccac 120 

ctccagcctc tccagagaga caagatggtg aaggaactga gcctgatgaa gagtcaggaa 180 

atggagcacc tgttcctgta cctcccgccg ccgaacagtt aaaagaaata tacccaagct 24 0 

ggatgctcag agattaattt cagagagagg acttccagcc ttaaggcatg tatttgataa 300 

ggcaaaattc aaaggtaaag gtcatgaggc tgaagacttg aagatgctaa tcagacacat 360 

ggagcactgg gcacataggc tattccctaa actgcagttt gaggatttta ttgacagagt 420 

tgaatacctg ggaagtaaaa aggaagttca nacctgttta aaacgaattc gacttgatct 480 

ccctatttta catgaagatt tttgttagca ataatgatga agttgcggag aataatgaac 540 

atgatgtcnc ttctactgaa ttagatccct ttctgacaaa cttatctgaa agtgagatgt 600 

ttgcttcttg agttaagtag aagcctaaca gaaggagcca accacaaaga attgagagaa 660 

atnaacaact gggccttngg aaagaaangc nggccaagct gcttgagtaa tagtcaganc 720 

ctanggaaat gatntggtta atgaattcac cccaggncac acccngttga agagc 775 

<210> 779 
<211> 781 
<212> DNA 

<213> Homo sapiens 
<400> 779 
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gcttttnann nccctncttt cnaancctct tcaaatcctt ggntatcgtt ctntctgnng 60 

gatcccatcg attcgaattc ggcacgagag acaaagaaaa aggtggcaat catagaagag 12 0 

ttagtagtag gttatgaaac ctctctaaaa agctgccggt tatttaaccc caatgatgat 180 

ggaaaggagg aaccaccaac cacattactt tgggtccnnt nctacttggc acaacattat 240 

gacaaaattg gtcagccatc tattgctttg gagtacataa atactgctat tgaaagtaca 300 

cctacattaa tagaactctt tctcgtgaaa gctaaaatct ataagcatgc tggaaatatt 360 

aaagaagctg caaggtggat ggatgaggcc caggccttgg acacagcaga cagatttatc 420 

aactccaaat gtgcaaaata catgctaaaa gccaacctga ttaaagaagc tgaagaaatg 480 

tgctcaaagt ttacaaggga aggaacatca gcggtagaga atttgaatga aatgcagtgc 54 0 

atgtggttcc aaacagaatg tgcccaggct tataaagcaa tgaataaatt tggtgaagca 600 

cttaagaaat gtcatgagat tgagagacat tttataggaa atcactgatg accagtttga 660 

ctttcataca tactggatga aggaagatta cccttagatc atatgtggac ttattnaaac 720 

tatgaagatg tactttnaca gcatncattt tacttcaagg cagcaagaat tgcttttaga 780 

c 781 

<210> 780 
<211> 783 
<212> DNA 

<213> Homo sapiens 
<400> 780 

gnntttnnan nnccngnttt ctaatnctnt tcnaatnctt tgnnancgtt ctntatgcan 60 

gacccatcga ttcgggaatc tcctagaaaa gttgtgattt tcgagccata tccttctgtg 120 

gtagatccta atgatcctca natgttggcc ttcaacccca ggaaaaagaa ctatgatcga 180 

gtaatgaaag cactggatag cataacttct atcagcnaaa tgacacaagc accatatctg 240 

gaaatcaaga agcaaatgga taaacaggac ccccttgctc atcccttact gcaatgggtt 300 

atatcaagta atagatcaca tattgtgaaa ctgccagtta acaggcaatt gaagtttatg 360 

catactccac atcagttcct tcttctcagc agtccaccag ccaaagaatc caattttaga 420 

gctgctaaaa aactctttgg aagcaccttt gcatttcatg gctcacacat tgaaaactgg 480 

cactccatcc tgaggaatgg tctggttgtt gcttctaata cacgattgca gctccatggt 540 

gcaatgtatg gaagtggaat ctatcttagt ccaatgtcaa gcatatcatt tggtactcag 600 

ggatgaacaa gaaacagaag gtgtcagcca aggacgagcc agcttcaagc agtaaaagca 660 

gcaaatacat cacagtcacn ggaaaaaagg acagcaatcc caattcctgc caaagccgta 720 

acttaaaatg catagncctt atgtgaaagg gatcaccttc atctggacct gcacaaacat 780 

783 

<210> 781 
<211> 796 
<212> DNA 
<213> Homo sapiens 

<400> 781 

gnnntncgcc ttcaatnctn ttcantctnt tcaatctttg aatcntcttt gttgtccatc 60 

gttcaattcg gacgagaccc ttatggcaga tccccacagt ctggggcaga agaggcgtcg 120 

aggngccaga agtgncggca gcagcagccg cagcagccca aagagaggca agagaaagag 180 

aaagcggccg gtggaggggt nnncggaaga gctggtcccc gtggttgagc tgggtccccg 240 

tggttgaatt ggaagaggcc atagccccag gctcagaggc ccagggcgct tgggtctggt 300 

ggggacgcgg gggttgcccc caatggtgca gctgcagcag tcaccactag ggggtgatgg 360 

agaggaaggg ggccacccca gggccattaa caaccagtac tccttcgtgt gagccaaccc 420 

cacccgctcc acccttttta aaccccccag cccttgctcg tgagattggg cttgggtagg 480 

gacagaagag gcccgaaatc cctcccccat gcttnctgac ccttgtttgg ccaaagggca 540 
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tctttgatgg tacaaagcag angcttcggg anaagcttcc gtcacaacac tncaaggtcc 600 

cttcccaagg gcaaggggat ttnggcttca tgagctnctt tgaggggctt ttttttggtc 660 

annccccacc ttnggggcca tttttcccaa ttaacttacc cccaacccca agncanggtt 720 

nagggggnaa agggctttcn anttccatta aagggggttt gtttgttgnt gttttaaacc 780 

aaaatgggga aancnn 795 

<210> 782 
<211> 886 
<212> DNA 

<213> Homo sapiens 



<400> 782 

cggnnnnnnn gnagcccntt tggnaaangc ctctaaggga aangcctttt tgaaaacnan 60 

angaaaacct ntgggaaaag nccncannna ttttngngaa annggcnnga gcnnanantn 120 

ggacacngtt ntaannnnan nagngnnngt tttnnganan agggnnnnna gnggnannna 180 

ngngnnggag ggaannaagg nanagnannn ggnagnnaag gnnnnaaaga agnagnnang 24 0 

gaganggnnn gnggnggggc atgangnggg nncagaggca cgaggagccc aagaccatca 300 

cngangagna ngagcagggn accnacatnn acnnggacna cgagaagngg ggccagcgga 360 

agaaggaagg nagnacctng agnaccgnta ccaggaggan cgggaccnac agngacanag 420 

gnccnnnncn anacggannn nanaaacgng aagcaggann nnnanggacc aagggaaggg 480 

nncngnncnn ggaaaganng ggagggaggn ncgaaggcaa aggggggann cgnnannncc 540 

aggaagnang gaaggggggn cgggaggnna annganaaga ngaaccnngg gggnncaggg 600 

gggcgagggn agcanaannn nnccnnagnc aanngaaggg gananaagag ngggaaaann 660 

aannagaaag agggaaaana agnnaaggaa anaaaagang ngnnaannng gganaaaana 720 

nggnganann grmgganaaa ngngnannan aaaanngagg aggncanngg gnaaanaana 780 

nggggagggn nganananag ngaannagac aaggaanagn gaannagngn anagnanngn 840 

gnannaaagg nannggggna anaagnanna nannnnnagn gaagan 886 

<210> 783 
<211> 805 
<212> DNA 

<213> Homo sapiens 



<400> 783 

cnaatncttg ctcttgncct ntttcnaatn cttggcnact cgctttctnt gcggatccct 60 

cnnganncna tcgttcgaat tcggcacgag cacaaggaga agaaagttaa ttaacattga 120 

aagatgagaa gacatcttgg aagacttgaa ttgggccttg gaagaagaac agccattcaa 180 

atagatagaa ttgtggtagc aaaggcatac ngntcggaaa gtatagatct ccagggacag 240 

tagtcatggg gttggggcac tgttggaatt taaggttgga aggatatatt ggagcccctt 300 

gaatacggta acaaggcaca ccttgggcag tggagagtta tcagagtgtt tgaaaaggag 360 

ggttattgag taaataaata gactggtact ttaggaattt taaaatgtgg atcattgtac 420 

tactaataac tatntatttt atatttacta tctactaagt aatttacatg tattttcttg 480 

tactgactgt aaaccttctg ggtgtgggtg ttttaagtgc cattttactg ataaagaaac 540 

tgangcttaa atagntgaaa tanntcaccc tgttagtgag tggcacaatg acaagtcann 600 

atcttanggt tgccnanntc caaaanncat ttaaanttnn agnatnattg annnttttnc 66 0 

cttatggcnt nnnaaatttg gggagccatt attgaaatcc nttacnacnt angaattgnc 720 

caaaaaaaat actttttggg gaaaactgga tttattaatt atccaaaata atttnantgg 780 

cttgnttggc ttntttccac tntnc ~~ 805 

<210> 784 
<211> 776 
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<212> DNA 

<213> Homo sapiens 



<400> 784 



taatgctggt tactgccctt 
atcgattcga attcggcacg 
ataaagcctg caagttacta 
gcagngtgcc aaaggtgaaa 
tcgtanaaaa acgtctttac 
tattggatga atcatttgcc 
gagtatgtgg gtatagaaat 
ttaatgggta gatcataaat 
tgtgcccaga tatgtnttgg 
ttgctatagn gnnactattg 
gccaagtaaa ctgtactgtn 
aanttnagaa cngatgtctg 
catntttcgg ngctgaggtt 



caaatccttg caatcccttg gnaancggnc cngcngaccc 
aggttatatt aaattattct ttgntnttct ttgtctttta 
aattgnagtt ncataaattc tgtagtnaag tatcatcttg 
angntgcttn ctctaacaga gaaattctta gngactccag 
aacctgaata agatnganga attgngaaca taccatggcc 
ggnggctana ncagactgta gggtttgtga tggatntatg 
catgaatntn ccatttgnnn ncagagattc aagcntanac 
gacagaatga attcaaaacc tagcacgtgc attgtaaatg 
aaatggcagn tccttggggt catgtntcta ctggcaaaat 
nantgtaatt ataaaattna tcannattat ncaccgattn 
cataggaatt ttgggaattg tgcanaaatt ggatcaattg 
ggcttaaaaa tttatcnggg accacnnatt angaaactna 
cattgnccaa ggccangaag gtntttnccg aaaanc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
776 



<210> 785 
<211> 778 
<212> DNA 

<213> Homo sapiens 
<400> 785 

ttngaaaacn ccttngcttn gttnccccta cngaaaccct tttgaaaacc ntttgcnann 60 

tcctctttnt gnaggatccc atcgattcgt gaaagaggag atcggtgacc tgggctcctt 120 

atgtgcctga atgagtttga gtttcctgtt aactccaaat caacagtatt ttcaacaaga 180 

aatgtgcaat tgaaatcaag tgctgtttaa gtgcagctag gantccacag gaagacactt 240 

gcagtgaaca gagttatgga gcagcaaaaa cacagatcta tttggaaaaa gagaaaacat 300 

atgcgttgta ttttgcttca attataaaat accatcctct caaaggtggt tctaaattac 360 

aaaggacttt gatttctagg tagattctgg gtagagactt cctttcatat tgaggcatta 420 

atgacacctt ttaacctggg aagcaatatg actggagttg tactttgaga agattaatca 480 

ggtttggttg cagaatgaaa gagaagatga agtcaagaga ttggtttaga ggctctagca 540 

gaagcttagt catatttcaa aatgatcaaa tatcaagaaa aattctgagc tgcataactt 600 

gtataaagta attttcagtg atttttttca tggttatgat aaaagaactg gattagcaga 660 

aacttttacc ctgaatcaag atttaatttt tctttgagct catcttaagg atatcggaac 720 

atagggagca aacgatggtg tggctgcctc antgcttgaa ttttaacngt tttgaaan 778 

<210> 786 
<211> 805 
<212> DNA 
<213> Homo sapiens 

<400> 786 

ngccccccct ttcccccctn ttgaaanccc ctttggnana nnccnntttc aaatcncttg 60 

naaatccttg gcnactcgtn ctntctgcag gatcccatcg attcgaattc ggacgaggag 120 

aggatcactt gagcttagga gttcaaatcc agcctgagcc aacataacaa gactttgtct 180 

ctaaacaaaa cagttattgt ttaaagaatc tgaaatcttc atctttaatt caggtagccg 240 

tgaatcgagc ccaagtttgt ttgatatcca gttccaagtc tggagagagg catctttatc 300 

ttattaaagt atcgagagac aaaatatcag acagcaatga ccaagagtca gcaaattgtg 360 

atgcaaaagg gctatcaaag ggaggctttt tacagagaac taaggaagag aaggaggttg 420 
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ttaaagagac ttgagatcag aaaaagatca agaacaactt gaatctcaaa gtatgaattt 480 

gaagtatttt gctgagcaaa catttgaatg cctgtatgta ccgtaatcct ctatcactgg 540 

ggtccccaac cccggtacca gcccgtggcc tgctagggac tgggccgcac agcaggaggt 600 

gagcagtggg tgggcaagcg accattccca cctgagcttc ccctcctgtc agatcagcag 660 

cagcgttaga ttctcatagg agtgcaaaac cctattgtaa actgcccatg ccaagggatc 720 

tangttgcaa cgcttcctta tgagaanttg aatgcctgan ngaactgtca ctgncttcca 780 

tnaaccccca gatgggtact ngttc 805 

<210> 787 
<211> 775 
<212> DNA 

<213> Homo sapiens 
<400> 787 

ccttggnnag nngccccctt naaanccttt gaaaaccctt ggcaaangcc ctnncngnnn 60 

gatcccatcg attcgaattc ggacgaggag aggatcactt gagcttagga gttcaaatcc 120 

agcctgagcc aacataacaa gactttgtct ctaaacaaaa cagttattgt ttaaagaatc 180 

tgaaatcttc atctttaatt caggtagcac cgactcgagc ccaagtttgt ttgatatcca 24 0 

gttccaagtc tggagagagg catctntatc ttattaaagt atcgagagac aaaatatcag 300 

acagcaatga ccaagagtca gcaaattgtg atgcaaaagg gctatcaaag ggaggctttt 36 0 

tacagagaac taaggaagag aaggaggttg ttaaagagac ttgagatcag aaaaagatca 420 

agaacaactt gaatctcaaa gtatgaattt gaagtatttt gctgagcaaa catttgaatg 48 0 

cctgtatgta ccgtaatcct ctatcactgg ggtccccaac cccggtacca gcccgtggcc 540 

tgctagggac tgggcccgca cagcaggagg tgagcagngg gtgggcaagc cgaccattcc 600 

cacctgagct tncccctcct gtcagatcag cancagcgtt agattctcat aggagtgcaa 660 

ccctattgta aactgccatg cnagggatct aggttgcacg ctccttatga ggaattgaat 720 

gccctgatga acttgncact gncttccatc acccccagaa ngganctggc taacc 775 

<210> 788 
<211> 774 
<212> DNA 

<213> Homo sapiens 
<400> 788 

gaaacccttt tgtnaanagc cncttcaacc cnttctaatg cttggcaatc gctctntctg 60 

cangacccat cgattcgaat tcggcacnag attatttcca aagcagccta cagtagaaaa 120 

tagtcattat ggcagcagct tctgatgttt ttgtttggta ggttttctga tttcaatata 180 

tagaatcata ttcatagagt atcttctntn ccgcctngca caaagtaccc atttaaaatt 240 

tacatgcaca gttcattgcc acctttctta ggcctatgca tagttaataa ggttataatc 3 00 

tactcaacat ggaaaatgga gcctatttgc aaacacacaa gtaattaaag taccaattct 360 

ctcttagttt ctttttttat agttggttta ttttgcaatt ataaatgtta aacatcccta 420 

gagatgaaag ttaaaatggt tgatcacaga tcagtagcaa aatacaaatt gacaattcaa 480 

aattataaat aaaactctgt tgaggatgtt taactttgag tctccaaatt taagagctaa 540 

gcttggaaga aacaaattta taggttatat ttccctctta aattaaanaa acaaacttcc 600 

tctggcagta gtttggtgaa ttcctttcat tgnaatgata ccatgattac aggatcaaaa 660 

atgcttaact tacttgccat tctgctcaca tcatcacagg ttgttntttt tttaaagcac 720 

tcnatgtagg cattttaaac cttcnggata accagagtat cttttgagaa annc 774 

<210> 789 
<211> 773 
<212> DNA 
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<213> Homo sapiens 
<400> 789 

ngcccctttg aanccnacng aaatcctttg gcnantcncn ctntctgtng gatcccatcg 60 

attcgaattc ggcacgagag cagatttgng ataaacntnn tgnaggttna accnaagggg 120 

aactnntggt gcaactatgn ngnttggaag atgctgcnta tgtttattga ggattgcann 180 

anananatcc tgaatnctcg ccntttncaa aggcttggat aaagcactca agccagctac 240 

atatgtatag aacggnttaa aatcnatgag gaagcctgga ctaaatatnc catnggactg 300 

gngccnanaa ngctgncgat gaactttgna tctggnnaga agtntaaaga atggcaggat 360 

nantnnctaa ngatgaattt cannacnggn nnnccaccan tcttnaatnc tttaagatca 420 

ttatacgaag ncnangaaaa ggtggcaatc atngaanaat gngnatnatg ttangaaacc 480 

tctctaaaaa gntgacggca ctttaacccc natgatgatg ggaaggaggn accaccaacc 540 

acattanttt ngggtccagt actacttggc acanccttat nacgaaactg gncngtncnt 600 

ctattgcttt gggagtaccn taaaatacng ccntngngag tncacctnca atgaatnnaa 660 

nctctttntc anganagctn nngatccata ngacntgctg ganatnttta aggaancttc 720 

nanggnggan tggattaggc ncaggccntt ggacacancc ntncttnatt trie 773 

<210> 790 
<211> 953 
<212> DNA 

<213> Homo sapiens 
<400> 790 

aanannnngg gnnnnnnnnn nnnnannnnn nnnnnnnnan nggngnnttn aaanccttnt 60 

aanngncnnt ncngcttnaa accttggnaa ncnccgcccn nttgeannaa angngaannn 120 

atgcttngtg aagectgann ccaaanctna aggnanggac ctggatcccc ttatatngaa 180 

naancggtnt ggaggaanga gnntgtcngg gaggatgggg cagaaaatga ngnnggcaga 240 

ntggncccgg gggctctgea naccagcctt ggagectget cattctgggc ccttgctgcc 300 

aagganccca gcctnaccta gcangaaang anatgaaagc ccttctccca ngaggtaggg 360 

tetaggctge ccnaacttaa atgeattnag aaanctcnta gatgtggaaa natttttneg 420 

aacctgaaaa tgcagctggt anaatntcaa tgggaagcat aaatncatgt aaaatataat 480 

tnagntngaa tatnanngta aaaatgeact tttnngcggt gtgacngatc ctgggnnccc 540 

annatctgnn attnaagngn tttacnaang gaanggaaag gacctttncc taaactacct 600 

ttttgaacag ancattaaga angnncnttc ttttaagnaa aaaaaaatca aattttgang 660 

aaaantggna ttngaatgtn nagaaaaang gatananaan aaaanccaat nntaannacc 720 

nannctctct gganttcnac tatctccact acntacntnt acntatngcg ntaanatnna 780 

ctnttacntc nnnntant cn cacanacntc ntcnaacnta atnangenen canaatcctc 840 

tatannatnt antgtnnntc acannnenna cnggntaant ntnnncaacg ccatatcacc 900 

nctnnnatcg ncnagntana taacacntat atcgncactc ncacananac tec 953 

<210> 791 
<211> 798 
<212> DNA 

<213> Homo sapiens 
<400> 791 

tggnancgen ctntntgttt gatcccatcg attcgaattc ggcacgagga tcattgttaa 60 

ttagtgacat agtaacatct gtagcagctg gttagtaaac ctcatgtggg ggtggggtgg 120 

gggtgtattc cttgggggat ggtttgggcc gaatggggag tggaatattt gcncttcncc 180 

tgttttaaat tctaggatag attttaacat ectttgeggt cccagtccaa ggtangctgg 240 

tgtcatagtc ttctcactcc taatccatga ccactgtttt tttcctattt atatcaccag 300 
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gtagcctact gagttaatat ttaagttgtc aatagataag tgtccctgtt ttgtggcata 360 

atataactga atttcatgag aagatttatt ccaccanggg tatttcannc tttgaaacca 420 

aatctgtgta tctaatacta acccaatctg tttggatgtg gattttaaaa aaatgtttgc 480 

taaacctacc caaagtnaga tttacctgna tttaaatggc ctttngggtc ttgaaaaagc 540 

tttntnaacc tcttggcttt aaaatgcgtt ttattctnga taagatactt cnaaatancc 600 

tnncaaaagg tgttngatnc naattacttt aaaataaaac ctgtaattgn ataatgncat 660 

aatgntgntc catgcctnan tccccttcta gnnntanaaa cntnantaan aantatatca 720 

atnntcgatn aaatnntann actataaaaa ctncggccct cttananact tnatncttga 780 

agttctcant ataaccnc 798 

<210> 792 
<211> 788 
<212> DNA 
<213> Homo sapiens 



<400> 792 

ctnttgttct ttttgcagga tccatcgatt cgaattcggc acgaggcaga gctcacatcc 60 

tgtgcgcagc atcttctgtc ccctcatgtc cttccgccag ggggcctgcg tggtgacggg 120 

cagtgaggac atgtgcgtgc acttctttga tgtggagcgg gcggccaagg ctgctgtcaa 180 

caagctgcag ggccacagtg cacctgtgct tgatgtcagc ttcaactgcg acgagagcct 240 

actggcctcc agtgacgcca gcggcatggt catcgtctgg aggcgggagc agaagtaggg 300 

tcctgtcngc cctgctgctg tcctccatcc cacccctctt actccacctc gtgttgtaaa 360 

taaagtttcg gtggtcatgc tganggccgg ctcccagctc tgccggggac ggacagggca 420 

gaaggcancg ggcaacttca ggaacacggt gaaaaaaaaa aaaaaaaaac tcgagcctct 480 

agaactatag tgagtcgtat tacgtagatc cagacatgat aagatacatt gatgagtttg 540 

gacaaaccac aactagaatg cantgaaaaa aatgctttat tttggggaaa atttgggatg 600 

ctattgctta atttgnnaac cattntaaac ctgcaaatta aaccaagttt aacaaccaan 660 

caattggcan ttcattttta atggttttna aggttcaagg ggggaaggtt tttgggaagg 720 

tttttttaaa attnncgggn ccnnnggngc ccaatgcatt tggggccccg ggnccccaaa 780 
nttttttt ^ " ^ 

<210> 793 
<211> 806 
<212> DNA 
<213> Homo sapiens 



<400> 793 

gaatcccttt gcttctgtcc tttaagnnat cgttggaaca accatgnctt tttgtaggtg 60 

aagtgttctc tctgcatgca acagtaaaaa ttaatataat atttttncca caaaagaaac 120 

acttaacaga ggcnagtgcc aatttataaa atttatgatc taaaggggga aatcatggat 180 

tataaagtcc ttcagccctt tgggactcta aattggnggg ggattaaaaa gaatttaaaa 240 

taattttnga accgaattta ttttcccctc agtttttgag ggcattaaaa aggcattaaa 300 

tcaagacaaa tcatgtgctt gagaaaaata aaattaatga aaacncagca ctttatgttg 360 

gtttaacntg cancctnctt tggaggtaga atttatttat ttaaaattac tgggtgcatc 420 

angaacccat agggtgtaca aaangttcta ttaaaatctg cnttatagag acaaagaggc 480 

aggcaaatcc atgtnacaaa gggtaaagct tacagtttac aaactgngaa cgccanggtg 540 

taggatataa aaacgcactc ttgagaaaac anatggtcat cagggtgctg aaaacttgca 600 

tggtgctttt caacattagc ctttggtcca caaatttctt gtatttgaca ggatccatag 660 

tgtgccatgg ggcaaganac nattttgccc tctatggtnt tctttaaaaa ttttcanttt 720 

aaaaatacct cttttnncag gaatcctaat tttggcnccg aagcntattn ntggtnccac 780 

atttaccgtt gcccttgccn ttggan 806 
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<210> 794 
<211> 815 
<212> DNA 

<213> Homo sapiens 
<400> 794 

tttcaaatnc cttggcttta nccctttgtt tganntcctt gttcgaattc ggcacgaggc 60 

cttctctggc ctcaccaatt aggtcaaatg ttccttattt tgtgttgtgg ggcatggctc 120 

tncctgtgag gacctgtccc agcttggacc tccgccttcc tgcgactgta ttggtgtctn 180 

tccctctcaa gcctatgagc tcttgcaagg gcagggaccc tgtatgattt tgcctatcgt 240 

atgtcctcca gcccccagca cangcgcctg gtgtccagtg agagctcagc aaatactttg 300 

tgagttaaan gacangcggg cttggggtag atggatccgt ctgcctanac agggcangtt 360 

attcccgctt gtgagcaact cttaanagaa acttcatttt ttttcggcgc ctgcncgaac 420 

tttcaaagat gtttcccggc cangaacngt ggctcacacc tgtaatccca gcactttggg 480 

aggcttgaag tgggtngatc accttgaggt cangantttn tagaccagnc tggccaacac 540 

cggtgaaacc ccgtcctctn ctaaaaatac aaaanttaac tgggtgtngt tggtngggcg 600 

ctttgnantc tcactacttn ggaangctga ngcnatgaan aatttgcttn aaccccngga 660 

nggcngaagt ttcaattgan gtcnanactt nancccattt gcgccttcan accctggggc 720 

aacangtatc annaacttna acnattaaaa aatnaanana nctcttatcc ctttannaac 780 

nattattgan gntacntatt ntcntagaaa tccct 815 

<210> 795 

<211> 1050 

<212> DNA 

<213> Homo sapiens 



<400> 795 



tttctaatgc ttggctttga 
gcggcgctga cccggncggn 
ttcctgcctc caagacctgg 
cccatgggac ctagactcac 
ccctgttcca gctcccgacc 
ccagcnagcg gncttcagcc 
tttgcagaaa cccananggg 
ctgncctgtn ctgcatcatt 
cntgntcaca atcaancatn 
tacntcntan nnnnganttn 
ncatgnantn ttagtntntg 
anannaancc ntccttnnag 
atataanntn ntntnctnnc 
ggnnactatc ncatantaca 
nantnnnntc tcctnnanat 
natanngnnn anatcnannn 
nannnaanng tatnntngtt 
tacnccnntn anntaannat 



gncctctntt taaaatcctt tggcnactac tctgcacgat 
cccacacccg ctctttnctc ttctttgccg cggactccct 
gtgtctacaa ctgtgagccc agcttgnncc aaaggcagtc 
cttncccttg cctctatgaa accttctgct tgggcccanc 
tgcacttcct tgctgggact cangcctcca agctccctgc 
accgtcttcc cctttctttc gggccctgnt tgtnagcanc 
acctngtgcc ccttgcnaag nctgtcgcct tggtgcaaga 
ttncatggtt gncgggggtg tggggntnnn cnngncgnnn 
tatncctnan ntngggtatn acnaatggcc tnaagantgc 
tcangnnntn ttactaacnt ncnatngnnc ntnganatag 
atntanccnc nattgcagcc ncataattat cctacaccac 
aanntgncnt ctatgnaana gnctnnnaat gtggcnncna 
atcntannnn nntcctacgt nannnnncat nnncnctntn 
tcnntnannn cacccatnct nntntnanat ntctcntggg 
ncnctaatna ngatctctca nntacatgan ntanatnacn 
ngtctctcnt atnnnttatn nanngntcan nttacnnnan 
cnaaanntat ntataaancn ncgtnnnttt nnannagatg 
ctangctccg 
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<210> 796 
<211> 884 
<212> DNA 



<213> Homo sapiens 
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<400> 796 

ggnnntttng agctcggaaa tcncttnggt nnagcctttc nttgacccca ttgttcgaat 60 

tcggcacgag acggcctggt ggagcagctg tncgaccttt ncctggagtt cctgcacagc 120 

caggcacact gcatcggctt cccggacctg gggctgcctg tggtcctgca gntgaagtcg 180 

ttcctccggg agtgcaaggt ggccaactac tgccggcagg tgcagcagct gcttgggaag 24 0 

gttcaggaga actcggcata catntgcaag ccgccgccag agggttncct tnggcgtttc 300 

tgagcagcag gcagtggaag cctggganaa gctgacccgg gaagagggga cacccttgac 36 0 

cttgtcctac agccacttgg cgcaagcttg cgttgaccgg ggaagatcca acttgggaga 420 

tcaanngggc aaaagaaccg gcttggaaag acctggaact ttcccttgag atcaaaaccg 480 

aaanggaaga atgggcttga canggaangg atgaaggaca gggaagccaa ttttaaaaga 54 0 

ccctctttga cctgnacaag ctcttgaaaa aggacgacac ccgaggggat tcttcggaga 600 

nnagggatac tgangccccc tgagcacctc ggcatggggg tngggaagac cattgnaaac 660 

aaggaccaag gaaggaaggg ccnaaggaag ggacaagcan ncaaactcgg aanggntgna 720 

atgggncctt ngggantngg aggaacccca naaccccaaa aaggccgggg ggcttgggcc 780 

cccttggggg gaancttnnc aacaaaatnt gggccccaag ggggccccgg aaaggaacga 84 0 

aaccttggaa gggaatcttg ncaagcttct tanaaaaggg ancg 884 

<210> 797 
<211> 773 
<212> DNA 
<213> Homo sapiens 



<400> 797 

taatgcttgg ctctgtctnt tgttgacccn tngttcgttt gtgcctgagc acccacaatt 60 

tcaggattta gactgtgtgg gcacctcagc tttcctctgg ntgtaaccac tccttggtga 120 

nagagggaac tcctaaccan tcccatttgn caaaggctag gcaatcttca ttctgcttgg 180 

ctttagtcat tcttgtcatt gggctgcaga agaaaaacaa ctttgctggg tgatcccact 240 

gccttgattt cacctcggan cgaggctggg ccatgtccaa gtcttatgag gtcaccctga 3 00 

ctagaaaaaa ttgaactcac ctacaaatag tctgaaagag tggtgtatat caaatacgtg 360 

ggtagtgttg catttcaaat gangctcttc tgggttgaaa tgatatattt ataaaaccag 420 

aatatcaaaa atgggtgatg tataatgtct ctttagtttt tttggtattt ggcctctttt 480 

aaagcctgtc ngatgtatgg gagaaaaaca atgaaccgtg ctttgatttc ctatcaagtc 540 

actcttaaga acatacatat tggttaaagt aactcggtct ttttttatct gattctttga 600 

ggcactatgg gtagcaaaat aaccacttac aaatttaaat gtaatataca cttcttttct 660 

gngtgtcaag tccttatttt tangtgccta attggacatt ttaaaaggtt aaattattng 720 

gttggcatat taatntcaaa aaatctatta attnatttta atgcctggta ccg 773 

<210> 798 
<211> 812 
<212> DNA 

<213> Homo sapiens 



<400> 798 

gtcaatnctn ttcatgaccc tatcgattcg aattcggcac gaggctggag cacgctggag 60 

aggccatccc tgccctggca gcccgcggct gggggagact cctttgcccc attctttgcc 120 

ggtttcctgc cattattggt gtgcaagaca aaacagggct gcacagtggc agagaagtcc 180 

tttgcagtgg ggaccttggc agagactatt cagggcctgg gtgctgcctc agcccagttt 240 

gtgtctcggc tgctccctgt gctgttgagc accgcccaag aggcagaccc cgaggtgcga 300 

agcaatgcca tcttcgggat gggcgtgctg gcagagcatg ggggccaccc tgcccaggaa 360 

cactttccca agctgctggg gctccttttt cccctctggc gcgggagcga catgatcgtg 420 

tccgtgacaa catctgtggg gcacttgccc gctgttgatg gccantccca ccaggaaacc 480 
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agaccccaag tgctggctgc ctactgcatg ccctgncact gaaaggagga acttgnaaga 540 

atgggtcacc atttgggcgc ctttttaact ttctgtacca gancaacccc ttgacaaggt 600 

tataaaatgt nggctccccg aaccttnttg cgtattcttg cagncctcaa ttcttggctt 660 

gaccaaccaa aggattccca cccangaaaa cccnaanggg cccnaaactt gttnncttgn 720 

ttnccttgga ccgtttcctt ggggccaaaa acaggnanaa cccggacang gttttttnaa 780 

accagntttt tggggcttta aattggcctt gg 812 

<210> 799 
<211> 758 
<212> DNA 

<213> Homo sapiens 



<400> 799 

ctaatagctt ttcattcnaa tgcttgtgat ccctcgattc gaattccgtt gctgtcggac 60 

agattgccct agtacccacc cacctatcag ggttatgcaa tggaacatcc tcgcccaagc 120 

tcttggagaa ggcaaagaca actttgtaca gtgccctgtt gaagcactca aatgggaaga 180 

aaggaaatgt ctcatcctgg aagaaatcct ggcctaccag cctgatatat tgtgcctcca 240 

agaggtggac cactattttg acaccttcca gccactcctc agtagactag gctatcaagg 300 

cacgtttttc cccaaaccct ggtcaccttg tctagatgta gaacacaaca atggaccaga 360 

tggttgtgcc ttattttttc ttcaaaaccg attcaagcta gtcaacagtg ccaatattag 420 

gctgacagcc atgacattga aaaccaacca ggtggccatt gcacagaccc tggagtgcaa 480 

ggagtcaggc cgacagttct gcatcgctgt tacccatcta aaagcacgca ctggctggga 540 

agcggtttcg atcagcttaa ggcttgtgga ctcttcagaa cctgcaaaac atnacccaag 600 

gagcccaaga ttncccttat tgtgtgtggg gacttcaatg canaccaaca gaanaaggtc 660 

tncaaacact ttgcttcttn cagnctnaac cttganagnc ggcctacaag ntgctgaatg 720 

cttgatgggc aatttagaac ccccatacac ctacctgg 758 

<210> 800 
<211> 770 
<212> DNA 

<213> Homo sapiens 



<400> 800 

ttnaaancng cnttggactc cttgcaggat cccatcgatt cgtttaaact gagctccaaa 60 

tgacgttcaa acacccctct cgggtagagt tttcatggtg gaacggttgc gcccaccaaa 120 

cagaagctta tgtttttggc acagaagcct gggccatttt catggacacc tggctggacc 180 

tcggtggaag tgaactccgt aggttgttgc gttcactgca gcacctcaca tgataccgtc 240 

ccctctcatg gaacggagcc tcccccatgc agcccccact caaatggagt tttaaaggct 300 

gggttcaggt tacgggggcg tttctcaccg tctgaatgcg gaggacagag acnagctcca 360 

gggagcgtgg gcgggtgacg gcgctgagat gcgtgatgtc tcggaaacgt cctcgcatcc 420 

ctcancgcgg gcgctgactg ccgcggccct tgcctgtctt caggagcgct ccagcttcgc 480 

ccacacaccc cgggctgatg tcccctcgct ccggcggcct gcagacccca nagtgcctgt 540 

ctcgggaggg ctccccattc acacgaccct gagtttgggt ccaagttagc ttctgtccca 600 

aagtaccngt attcccaaag cgcacccggt aaagganccg ggccggncct tntttgcggg 660 

gccgggggcc ggggccggga actcgtnggg ggttgccngg aanggggtta accgtnccgg 720 

ttnttccgnc cttncgtgca aggcttnccc cgttaagngg cccaaaccnt 770 

<210> 801 
<211> 573 
<212> DNA 

<213> Homo sapiens 
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<400> 801 

ggagccctag agctccacaa caggactcag agcctctaac cagttccagc actccagact 60 

ccagccacac tccaacacag caccatgatc ccagccaccc gctcgcttct ctgtgcagcg 120 

ctgctgctgc tggccaccag ccgcctggcc acaggtaggt ctcgccactg ccactggggg 180 

aggagggacc tctggtgagc gcagcctccc acagtcccgc tgaccaagag tcttctccca 24 0 

tagggcgcct atcgccaatg agctgcgctg tcagtgcctg cagaccatgg ctgggattca 300 

cctcaagaac atccagagct tgaaggtgtt gccctcaggg ccccactgca cccaaaccga 360 

agtcatgtga gtatcttccc ggttagcttc tgccacttcc agactcgccc aaaccctccc 420 

gcgcccccac acttctccta gtgggaatgc ctaacatgtg ggtctatcct tctctctgca 480 

gagccacact caagaatggt cgcgaggctt gccttgaccc tgaagctccc ttggttcaga 540 

aaattgtcca aaagatgcta aagtgagttg tga 573 

<210> 802 
<211> 1390 
<212> DNA 

<213> Homo sapiens 
<400> 802 

tttttttttt cacaaggaat atcattttat tactgtaatc acaaaatcgt aatttctgta 60 

caggaatgta taagtgaaca ttattcaaag cattggtaat tcacttcata aagagggtaa 120 

acatactaca gaacatattg taaagaaaaa atattgtaaa attttctggt cttgcagtgc 180 

actatttagt gcaagtattt aagacacaat agtgttcaat tcagcaaagt attgcagaat 24 0 

gtcatgccac agtccactta attcaaagag ggtcaggaca tgcagcttgt aataaaatgt 300 

cagagtgtgt gtgtgtgtgt gtgtgtgtat ataaaaccac atgtaattca taaaatatat 360 

agtggtttat ttagatggtt ttaaatgatt tcactgtgga atccagcata actggaacaa 420 

catccaaggt cttcttaacg gcaacaatct tattgctagg caatggcytt ggcttcaggt 480 

argaatgcyt cccagtattt tatcagctgt tgttgtgttt gaactagtga ttctaagtac 540 

ttgatgataa cggttttaaa atccttcact cgttctttct caaatcttcc cacttctttt 600 

cgaatcgttt tagatatctg ttcaaaatct ctttcccctt gttgcacttt cgcctcccac 660 

tctcttattt catttttagc ttgctgtatt ttatctggtt tgttagcaac catcattttt 720 

gcttcagctt cacgtttttt gagcaaagta atttgagcat cttcccattt ctgccagcac 780 

ttcattcgat ggtcaaacac acctttcact gcagcaataa gacgaatgta gtcactaagt 84 0 

agttctgaaa acatataaaa gtcagmaaaa gcttgttctt gatgtaactg gtctatcttc 900 

tcctcaacct ctgcaagctg agacaaagct ctagataaag cagtatgatc ctcagaatta 960 

cctaacatgg cagcactttt agcaaaggca gctgtgttgg ctgaaagttc ttttctatga 1020 

cagamcaagg cttcaacact gacatgaagt ttcctaagtt gctgatccag attctcaaat 1080 

tgctgctgct tttcttcaaa ccatgcatcc gattcattca tcttgattgt cattttgttg 1140 

acagcgtcgg cagccttgtt caccatcctc aatattcctg ctccactcag agcctgtgta 1200 

ttaactgctc taggcagctc tgaactttcc aagaactgcc ttaaatcagg atcctgtagt 1260 

aaagttggat gttttactgt tctttgaaga tacctttcaa gagctgctct ccgtttttct 1320 

acaaactcag tggatgatga gtcttcttta cccactttga ccttggtcat ccctactata 1380 

ctcttttctg 1390 

<210> 803 
<211> 947 
<212> DNA 

<213> Homo sapiens 
<400> 803 

ggaacttctg agtaattggt atcatttcct agtgactcgg ctcttgtact ccaatcccac 60 

agtaaaaccc attgatctgc actactatgc ccagtccagc ctggacmtky kkcwsgsagg 120 
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ngagagcagc ccagaacccc tggacaacat cttgttggca gcctttgagt ttgacatcca 
tcaagtaatc aaagagtgca gcatcgccct gagcaactgg tggtttgtgg cccacctgac 
agacctgctg gaccactgca agctcctcca gtcacacaac ctctatttcg gttccaacat 
gagagagttc ctcctgctgg agtacgcctc gggactgttt gctcatccca gcctgtggca 
gctgggggtc gattactttg attactgccc cgagctgggc cgagtctccc tggagctgca 
cattgagcgg atacctctga acaccgagca gaaagccctg aaggtgctgc ggatctgtga 
gcagcggcag atgactgaac aagttcgcag catttgtaag atcttagcca tgaaagccgt 
ccgcaacaat cgcctgggtt ctgccctctc ttggagcatc cgtgctaagg atgccgcctt 
tgccacgctc gtgtcagaca ggttcctcag ggattactgt gagcgaggct gcttttctga 
tttggatctc attgacaacc tggggccagc catgatgctc agtgaccgac tgacattcct 
gggaaagtat cgcgagttcc accgtatgta cggggagaag cgttttgccg acgcagcttc 
tctccttctg tccttgatga cgtctcggat tgcccctcgg tctttctgga tgactctgct 
gacagacgcc ttgccccttt tggaacagaa acaggtgatt ttctcagcag aacagactta 
tgagttgatg cggtgtctgg aggacttgac gtcaagaaga cctgtgc 

<210> 804 
<211> 532 
<212> DNA 
<213> Homo sapiens 

<400> 804 

cctctgccct cccaggttca agccattttc ctgcctcagc ctcccgagnt agactgggac 60 

tgcaggtgcg catcaccacg cctggntaat ttttgtattt tgagtagaga tggggtttca 120 

ccatgttggc caggctggtn tcgaactcct ggccctcaag tgatccaccc acctcagcct 180 

cccaaagtac agggnttata ggcgtgcgcc antntgcccg gccgagaaca atttntcaca 240 

agnttacttt tctagttttg ccaatgcatg gtgaaagtga acccaagcct gggaactgca 300 

ggcctagaca atgcaggrmm ykksttsamm cwsrsmsrmr smsstysmar ywmrsssagm 360 

cttggaaagg agaagtgtga ggcaggtgtg ggtaggacct ctttttagta cctagaaaaa 420 

ggctaagaaa gtggcctgga gatgtttaga aggttaaaac caacgaagaa aaaaatcaat 480 

gacaacctat caggaacgtg attgactctc agaatggaga actggcgaat eg 532 

<210> 805 
<211> 552 
<212> DNA 
<213> Homo sapiens 

<400> 805 

aatgcattnt tgatttttta ttgcagatga tgaaaaagtt ttagatatag acagtgeega 60 

tggttacaca atgttgtaaa tgtatttaat cccacttacg aatgattaaa atgataaatc 120 

ttatgtttat ttcatcacta ccaaaaggct gtgggtgcag gggtgctggt ttctggtcct 180 

agectaagag actggcagtt tccaccttct atctcttggg acagtagctc tgggagcect 240 

gagctgtcat gcaggaagtc cagctaccct gagaccacca tgctggaaag gecacaggga 300 

ggagctctgt ggacagtccc agctgaacct tgccttccag ctgtccctgt caagatgeca 360 

ggsatgtgag taaagecate atggacccty tagaccagac tgcccaccag cagggtaccw 420 

tctggcagcc acatggagca gaagaacege ccagctgagc cacttccaaa ctcttgaccc 480 

actaagtcat gatccacaat gaacccatca tagggatggt tggctttgea gtgtggataa 54 0 

tgaggatgtc at ^ ~ 552 

<210> 806 
<211> 1646 
<212> DNA 
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<213> Homo sapiens 
<400> 806 

aactagtata tttacaacat cagaaacttc aatatggaga tttgttgttc ctatatcatg 60 

atctttagca gcaactacac cataggcact gcacaacctg ggtcctagat caggacgtac 120 

aaaaaatcct ggcaaatgag aggccaaatt gaattttcct tctggattac aatattctgg 180 

caatggcaga ctttttaaaa gatcttcgta tcttgctggc atcatagtct tgaagtcttc 240 

tcctgaaggc caatctttca attttaaaac aactgtttct ccactcttgt ttttctgccg 300 

ttttgaaact tcttcaaaac catcccagaa ttccttaaca ttggcatttg aaatgatgct 36 0 

atctttgcag ttcaggagat cagcttggtg gtctccaaaa tcaagactaa ttgattccgc 420 

cttccatagg ctaatgttca ttttcttatg cacaccagaa accactgcag gctgtccttg 48 0 

tttccaacat tctttgaaaa gcttccaatt actgctattc ttataatcct taagccataa 540 

aatatgcttc tcacagatcc aagaatgtgg tatatcactg tataatttat tattttcatc 600 

cactgcagat attatgcttt cttcaggctc ttctttaagc tctggtttta catttatctt 660 

ggaggtttta cttggtggaa ttttgttttc aacaactgaa gcaattatgt catcaagaat 720 

gttaggcata gtccgtccac ttttgctact tggggctccc attgaatata ctggggcaaa 780 

ggcaatgcca gcatctgtas accccacacg tagctttcca gctgttgtag tcagcaaatc 840 

ccgtaaggtt gagccttgtt cattattctg ggacacaaga ggtgatgttc tgccatttgg 900 

agattcagag ttgtcttgtt ctctttcttc tttaatttgg ttttcaaggg taagttcttt 960 

gttttctttt ttttcctctc tggctttttg ctctgcaaga tctgctaacc agtgcagtgg 102 0 

tgactgggat tctggaggag ttaacttgtt atctgtgcct acatcactct ctgggctgct 1080 

gccaccattt ttctcagact tcggaggagt attttgctgc tgagactcag gcatgcacag 114 0 

agaaatttta ttactgtgat taagaacatt ctgtaaaact tgagatacac cattcattgt 1200 

aggaaaattt ccaacttgta aattctgttt gttagtacaa tgacaatggg atttaatacc 1260 

atatttttcc ctaagagtgt gcatggcatc tagaagatct gtcaaaacag aaccaggtat 132 0 

aatttgggtt ggcattaaat gtttgtgatc atgaggctgt ccyttcacac acttcatcca 1380 

agcatattag ttctttatcc ctagramyyc tycctttcct ttngccttgt aacaatctaa 1440 

gcaganccac aawkccacat tttkggcaga cccagtnraw kktaancawk gntgcttcac 1500 

atgcatcaca catctcccgg actcctctca ctgctctttt ccaggcaatt ttggcatcct 1560 

ttttcaccca ggacaaagct gttttttcag atgttactaa ttgacagaac ttatcaccta 162 0 

ttatatccaa gatatattta gaagtc t 1646 

<210> 807 

<211> 1029 

<212> DNA 

<213> Homo sapiens 

<400> 807 

tggggctgtg actgtattta cttcattctt gaatcccgcg tccccgtggc tgggggctga 60 

cacatccctg ggcaccactg tgacttcctg tgggtccctt cccttctgtc cctgactctg 120 

tagacccccc acaggaaggg tcctaggtag ggggaggttc ctcctccctt gaaaccctgg 180 

gccactctgt caaggcaaag ctctgggccc agcaccttgt aaaggctttg atgagaggag 240 

ctctggcttt tgctcagggc ctttggaccc caccctccag cccccaggaa tgcaggcgct 300 

caaagcctgt ggtnaggctg cccgaagcac gtgccgcagt tcttctggag tgggagcagg 360 

gggacagagc tttgggtaga ggagggtcac ctgcaaagct ggaatgccag gggagtgggc 42 0 

ggtgcctcca gctcctgggg gccagggtgt ctccatacct catgggcctg agcctgggca 480 

ggggtctgga gtgcacatag cccccaggca gggagagggc agtgacagga cagagccact 54 0 

catctgtccc aaagctgcac caaggggtgt cagcaacccc aacctactga cctactttgg 600 

gaccacaggc ccatctagtg caaatgaggc ccagaaagga gaaatgcttt gctcaacagc 660 

cacagtaggc tgacgtaacc tatgtaatgt agggtcaggg tgggcctgag ggatgancca 720 

ggtggtgggc aggtganaca ccaggtcccc tcctggcctc tgccccaccc agccctctcc 780 
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tgcacggcta ccagaagatg tccgggaaga acanactagc cctgagtagg gagtgtggtc 84 0 

aggtgcagag gagggcaggg gcccggatcc tggcccagaa acactctaaa acagaatccg 900 

atcctgagat gatccaaatc aaacagaata cttgacggaa atagtagagt ctgaaaatga 960 

tgcactctgc gcacacatat acaagacaca cacacacaca cgaatccacg cacacgaggc 1020 

acaccccac 1029 

<210> 808 
<211> 836 
<212> DNA 

<213> Homo sapiens 
<400> 808 

aaaaccgggt ataacacttt aatatagatt tgtggaactc tggcccttgc agccagaata 60 

cacatttata agccataaat aaagcacgca gaaaccataa attaatcgga cccgagacct 120 

ggatttcacc gtgtcaagat tgggaatgct ttttttttct ttttcttggt catttacaac 180 

agacccttac attatttttt ttcctgtttt taaacaatag tacaaccctc tggttctgtt 240 

aaaactacat ggttttacac cgagtcactc acaaaatttt tttttttttt taagtaagac 300 

ttccctgcaa caacagcaat ggaggagaac aacaacaaca aaaaaatcag aatctgcagg 360 

tgcttgaaga agcaggagtc tacacagtag tggaaaccgg aggctttttt ttaactttat 420 

attctttccc gttttcctcc ttatatagaa cgtggggtat ctgtgtggcc ctctgtttgg 480 

gacggaacrg ctgcagcggg tgaaggaaga ctgctgtctt gggggtgttg gggtgggggt 540 

gttatggatt tcttctccct tgcgtctctg caacaccgtc tccccaaagt ctcgaccccc 600 

acttgctctc tcacttrtcc tcgatccggg gtgccagagt tagccnggcc tgaagccgtc 660 

gtcttcttaa gaggagttca taatgggccg ggagtacacc ccctggtagt aggaggtatc 72 0 

tgcggccagg ggcgaggcgt ccaggcccgt tttgttcgtg accgggccca tggccaagct 780 

gccaggcatg ggggaaccgt agccggggta gtgcatcacc tgttcgtagg ccttga 836 

<210> 809 

<211> 1844 

<212> DNA 

<213> Homo sapiens 

<400> 809 

atcaggtgtt cctcccatgg caggagggaa gaaacccagc aaacggccag cctgggactt 60 

aaagggtcag ttatgtgacc taaatgcaga actaaaacgg tgccgtgaga ggactcaaac 120 

gttggaccaa gagaaccagc agcttcagga ccagctcaga gatgcccagc agcaggtcaa 180 

ggccctgggg acagagcgca caacactgga ggggcattta gccaaggtac aggcccaggc 240 

tgagcagggc caacaggagc tgaagaactt gcgtgcttgt gtcctggagc tggaagagcg 300 

gctgagcacg ccaggagggc ttggtgcaag agcttcagaa aaaacaggtg gaattgcagg 360 

aagaacggag gggactgatg tcccaactag aggagaagga gaggaggctg caacatcaga 420 

agcagccctg tcaagcagcc aagcagaagt ggcatctctg cggcaggaga ctgtggccca 480 

ggcagcctta ctgactgagc gggaagaacg tcttcatggg ctagaaatgg agcgccggcg 540 

actgcacaac cagctgcagg aactcaaggg caacatccgt gtattctgcc gggtccgccc 600 

tgtcctgccg ggggagccca ctccaccccc tggcctcctc ctgtttccct ctggccctgg 660 

tgggccctct gatcctccaa cccgccttag cctctcccgg tctgacgagc ggcgtgggac 720 

cctgagtggg gcaccagctc ccccaactcg ccatgatttt tcctttgacc gggtattccc 780 

accaggaagt ggacaggatg aagtgtttga agagattgcc atgcttgtcc agtcagccct 840 

ggatggctat ccagtatgca tctttgccta tggccagaca ggcagtggca agaccttcac 900 

aatggagggt gggcctgggg gagaccccca gttggagggg ctgatccctc gggccctgcg 960 

gcacctcttc tctgtggctc aggagctgag tggtcagggc tggacctaca gctttgtagc 1020 

aagctacgta gagatctaca atgagactgt ccgggacctg ctggccactg gaacccggaa 1080 
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gggtcaaggg ggcgagtgtg agattcgccg tgcagggcca gggagtgagg agctcactgt 114 0 

caccaatgct cgatatgtcc ctgtctcctg tgagaaagaa gtggacgccc tgcttcatct 1200 

ggcccgccag aatcgggctg tggcccgcac agcccagaat gaacggtcat cacgcagcca 1260 

cagtgtattc cagctacaga tttctgggga gcactccagc cgaggcctgc agtgtggggc 1320 

ccccctcagt cttgtggacc tggccgggag tgagcgactt gaccccggct tagccctcgg 1380 

ccccggggag cgggaacgct tcgggaaaca caggccatta acagcagcct gtccacgctg 1440 

gggctggtta tcatggccct gagcaacaag gagtcccacg tgccttaccg gaacagcaaa 1500 

ctgacctacc tgctgcagaa ctctctgggt ggtagtgyta agatgctcat 'gtttgtgaac 1560 

atttytccay tggaagagaa cgtytccgag tccctcaact ctctacgctt tgcctccaag 1620 

gtgaaccagt gtgttattgg tactgctcag gccaacagga agtgaagacg gatccagatc 1680 

tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtcct atgtctatgt atcgggtgag 1740 

gggtgggagg gttgctggag ggtgctttat tgggtggagg gcaccatgtc ccagggctat 1800 

caaataaaga atagtttggt ttttttttta aataaaggtt ttat 1844 



<210> 810 
<211> 489 
<212> DNA 
<213> Homo sapiens 



<400> 810 

gccccgctcc atgagcagtg actccccagc tcctcctggc accagtcccc agggctctcc 60 

tgttggtakw wmmwgctwyw ywtsyysswm mywmmycgkg racctcraga tctyyaccct 120 

aaaatarttc tgttgaattt caccctggcr atgtaaaytg akagcttatc ttcacagatg 180 

ysrganaakr gmcmayycmy cwkcaswcct swgncwmays tswrwcwrat ksmtkycykw 240 

kccctattta tgtaaaaata cagggtccct gagccagcct aaggcataag tgacttatcc 300 

ctcctccctg ctcacatata aattgtgtat ttagtgaaag gctgatcaaa grttcaaagr 360 

atgttatttg ttatctacct gtggacccag naggtcccca attccagtta tttccacctt 420 

tccaggaccg ggaccaatgt atatatgtaa ctggattggc tgttctcgtg tgtttgttaa 480 

aatgtgtgg 489 

<210> 811 

<211> 471 

<212> DNA 

<213> Homo sapiens 



<400> 811 

gccctcagcc acccccatcc ctgccccttc tgagactcac agcacccctt tccttcctct 60 

cctcccacct cctccctcag cccctcattc tccttgggaa tctgcagagg gctctgggac 120 

tcactgccgg atgtgaaatc caggcgtcag ctgtttccta ggcaagggca ggaaagtggt 180 

ctccagccct tgctccactc atgcctgggg gnctgggsyy gagtggtatc cctacctggc 240 

ctccccctgg cctctggcct ccagcgctgg gtttgtcgag tgagagagag agaggagctt 300 

gggttgcttc cctgtccccg ccccctctgt ggcattgtcc ctcccactct tatttttcta 360 

ccaattgcta tttttccgaa caatccttgt agagtatgta ccatccaaag gcaggagggc 42 0 

cctcggtggc cggctctggt tggagatggt acagttttat tgtacaggtg c 471 

<210> 812 
<211> 579 
<212> DNA 
<213> Homo sapiens 



<400> 812 
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cccaatgaat caacatactt tattagaccc actaagtgcc aggggagggg cctgtgccta 60 

ngagccaggt tacagggctc acccgtagat tcagtctggt ctctccccat catgcctctc 120 

acttccagtc tgggcttcta ataggagggc cccgacttct tccctcccag tcattctctc 180 

gaatggagaa tctttcctca ttccagggac accaaggctc aggaaggggc ctatccatca 240 

tcagtagagc cagacaagct ctcccatcgg acgtcctgtg gctgggccca gaaatgggtg 3 00 

ccgctgcctg tgggactgcc cttccgggaa ggaccagggt gtcttcagtg ctcttggcct 360 

gcacgtggna ggagagtagg cagatgtctg gtgctcttta agctcaaagg catcatggcc 420 

ctctckgnwg sarcrrrsrs akamragkym sssatcncag scagcscwnk arskstsgca 4 80 

nwsmwcatts casmtgcasc mmcmggrrrs mkcsksywcm kmagnsktnm scmtsgsrgy 540 

cagcgcagcg tagggtggca tcctcattgc agatgcagc 579 

<210> 813 
<211> 562 
<212> DNA 

<213> Homo sapiens 
<400> 813 

tttttttttt tccagatgta actcttgtct tttattccag catctcccag agctccaata 60 

tgtacagact ttatttatac acatataata tacaccatat atacttattt atagatattc 120 

acacaccagc ccacacactc gcacacactc acacgcacac acccttccag gaggggcgtg 180 

tggctgcctt ggagtcccgc tagscccaaa caagtgatac tgggcttgcc aggcagttgt 240 

gaggttttgt gttttttgct tttaaaaaga aggccatttc ctccagatgt gtcctccctc 300 

tccccaagcc ctaaaactcc tccccaaaac actctgaaaa aaattttttt aaaacaagrg 360 

gnttttcctt tgctytggsc caagtagttt ctngganagn tccrggscca tccacaagny 420 

ccgtgcaggt cctagagcac gagagccggg cgtggccttg gtcaggcctg cagctgtgcc 480 

ctctgagggg agaggggagg cgctatagca tcaagggcac ctgccagatg aggagggtgc 540 

tgtccgtctc cccacacggg gc 562 

<210> 814 
<211> 594 
<212> DNA 

<213> Homo sapiens 
<400> 814 

agcctcgcct gggccggcct gtggctccca ttttcctttc agcgggacaa aggggacttg 60 

ttaccaggcc attttctgga tggcctgtga gatctctgcc cctccaagac cckccaaryc 120 

tsmsyckgwc scmswgytsk smsmmwgmmt ycwgcmsygs smrccntgss rryktswrkc 180 

tggcaccagg ctgnagnctc cccaatccca gcccactttg ctgtgtctct ggcgggctgt 240 

cctccttggt gggagctgtc ctgcacactg taggatgctt aaaggtatcc ctkgcctcca 300 

cccaccccta gccagcagct cccagtcaga caacagccag awatgtctcc agactctgcc 360 

cagcctcccc aggtagccac cctcgagaca cgacctcaga gtctctgtgt ctcctagaag 420 

cctgacagag acccccaggg cagtgggtgg gtngcgggct agagaccctt gcctgtntcc 480 

gggaccctgg cgccgctctc ccctcctgtg gatccctccg gactaacagt gttcttagtn 540 

ggcagangct ggggcacccc ttnggccctg ncaggcatng ccattggcgc angc 594 

<210> 815 
<211> 812 
<212> DNA 

<213> Homo sapiens 
<400> 815 
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aaaaccgggt 
cacatttata 
ggatttcacc 
agacccttac 
aaaactacat 
ttccctgcaa 
tgcttgaaga 
attctttccc 
gacggaacrg 
gttatggatt 
acttgctctc 
gtcttcttaa 
tgcggccagg 
gccaggcatg 



ataacacttt 
agccataaat 
gtgtcaagat 
attatttttt 
ggttttacac 
caacagcaat 
agcaggagtc 
gttttcctcc 
ctgcagcggg 
tcttctccct 
tcacttrtcc 
gaggagttca 
ggcgaggcgt 
ggggaaccgt 



aatatagatt tgtggaactc tggcccttgc agccagaata 
aaagcacgca gaaaccataa attaatcgga cccgagacct 
tgggaatgct ttttttttct ttttcttggt catttacaac 
,ttcctgtttt taaacaatag tacaaccctc tggttctgtt 
cgagtcactc acaaaatttt tttttttttt taagtaagac 
ggaggagaac aacaacaaca aaaaaatcag aatctgcagg 
tacacagtag tggaaaccgg aggctttttt ttaactttat 
ttatatagaa cgtggggtat ctgtgtggcc ctctgtttgg 
tgaaggaaga ctgctgtctt gggggtgttg gggtgggggt 
tgcgtctctg caacaccgtc tccccaaagt ctcgaccccc 
tcgatccggg gtgccagagt tagccnggcc tgaagccgtc 
taatgggccg ggagtacacc ccctggtagt aggaggtatc 
ccaggcccgt tttgttcgtg accgggccca tggccaagct 
agccggggta gt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
812 



<210> 816 
<211> 999 
<212> DNA 

<213> Homo sapiens 
<400> 816 

aagccgcctt ctgagccttt ngcctctgtt gttcctcctg ctgcctgtga gttttcatgt 60 

gtgcatttcg gcttttgatc ttgaagaaga ctttgccnca ctccttgcag gggaagatgg 120 

tggtggggtc tgtctcgccg ctggtggtgc tgtgagaggg tgancncttt accncnacag 180 

tacccactct gggtgccncc aggcttctgc ttcccagags gkrtrrmmmc kmgggccttg 240 

ctttgccccc tgnaaaagct gccccctanc catagtatct cccaggcaaa gatgccatgc 300 

tcactgcaaa ctatggaatg aggtcagaac agaatcaaag taacgcttga tgggaaaagt 360 

tggccccaag accccagtac taagagggtc gcctgcgtct cacacacaca cactcacagc 420 

aagctttggg ataaaaggca accgggatgg ttgacatctg aatgcaatgg aacatgaagg 480 

tcagcttcag tccctactgg gaatgatttc atgagaaggt agcccagatg aaacacctct 540 

taaagatagt tgtgccaatt atttattccc ccaacccccc acaaaaacaa atttttttaa 600 

ataaaaggaa aagaaatagg attttttttt ctaaacctga ataaaatgac cacttttaaa 660 

acagrtagtt taaaagggtt acaaaacaag caggcagtcc aggtttcctg attaatgaag 720 

atggaggccg tgggttttca ctgtctctaa gtgacacaca gggctttata gttctgcgtc 780 

accctgaagc aagactgaat cttgatcatc caagagaaga tcggtgtcca caacttcagc 840 

ctcttccatg acacctccca actgctggac gacgtcgtcg tcgaggatgt ccacatcctt 900 

gtatgggttt gatcagactc agctggtcca ggggcagcag cmcgrcagca ccccacgggc 960 

ccgtagtcct ctcaatcgtg gctgccatct cagctgcaa 999 

<210> 817 
<211> 653 
<212> DNA 

<213> Homo sapiens 
<400> 817 

atttttaywt ttaaaacatt ttatgaggga taaaatatag tctttttcta tcagtatgtt 60 

cacacttcct ggcctctcat tgggaagctg taagatgtcc ttcaataaga tcctgaacac 120 

gcgacagaat aatctcatta gagctgctgc aattttctgg accatatggt gggtctatag 180 

tcaggacccc agccacacag agagtccttg gagcgtctcc ctgttcagtg atggggatgt 240 

ggttcttctc aagccatttc tttaggctgt tctttctctc ttccagatcc tctgggctgt 300 

atgctttgca gtctccagac gtgaacaaat gcatcagctt ctccctcact ctatggtccc 360 
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cttcattcat agtttcaaca gtckgcacag catgtcccat aattccggtc acagacatgc 
tgccatcttc aaggaagttc acaaggacaa tattggcaga gactgggtct gkagttaaam 
cccatccttt atactcattc ttctcactgg ctgtcactcg gacctctttg taaatgtaat 
cttgccattc taaggggcct ttcttcatcc attcactcat gattgccacc tggctaaatc 
agttaaaaaa ctcctcgcaa ctctgggtac tcagcaacca tgctttgagg aag 



420 
480 
540 
600 
653 



<210> 818 
<211> 1225 
<212> DNA 

<213> Homo sapiens 
<400> 818 

ggattctttc actgagcaca aagagttgtt ggggctttag catctgactg attttgttac 60 

ggggttgatt ctgaccatag gaagtatgca atgtgaatca ctatttacag agaaacctac 120 

aacagatgct tgatgttgta gaaactggga catatagata ccaagcaaaa ttataagaaa 180 

cctataaggt gttcaatacg cttgtgtttc caaaattcac tgtacatgat cagtttggtg 24 0 

ttcttgtacc acagttttta actgaaggaa ccagttgtaa cagtctcaat tttaactaaa 300 

acttgaagaa ctaaaacaac aatgcaaacc tttcagcatt gtttggccaa acttgttaaa 360 

actgtaatgc aagaaccaaa tgcactogtga tgtggcacca actaattagc aagcatgaat 420 

ttttcaccca agagtgaaaa aaggaaaatc taccatggct tgaagttaaa gagcagaact 480 

cctgactacc attctatgac tgatcaaaag actaatagtt aaaaacctca gcaggccttg 540 

ttcacgatat gcagaaaaaa aagtgctgca gtttagatac ctctggaatt tttccacagt 600 

gtcacaggtt tgtaatactt gaagccctac atttctaaga atatatttct tgctcagttg 660 

tttcakgcaa gcccaagact ttgtaatttt taaagggccc aagatttttt tttttttttt 720 

tttttcaaat aacagaccag cttctttttc ttgcagttac agatgtaatt tcctttttgt 780 

tgtcaaacat aaggtaccaa atatgatgca ataaattgtt ttgaaaaaca gttgtgtgaa 840 

tatttcaact aatctgtgtt gggcttctgt gaaatacaca ggtggaaaca gaggtgcaag 900 

ccagagcaat ngtaatatgc tgtaaggcta gtgcagatgg gagcttttta gaaggggcta 960 

agtgctggtg tcagggaaat tccataatga agtagaatgc tgctcctgca ttaagatttc 1020 

attgagggca aggctggtgg caggtactat gaatgtaatt cataatttaa aaggaaaact 1080 

aaaaactatt ttgatttggg aaaatgagcc ttaatttgtt aaacctatac actgaggaac 1140 

tagcctcagg ctttaatatt ctcattggca tttgccaagg tcctgaggcc aaataaggtt 12 00 

taagttaaaa caaatccaat tgtnt 1225 

<210> 819 
<211> 1024 
<212> DNA 

<213> Homo sapiens 



<400> 819 



gacaccccag 
gctgtggccg 
gacarggccg 
agctactatc 
tcactgctgc 
ctgtatggcc 
ctgacgmtgg 
aaggtgaccg 
tggggcttcc 
gagactgtgt 
ctcattccct 



atgcagccac 
tgggctcagg 
cgctcctgca 
agagcttctt 
cccggcctgg 
ccttctggat 
tgctggccca 
tggcaggcat 
tgcggtggcg 
gcatctacgg 
gtgcctntgg 



caccagcaga agcgatcagc tgaccccaca agggcacgtg 
tggcagctat ggagccgagg atgaggtgga ggaggagagt 
ggagcagcag cagcagcagc agccgggatt ctggaccttc 
tgacgtggac acctcacagg tcctggaccg gatcaaaggc 
ccacaacttt gtgcggcacc atctgcggaa tcggccggat 
ctgtgccacg ttggcctttg tcctggccgt cactggcaac 
gaggagggac ccctccatcc actacagccc ccagttccac 
cagcatctac tgctatrcgt ggctggtgcc cctggccctg 
caagggtgtc caggagcgca tggggcccta caccttcctg 
ctactccctc tttgtcttca tccccatggt ggtcctgtgg 
ctacagtggc tctttggggg cgctggccct gggcctgtnc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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aaccaccggg ctggtaatca ccctctggcc cgtggtccgt gaggacacca ggctggtggc 72 0 

cacagtgctg ctgtccgtgg tcgtgctgcn ccacgccctc ctggccatgg gctgtaagtt 780 

gtacttcttc cagtcgctgc ctcnggagna cgtggctcct ccaccccaaa tcanatctct 840 

gccctcaaac atcgcgctgt cccctacctt gccgcagtcc ctggccccct cctaggaagg 900 

nccgggtccc acaggcaaca cctaagtgga ccaacccctc tgcctgtcct gccccccaga 960 

cgatgactga aggctccttt gacaccttga gatgantctg ctactttcca gacttttctt 1020 

acaa 1024 

<210> 820 
<211> 631 
<212> DNA 

<213> Homo sapiens 
<400> 820 

atttttaywt ttaaaacatt ttatgaggga taaaatatag tctttttcta tcagtatgtt 60 

cacacttcct ggcctctcat tgggaagctg taagatgtcc ttcaataaga tcctgaacac 120 

gcgacagaat aatctcatta gagctgctgc aattttctgg accatatggt gggtctatag 180 

tcaggacccc agccacacag agagtccttg gagcgtctcc ctgttcagtg atggggatgt 24 0 

ggttcttctc aagccatttc tttaggctgt tctttctctc ttccagatcc tctgggctgt 300 

atgctttgca gtctccagac gtgaacaaat gcatcagctt ctccctcact ctatggtccc 360 

cttcattcat agtttcaaca gtckgcacag catgtcccat aattccggtc acagacatgc 42 0 

tgccatcttc aaggaagttc acaaggacaa tattggcaga gactgggtct gkagttaaam 480 

cccatccttt atactcattc ttctcactgg ctgtcactcg gacctctttg taaatgtaat 540 

cttgccattc taaggggcct ttcttcatcc attcactcat gattgccacc tggctaaatc 600 

agttaaaaaa ctcctcgcaa ctctgggtac t 631 

<210> 821 
<211> 635 
<212> DNA 

<213> Homo sapiens 
<400> 821 

aggttgctca cctgaaggag cacaggaggg ttttccaggc catgtggctc aggttcctca 60 

agcacaagct gcccctcagc ctctacaaga aggtgctgct gattgtgcat gacgccatcc 120 

tgccgcagct ggcgcagccc acgctcatga tcgacttcct cacccgcgcc tssgacctcg 180 

999999 ccct cagcctcttg gccttgaacg ggctgttcat cttgattcac aaacacaacc 240 

tggagtaccc tgacttctac cggaagctct acggcctctt ggacccctct gtctttcacg 300 

tcaagtaccg cgcccgcttc ttccacctgg ctgacctctt cctgtcctcc tcccacctcc 36 0 

ccgcctacct ggtggccgcc ttcgccaagc ggctggcccg cctggccctg acggctcccc 420 

ctgaggccct gctcatggtc ctgcctttca tctgtaacct gctgcgccgg caccctgcct 480 

gccgggtcct cgtgcaccgt ccacacggcc ctcgagttgg aacgccgacc cttacgaacc 540 

ctgggagagg aggacccagc ccagagccgg gctttgggag agttccttgt tggatttttc 600 

agggccttnc agcggcatta ccaacttgag gtttt 635 

<210> 822 
<211> 752 
<212> DNA 
<213> Homo sapiens 

<400> 822 

tgcttttatc ttgaatgtag ccttcaactt tgtgtaattc cttaccaaaa aggccacatg 60 
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gcttaaaatt caacacacat ttgtccccag tcttgtggtt tataatttcc acattgccat 12 0 

actgttcgat ccacagttta cccacaatga tattatgcac acagcaggtg ggatttgtcc 180 

atgtatatgc ctcattgtgt tcaaggagct ccaaggtgat ggttcctttg ggttctgctt 240 

ctacactctt cccccagaat ttcagtttgg gatagataga gccatgaaag atgaagtcat 300 

tgtttaatcc ttcagcatga aatgcactga ttggtgggtg atggctgacc tgttcggaga 36 0 

tgagtctaaa tccaaggtca tctcgcacta attcataagt ctctcccagc agtgggttga 420 

aaggttttcc agtccgttcc cactgagaag caacagcaga tacagcaaac gcagctacac 48 0 

actgcatcct ttccacagga tcagagagtg aactggcctt gtggaygagg taagtatgct 54 0 

ccatgtattc agttaggcgc tgtaggaagc tcagaggctc attaaatata actggcatcg 600 

tgatcttgga tagttccatt ccaatacatt ttctgaggat gctccagata ctgaagtcat 660 

ttctggaaaa cataggagaa ggcaaacttg ttctgtgttt cttgatgcca ttggagagag 720 

catctccgcc accacagtct ttttcttcgg ac 752 

<210> 823 
<211> 899 
<212> DNA 
<213> Homo sapiens 

<400> 823 

tttgccacag ggtaaacttt tattttagaa tccaatcttt tccccacaca tacacaataa 60 

attaaacaga atccacagta aatgtacatt ttttaacata aaaagtcagt tactgttact 120 

tcatgatcac atgaggatcg tcacagctcc gtgtccatta gcacattacc ctccttgtcc 180 

ttaactctta tccgaccgga tctgtacttc gtttcttgat gaccgtttgc atatacggtt 240 

ttaacagtgc catctgggta ttcccgtctc ttgaactggg cagtatgtag ttctctttgg 300 

ccattattaa actctatgag tttgttgcca tcacgttgta ctctgacaat tgtaccatct 360 

gggaaaatgc tttcttcttg tccatcagga aataagtttt taacagtctg gtcaggaaac 420 

gtgatttctt ttcttccatc tgggtaatgt ttttctrttt aaaaagttgt tacagtaaat 480 

attttttgaa ggaagggaag aatttaatga gagggtggag caagtttgta cctatttgtc 540 

cacttgagaa atgtaagact tccagtcctc cgggtatgtc gtgtgagtgg tctgggcagc 600 

tgcatagtag tagatctgta aagacacaca gtcagtctgc cttttctcca gagatggtta 660 

aactatggag gagaacactt ctggaaacat accactcttt ggtctggcat gacctgcttc 720 

acgtcaccat taaagaaagt gacagtgatg gtcttcccat ctgcactcac ttcctttcga 780 

gttccattgg gaaacagtat aacacggcac ccattcttat aaaccttttc cacctttcca 840 

tcaggatgac tgatttctcc ctgtatgtct tggtcttcct cctcctcttt atattcagg 899 

<210> 824 

<211> 1980 

<212> DNA 

<213> Homo sapiens 

<400> 824 

acccgtccgg ggccggccaa tttgcatatt tggaatgcgc cgctataaac ccggctgggg 60 

ttttgcagcg atttcttaga tgtaaaaatg agatctcaat agcagcgggc tgggcacatc 120 

ctcksmwytc ysskwswskm tstgcccrga gctggtttcc gtctctcggc tcggggctgg 180 

aactccggcc caacctaggc gcgcanccgc sacgagatgg cgcacttccg atcaatgtca 240 

aagccgccgg ggagccggga accccagcat gattcttggc ctttgttcgc ttctgatact 300 

aagagcagca cggtacatta tttcacttgt cccgctcccc ttcataacag aaaaagggga 360 

ctcaccctca agaagtgatt ggtatggtaa tttaaagcaa cgcgcattcg ctaggcctcg 420 

cgagcgtcgc cgcgcggaga agccagctgt cccttggcag tgatttcgga aatgtgtcaa 480 

ggcaattcca aaggtgaaaa cgcagccaac tggctcacgg caaagagtgg tcggaagaag 540 

cgctgcccct acacgaagca ccagacactg gagctggaga aggagtttct gttcaatatg 600 
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taccttactc gagagcggcg cctagagatt agccgcagcg tccacctcac ggacagacaa 660 

gtgaaaatct ggtttcagaa ccgcagatng aaactgaaga aaatgaatcg agaaaaccgg 720 

atccgggagc tcacagccaa ctttaatttt tcctgatgaa tctccaggcg acgcggtttt 780 

ttcacttccc gagcgctggt cccctccctc tgtcttcagg ctctgccagg aactcgcacc 840 

tgtgctggag ccctgttcct ccctcccaca ctcgccatct cctgggccgt tacatctgtg 900 

cagggctggt ttgttctgac tttttgtttc tttgtgtttg cttggtgctg gttwatttgt 960 

tgttttctgg gggaaaaagc catatcatgc taaaattcta tagagataga tattgtccta 1020 

agtgtcaagt cctgactggg ctgggtttgc tgtcttgggg tcccactgct cgaaatggcc 1080 

cctgtcttcg gccgagcntg gtttcctgcc cagcctgggg caaacctagc cgaaggccga 1140 

ggtcccattg ttggcgctga ggtgtctggc ctgaggtcaa tggtgcaaag gagccgccac 1200 

cggcatgtct gcctggagtg ctgtgctgtg tttaatcagg ggatacaggc ccctgggttt 1260 

cttttttctt tcttcctttc ttccttggcc aagagaaggg cttacaggca tggacatgca 1320 

ggttggcaaa cgggcttgac tttggctgat ttaaaaagtg agaaagaaag taaaaaaggt 1380 

taatttttcc ttcctctgta agatatccca gctttaaaaa gaaaaaaaaa aagaattacc 1440 

aagagaaggg gacttctctt ccagtttctg taaggtctta cattgcctga ctaaaatgtt 1500 

tcatttacct ctaaatttcc atatccttct ggctgtagat aaataatgta gttttgttta 1560 

tgcatttgga attagtggat ttttttgtca ttaaaattgt taccactggt aacatgtgac 1620 

aagcacacca caattctccc tatcttgtga agttgttttt ttaaatcgcc ttgaacaaaa 1680 

agtttttttt tttgtttgtt tttgctttct gaaattcaca gaagcctagg aggactgggg 1740 

taagcggaat aaactagaga agggagacat tgtttggatt tccttcctat aaatacaaat 1800 

ctgtataaat gtctattatt atgaagaatt gccaatcttg ttttaagcaa atgcattcta 186 0 

tcgttattat aaatgttagt tctagctcta tttacttcta atcttaaatc agaataaatt 1920 

aatattgtat tgctgctgtg cgtggaaaaa gacgatgttt atgttcttat agaataaaag 1980 

<210> 825 
<211> 333 
<212> DNA 

<213> Homo sapiens 
<400> 825 

tctagatatt gcccaatcgc tgcccacagt gcacatacct ttccaccagt cacatgtgag 60 

agggcagatt ttccaaatgc tcatcaccac ttggcactgt gtggactata attttggcca 120 

gttaggaaat ggcatctcat tgttttcatc ttaatttgcg tcagcctgat tactcattga 18 0 

aacttgtgag gttgagaaac ttttcttaag cttattggcc attcaagttt cctcctttat 240 

gaaatggttg ttcatgtcat ttkctcattt ttatattaga ttgkwttttnt wttttccagc 300 

tgacttgtag gaactctaca tcttatcaat att 333 

<210> 826 
<211> 658 
<212> DNA 

<213> Homo sapiens 
<400> 826 

tttttttttt tttttttttt ttttgaaggc ttcatgaata atttattcca tttgaagttt 60 

tgttttttgt ttttgttttt ttttttttaa aaagtataaa ccttttcatt tcctcaatca 120 

caatttgtac aactcagtgt tatggcattc ggcagcaata gtgtttgttc cttattctct 180 

ttttgtcacg ttaaaaanaa agcaattgga ccatattaaa tgtcactgct aaacaacaac 24 0 

tttaaaacgc cccttcataa agtgaccaag ctattttgag agggttgatg ctgacatgtc 300 

cagtaatgac gttacaattt gtagcttaaa ctcaataact ttaaggtcca catatccagt 360 

ttactttgaa aactaaagat gttttaaaac ttcatgaata catcaacctg aggagtattt 42 0 

taggkcccaa atccagtttt taaatttata ctccacnaaa aangaaaata catacataaa 480 
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awtttaaacc mcngttytgg gcccattwaa acaccmaaaa agaccccccn aaaagttaag 540 

anttccagct tanttctgga ngggtgggnc aaaatarraw kktwtawwma wwwymytwwt 600 

ccnkmattca gacaaactaa aatcttaaga ggaaacccag accaaaatat cactcatg 658 

<210> 827 
<211> 453 
<212> DNA 
<213> Homo sapiens 

<400> 827 

attatagaga ttaatctcct ttgctcgaag tctntttaaa tattagtcac atctaaaaca 60 

tacttttaca gcaacatcta gactggtgtt tgaccaaaca actgggcatc atagctgaca 120 

cataaaatta accatcacaa ccatgttcta ggcactgttc ctcactgcct gagaagacac 180 

cgttatgttt attagggttt ttgagtttta tccacagctt ttggttatct gcaaccatgt 240 

ctcccaccat taacatagtt cacactgaga tgaggattcc ctatttaaca cttggtccca 300 

acttcttcac agtccatctg gttttgtaga gggaacataa ctggacattc tggtcaggtt 360 

aggtgaggtc aggccttcag gacgctattt tcactgagtt gctttataag gcacattatg 420 

caaaattcca tcagctcttc tgttcactac att 453 

<210> 828 
<211> 657 
<212> DNA 
<213> Homo sapiens 

<400> 828 

aagagaagga cctagagatt gagaggctta agacgaagca aaaagaactg gaggccaaga 60 

tgttggccca gaaggctgag gaaaaggaga accattgtcc cacaatgctc cggccccttt 120 

cacatcgcac agtcacaggg gcaaagcccc tgaaaaaggc tgtggtgatg cccctacagc 180 

taattcagga gcaggcagca tccccaaatg ccgagatcca catcctgaag aataaaggcc 240 

ggaagagaaa gctggagtcc ctggatgccc tagagcctga ggagaaggct gaggactgct 300 

gggagctaca gatcagcccg gagctactgg ctcatgggcg ccaaaaaata ctggatctgc 360 

tgaacgaagg ctcagcccga gatctccgca gtcttcagcg cattggcccg aagaaggccc 420 

agctaatcgt gggctggcgg gagctccacg gccccttcag ccaggtggag gacctggaac 480 

gcgtggaggg cataacgggg aaacagatgg agtccttcct gaaggcaaac atcctgggtc 540 

tcgccgccgg ccagcgctgt ggcgcctcct gaccgtcgtc tcctcactcc gccttttcaa 600 

atttttgtat aaccccgtgt tgtgtaaata cagtttttgc tccggtaaaa aaaaaaa 657 

<210> 829 
<211> 775 
<212> DNA 
<213> Homo sapiens 

<400> 829 

ggtttgagaa aatcaattca aatctgnccc ttctgattgc anctctaacc aggttctgan 60 

cggtgtcaga gacttcccaa tacatttccc ttctagnatg cctcataaat ccactcaaaa 120 

gtaagacacc aaacacacac ctcatttcct gaactgtgac ttccaagctg acatttttct 180 

gagaagcata attattggtt tcattgacaa ttaagttgaa tgtttcatca tcaaaaaata 240 

attcaaaaag ctctactggg ttcaactttt cgctcttgag attcaaaagt ccagaatcca 300 

gtgctgacca gcttggaaaa ttgggtttaa tgtctctttt ggtccaactc ttttctggga 360 

aacatgatac ctttaacttc ttttgagcag gctggatctc aggctcatta tctttttcca 420 

catctgagtc accagagaat gagaggcctt ggagcagttc actcactcga gctttgtctt 480 
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tttttctccc ttttcgggta atgtctcctg cagcatattc cagggatgag atgtgcatgc 540 

gggcccacaa atcacctggg tgacggtcct tcagagtgtt caaatgtgca actgtccttt 600 

cagtagcaat aggagtacta caaggaatct ggggtgcaca ctctctgttg ggctttcctg 660 

aggcttctcc actttgttcc atttcttcag aagtttcttg ctttgcttta aacaatctat 720 

ctttagttac aatttcttca gctgggtgta gccccagctt tttagaaggc tgagg 775 

<210> 830 
<211> 413 
<212> DNA 
<213> Homo sapiens 

<400> 830 

agagcctgca agtgacaaag gaagtgaggc agaggcccac atgcccccac cgttcacacc 60 

ctacgtgcct cggattctga acggcttggc ctcggagagg acagcactgt ctccgcagca 120 

gcagcagcag cagacctatg gtgccatcca caacatcagc gggactatcc ctggacagtg 180 

cttggcgcak agcsmcasgk gcagtgtggc ntgctgcccc ccaggaggcc tgaggctggg 240 

tctcactgct ctgaaaagac acaaccagaa tggcctgggg ctcaggccct tggctgagtg 3 00 

ggaatgcgtt gggactgccc agctgagcta tcaggtgccc atcttttctg gtmccagcag 360 

tggtgaggag agcacaggca ggcctcgccc ctcccttgct canccagttt ccc 413 

<210> 831 
<211> 876 
<212> DNA 
<213> Homo sapiens 

<400> 831 

gctgacctac agcagaagct gctggatgca gaaagtgaag acagaccaaa acaacgctgg 60 

gagaatattg ccaccattct ggaagccaag tgtgccctga aatatttgat tggagagctg 120 

gtctcctcca aaatacaggt cagcaaactt gaaagcagcc tgaaacagag caagaccagc 180 

tgtgnykaca tgcakaagat gctgtttgag gaacgaaatc attttgccga gatagagaca 240 

gagttacaag ctgagctggt cagaatggag caacagcacc aagagaaggt gctgtacctt 300 

ctcagccagc tgcagcaaag ccaaatggca gagaagcagt tagaggaatc agtcagtgaa 360 

aaggaacagc agctgctgag cacactgaag tgtcaggatg aagaacttga gaaaatgcga 420 

gaagtgtgtg agcaaaatca gcagcttctc cgagagaatg aaatcatcaa gcagaaactg 480 

accctcctcc aggtagccag cagacagaaa catcttccta aggataccct tctatctcca 540 

gactcttctt ttgaatatgt cccacctaag ccaaaacctt ctcgtgttaa agaaaagttc 600 

ctggagcaaa gcatggacat cgaggatcta aaatattgtt cagagcattc tgtgaatgag 660 

catgaggatg gtgatggtga tgatgatgag ggggatgacg aggaatggaa gccaacaaaa 720 

ttagttaagg tgtccaggga agaacatcca agggtgttcc tgcaagggct ggtgtgggaa 780 

ccangccagt gtggggttcc aggnaagcca aaagtncaga ctggtggtgt tgactgtttg 840 

ctgtgacccc cacaaagttt ncggaaccgc ccacca 876 

<210> 832 
<211> 768 
<212> DNA 
<213> Homo sapiens 

<400> 832 

tagacataga aaacatacag taagaatatg gtattataat cttacggsam mamygysrmm 60 

trnsckkknw rwmktkgwaa agykgyrmyr sgrcsyanra mtanmmmtas ctrgytrrky 120 

mrywtwwmma tycctksccm gggagtttga aatttnatac tatagaaata actttaggtt 180 
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ttaggtagag ttaaagaggt aaagcacatg ttgnccacaa ncccaggaaa gtatttttaa 24 0 

gaaagattgg attttcctac ctttagagat ctaaaaaaaa tttaatataa aaaatcattt 300 

tgtgttggtg tttattacta gttcagatga gtggctgctg aaggggcccc cttgtcattt 360 

tcattataac ccaatttcca cttatttgaa ctcttaagtc ataaatgtat aatgacttat 42 0 

gaattagcac agttaagttg acactagaaa ctgcccattt ctgtattaca ctatcaaata 480 

ggaaacattg gaaagatggg gaaaaaaatc ttattttaaa atggcttaga aagttttcag 54 0 

attactttga aaattctaaa cttctttctg tttccaaaac ttgaaaatat gtagatggac 600 

tcatgcatta agactgtttt caaagctttc ctcacatttt taaagtgtga ttttcctttt 660 

aatatacata tttattttcy ttaaagcagc tatatcccaa cccatgactt tgggrgatat 720 

acccataaaa ccmatataac agcaggggta ttggagcagc tttctcaa 768 

<210> 833 
<211> 1604 
<212> DNA 

<213> Homo sapiens 
<400> 833 

aactagtata tttacaacat cagaaacttc aatatggaga tttgttgttc ctatatcatg 60 

atctttagca gcaactacac cataggcact gcacaacctg ggtcctagat caggacgtac 120 

aaaaaatcct ggcaaatgag aggccaaatt gaattttcct tctggattac aatattctgg 180 

caatggcaga ctttttaaaa gatcttcgta tcttgctggc atcatagtct tgaagtcttc 240 

tcctgaaggc caatctttca attttaaaac aactgtttct ccactcttgt ttttctgccg 300 

ttttgaaact tcttcaaaac catcccagaa ttccttaaca ttggcatttg aaatgatgct 36 0 

atctttgcag ttcaggagat cagcttggtg gtctccaaaa tcaagactaa ttgattccgc 420 

cttccatagg ctaatgttca ttttcttatg cacaccagaa accactgcag gctgtccttg 480 

tttccaacat tctttgaaaa gcttccaatt actgctattc ttataatcct taagccataa 540 

aatatgcttc tcacagatcc aagaatgtgg tatatcactg tataatttat tattttcatc 600 

cactgcagat attatgcttt cttcaggctc ttctttaagc tctggtttta catttatctt 660 

ggaggtttta cttggtggaa ttttgttttc aacaactgaa gcaattatgt catcaagaat 720 

gttaggcata gtccgtccac ttttgctact tggggctccc attgaatata ctggggcaaa 780 

ggcaatgcca gcatctgtas accccacacg tagctttcca gctgttgtag tcagcaaatc 840 

ccgtaaggtt gagccttgtt cattattctg ggacacaaga ggtgatgttc tgccatttgg 900 

agattcagag ttgtcttgtt ctctttcttc tttaatttgg ttttcaaggg taagttcttt 960 

gttttctttt ttttcctctc tggctttttg ctctgcaaga tctgctaacc agtgcagtgg 1020 
tgactgggat tctggaggag ttaacttgtt atctgtgcct acatcactct ctgggctgct . 1080 

gccaccattt ttctcagact tcggaggagt attttgctgc tgagactcag gcatgcacag 1140 

agaaatttta ttactgtgat taagaacatt ctgtaaaact tgagatacac cat teat tgt 1200 

aggaaaattt ccaacttgta aattctgttt gttagtacaa tgacaatggg atttaatacc 1260 

atatttttcc ctaagagtgt gcatggcatc tagaagatct gtcaaaacag aaccaggtat 132 0 

aatttgggtt ggcattaaat gtttgtgatc atgaggctgt ccyttcacac acttcatcca 1380 

agcatantag ttctttatcy etagaactne tycctttcct ttngccttgt aacaatctaa 1440 

gcaganccac aawkecacat tttkggcaga cccagtnraw kktaancawk gntgettcac 1500 

atgeatcaca catctcccgg actcctctca ctgctctttt ccaggcaatt ttggcatcct 1560 

ttttcaccca ggacaaagct gttttttcag atgttactaa ttga 1604 

<210> 834 
<211> 617 
<212> DNA 
<213> Homo sapiens 



<400> 834 
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gtccgtcagc tggtagcttt cattcgtaaa agagataaaa gagtgcaggc gcatcgaaaa 60 

cttgtggaag aacagaatgc agagaaggcg aggaaagccg aagagatgag gcggcagcag 12 0 

aagctaaagc aggccaaact ggtggagcag tacagagaac agagctggat gactatggcc 180 

aatttggaga aagagctcya ssangrtgrm srcrsgkkac gagaaggagt ttggagatgg 240 

atcggatgaa aatgaaatgg aagaacatga actcaaagat gaggaggatg gtaaagacag 300 

tgatgaggcc gaggacgctg agctctatga tgacctttac tgcccagcat gtgacaaatc 360 

gttcaagaca gaaaaggcca tgaagaatca cgagaagtca aagaagcatc gggaaatggt 420 

ggccttgcta aaacaacagc tggaggagga agaagaaaat ttttcaagac ctcaaattga 480 

tgaaaatcca ttagatgaca attctgagga agaaatggaa gatgcaccaa aacaaaagct 540 

ttctaaaaaa cagargaaaa agaaacagaa accagcacag gatgtacctg gcaaagattc 600 

atatctgcct gcagctc 617 



<210> 835 
<211> 542 
<212> DNA 
<213> Homo sapiens 



<400> 835 

tttttttttt agaccaacat tctttaatca caaaggcact tgaggacccc tacaaaccca 60 

aagtctctgc caagagtggc cctgcagacg ccccacctgc caccctccat ccacccatcc 120 

atccacacac tcagagttca tcgtgacctg cagagggctc cacactaggc ttgatgaaga 180 

tgccttccat ggccttccac gtattgtgcg tgttggcact gggcatgccg tggacctcat 240 

gctgcccacg gatggggctt ccatactgct cacccgtgac tgacaggaac acagaggtgc 300 

ccacatgctn grarsgcaca gcagcctcac gctcccagnn gctgntccag agcagcgcac 360 

tgtccatann gktccaggtc gtcgccctcg ccgtcttccc caaaggcact cacctcctgg 420 

ttgttggaca gcggcgangg gaagtggtgc gtgtgcaggt tcnttgnccg taagcacatg 480 

cgtgagcctc accgcctgcc cgcagcgcac cgcaagggcc caggcggagc cgacgctcgc 540 

gc 542 

<210> 836 
<211> 542 
<212> DNA 
<213> Homo sapiens 

<400> 836 

tttttttttt agaccaacat tctttaatca caaaggcact tgaggacccc tacaaaccca 60 

aagtctctgc caagagtggc cctgcagacg ccccacctgc caccctccat ccacccatcc 12 0 

atccacacac tcagagttca tcgtgacctg cagagggctc cacactaggc ttgatgaaga 180 

tgccttccat ggccttccac gtattgtgcg tgttggcact gggcatgccg tggacctcat 24 0 

gctgcccacg gatggggctt ccatactgct cacccgtgac tgacaggaac acagaggtgc 300 

ccacatgctn grarsgcaca gcagcctcac gctcccagnn gctgntccag agcagcgcac 360 

tgtccatann gktccaggtc gtcgccctcg ccgtcttccc caaaggcact cacctcctgg 420 

ttgttggaca gcggcgangg gaagtggtgc gtgtgcaggt tcnttgnccg taagcacatg 480 

cgtgagcctc accgcctgcc cgcagcgcac cgcaagggcc caggcggagc cgacgctcgc 540 

gc 542 



<210> 837 
<211> 719 
<212> DNA 

<213> Homo sapiens 
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<400> 837 

aaaaggtccc ccttctggga aagaccgagt gaagaaaggt ggatcctaca tgtgccatag 60 

gtcttattgt tacaggtatc gctgtgctgc tcggagccag aacacacctg atagctctgc 120 

ttcgaatctg gmttccgct gtncagccga ccgnctgccc actatngact gacaaccaag 180 

gaaagtcttc cccantccaa ggagcagtcg tgtctgacct acattgggct tttctcagaa 24 0 

ctttgaacga tcccatgcaa agaattccca ccctgaggtg tttnacatac ctgcccaatg 300 

ncaaaggaac cgccttgtga gaccaaattg ctgacctggg tcagtgcatg tgctttatgg 360 

tgtggtgcat ctttggagat catcgccata ttttactttt gagagtcttt aaagaggaag 420 

gggagtggag ggaaccctga gctaggcttc aggaggcccg cgtcctacgc aggctctgca 480 

caggggttag accccaggtc cgacgcttga ccttcctggg cctcaagtgc cctcccctat 540 

caaatgacag ggatggacag catgacctct gggtgtctct ccaactcacc agttctaaaa 600 

agggtatcag attctattgt gacttcataa gtgagaattt atgatagatt attttttagc 660 

tattttttcc atgtgtgaac cttgagtgat actaatcatg taaagtaaga gttccctta 719 

<210> 838 
<211> 579 
<212> DNA 

<213> Homo sapiens 
<400> 838 

aagatatgca gagatattcc aggatctttt agctttggtg cggtctcctg gagacagtgt 60 

tattcgccaa cagtgtgttg aatatgtcac atccattttg cagtctctct gtgatcagga 120 

cattgcactt atcttaccaa gctcttctga aggttctatt tctganctgg agcagctctc 180 

caattctcta ccaaataaag aattgatgac ctcaatctgt gactgtctgt tggctacgct 240 

agctaactct gagagcagtt acaactgttt actgacatgt gtcagaacaa tgatgtttct 300 

tgcagagatg attatggatt atttcattta aaaagttctt taaggaaaaa cagtagtgct 360 

ctgcatagtt tactgaaacg agtggtcagc acatttagta aggacacagg agagcttgca 420 

tcttcatttt tagaatttat gagacaaatt cttaactctg acacaattgg gatgctgtgg 480 

gagatgataa tgggtctcat gggaagtagg aggggagctc atacatcacg gacgatgagt 540 

attaatgctg cagagttaaa ccagcttctt ccaaggcaa 579 

<210> 839 

<211> 1172 

<212> DNA 

<213> Homo sapiens 

<400> 839 

aaccaaacct cccaacttag tgaaaacaag gcattcaatg acagaccagc agcagaaact 60 

gcntattacc tcctaatcat tttatgaaga aatacctata taaaaacaaa cactaaagag 120 

nacaaataga tttaactaaa gtgacaagca taattataaa taaataccag attatcagat 180 

tttaaacaat aatctataac agttttacta tctaaggatt ttcactccaa gaagaaaaaa 240 

tacatagtaa cgccaagctt gcaggacgat gacttaacag atacattttc tcttaatgga 300 

aacttatcta gcttcagtaa tatttctgga tgtagcatca agttgctgtt gcacattttt 360 

aaaagactgg tccagcagtg tttcctcttc atttaaagta ttggcaatag catcattaca 420 

tggattgtcc agaatgtctt cgtttaatcc atttgactcc tccttttgat cctcatcagt 480 

attaacctct tcaaccgtgt gtgccctggg tgtattcatt aacatatcat ttccyagggt 54 0 

ctgactatta ctcagcagct tkgcctgcct tctttccarg gccagttggt twatttcycy 600 

caattctttg ttgttgctct tctgttaggc ttctacttaa ctcagaagca aacatctcac 660 

tttcagataa gtttgtcaga aagggatcta attcagtaga agtgacatca tgttcattat 720 

tctccgcaac ttcatcatta ttgctaacaa aatcttcatg taaaataggg agatcaagtc 780 

gaattcgttt taaacaggtc tgaacttcct ttt tact tec caggtattca actctgtcaa 840 
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taaaatcctc aaactgcagt ttagggaata gcctatgtgc ccagtgctcc atgtgtctga 900 

ttagcatctt caagtcttca gcctcatgac ctttaccttt gaattttgcc ttatcaaata 960 

catgccttaa ggctggaagt cctctctctg aaattaatct ctgagcatcc agcttgggta 1020 

tatttctttt aactgttctc tttggaggta caggaacagg tgctccattt cctgactctt 1080 

catcaggctc agttccttca ccatcttgtc tctctggaga ggctggaggt gggaaaggag 1140 

gaaaagtttc atcttctaca tgctcataat ct 1172 

<210> 840 

<211> 1145 

<212> DNA 

<213> Homo sapiens 

<400> 840 

cctcctactc ccaaacaaat ctttggggaa aaaaaaacta ccaactgtca gccatgggcc 60 

tgacggcgct aagctctggg gctccgtgca ctgacgtggg gccagccaca gggaggcggg 120 

gatsmrgymg cgngassscm ggakywkgrs cwscwscsrs gymrgkwgca gnrgcrgygg 18 0 

crhcrsganc mrmagcagcn tgmwgcagct cawgcacctg gagtcctttt aygaaaaamc 24 0 

yyctcctggg cttatcaagg aagatgagac taagccagaa gattgcatac cagatgtacc 300 

aggcaatgaa cacgccaggg aatttctggc tcatgcacca actaaaggac tttggatgcc 36 0 

actggggaaa gaagtcaaag ttatgcagtg ttggcgttgc aaacgctatg gtcaccgaac 420 

gggtgacaaa gaatgccctt tctttatcaa aggcaaccaa aagttagagc agttcagagt 480 

ggcacatgaa gatcccatgt atgacatcat acgagacaat aaacgacatg aaaaggacgt 54 0 

aaggatacag cagttaaaac agttactgga ggattctacc tcagatgaag ataggagcag 600 

ctcctngttcc tctgaaggta aagagaaaca caagaaaaag aagaagaaag aaaagcataa 660 

gaaaaggaag aaagaaaaga aaaagaagaa aaaacggaag cacaaatctt ccaagtcaaa 720 

tgagggttct gactcagagt gacaaggatg tgacttgttc aacattctct tctcaaacac 780 

tgaccaagga acagaggaag atgcagtcag agaaagcagc aggatagaga cgccgagaga 84 0 

ggagtatatg tgggtcacag cagtgagctc ccacccgcct tgcagtgaag atgtgacccc 900 

aggagaggga gtgtctcctt ccaggtgcta gctctggaca gcagctgatt ttaggcagga 960 

aagtttcttc atcgttgtcc tccctgctgg tcacatgagt ttacgattcc tttgaagtgt 1020 

ctcccacagg gtggcaggac tgggagaatc tctgaggcgt gtcttccagg ccctcccaca 1080 

gcttgtgccc tccacagtgt ggactcaggt cccatagaca tcaggctgga gtcttctctg 1140 

ttgtt 1145 

<210> 841 
<211> 642 
<212> DNA 
<213> Homo sapiens 

<400> 841 

ttttttataa aaataaatat ttattgccat ttgaagcttt atgtacacct ttaaaagcac 60 

atgtacaaat gtgggaaatt acaaaaatca acctaaaacc ctttttctca aagtatacat 120 

aaatgtacat ccaagatcag tggtgctacc atcattagaa taaaaaataa gtctgtctgg 180 

acataaacaa gcaatcattt taagtgtcat tcagatattc tcctttatat ttaaaactcc 240 

aaaaaatact aagaggccca atatatccag aaaattgtgt tttcacttta ccctaactta 300 

tgaatagtgg tatacaaata tatttccatc tttttgtcca gccagcaaat gagagtctgt 360 

acccgaccat ttcacaaaag accaatgttg gtcagagaca gskskgagrr ksgymktasr 420 

stkamysasa akkarstsmm amayrgsrmt tnykcmasra stcamkmtyk ytgsyrcasr 480 

gwkrwctyws rmswmwmmwk msargramcca tttcagaata ggctttgtga cagactgaag 540 

cttggtaaga atcatcaatg tgcatctttt tcaggagttg accagttttt aaattccaaa 600 

taacaatgtt gttcataata gtagtaccaa gcagagcttc tt 642 
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<210> 842 
<211> 452 
<212> DNA 
<213> Homo sapiens 

<400> 842 

acggcctggt ggagcagctg tacgacctca ccctggagta cctgcacagc caggcacact 60 

gcatcggctt cccggagctg gtgctgcctg tggtcctgca gctgaagtcg ttcctccggg 120 

agtgcaaggt ggccaactac tgccggcagg tgcagcagct gcttgggaag gttcaggaga 180 

actcggcata catctrcagc cgccgccaga gggtttcctt cggcgtctct gagcagcagg 240 

cagtggaagc ctgggagaag ctgacccggg aagaggggac acccytgacc ttgtactaca 300 

gccactggcg caantgcgtg accgggagat ccagctggag atcagtggca aagagcggct 360 

ggaagacctg wacttccctg agatcaaacg aaggaagatg gctgacagga aggatgagga 420 

caggwagcaa tttaaagacc tcttttgacc tg 452 

<210> 843 
<211> 805 
<212> DNA 
<213> Homo sapiens 

<400> 843 

ggcttataca acatagtggg gaacgcatgg gaatggactt cagactggtg gactgttcat 60 

cattctgttg aagaaacgct taacccaaaa ggtccccctt ctgggaaaga ccgagtgaag 120 

aaaggtggat cctacatgtg ccataggtct tattgttaca ggtatcgctg tgctgctcgg 180 

agccagaaca cacctgatag ctctgcttcg aatctggrnt tccgctgtnc agccgaccgn 240 

ctgcccacta tngactgaca accaaggaaa gtcttcccca ntccaaggag cagtcgtgtc 300 

tgacctacat tgggcttttc tcagaacttt gaacgatccc atgcaaagaa ttcccaccct 360 

gaggtgtttn acatacctgc ccaatgncaa aggaaccgcc ttgtgagacc aaattgctga 42 0 

cctgggtcag tgcatgtgct ttatggtgtg gtgcatcttt ggagatcatc gccatatttt 480 

acttttgaga gtctttaaag aggaagggga gtggagggaa ccctgagcta ggcttcagga 540 

ggcccgcgtc ctacgcaggc tctgcacagg ggttagaccc caggtccgac gcttgacctt 600 

cctgggcctc aagtgccctc ccctatcaaa tgacagggat ggacagcatg acctctgggt 660 

gtctctccaa ctcaccagtt ctaaaaaggg tatcagattc tattgtgact tcataagtga 720 

gaatttatga tagattattt tttagctatt ttttccatgt gtgaaccttg agtgatacta 780 

atcatgtaaa gtaagagttc cctta 805 

<210> 844 
<211> 702 
<212> DNA 
<213> Homo sapiens 

<400> 844 

tttttttttt tttttttgca ggtgcatttg tttctttatt taaaaaaatc atctgggggc 60 

atggtctgag gaggacaccc ctcccatggc tttggggagg acgcaggttc caggagtcac 120 

agggcagaaa cacgcggggt gggtgggggc gtggccggag tggggagggg ctgtsccagg 180 

cacctggggg tggctcccac ggcaccaggt gggctagggc aacagtatgt acaggcgagc 240 

agtgctcctg gacccggtcg gggccggctg gggcccattt ctgcggcagg ggagctctgg 300 

ggcacagggt ctgagtccca tcttgggctk cagggaccgc gaggscgtcc agggaggctg 360 

gacagcgggg gcctttatct gggcccatca ggtggatgag aacggacact gcaaaccgct 420 

caccacctgg gccagggcta ggctatccgg cagggcctcc ccmmctgaat cctgcgtgcg 480 

cagaactcaa gccggcatnc aggcagtkgg aacgncccgc angctgggct tggktgsyck 540 



273 



WO 99/33982 PCT/US98/27610 



crsgcacgtg acaggtgggg cccgtgtcct gataaacgga caggaacaaa aggaacgcaa 600 
ggtctgggac ccacggctct gggagcagcg ccacccaggc tggctcctag cagagaaatg 660 
ggaatcgcaa atgcattgca atgtgcagtg aagagacgcg ag 702 
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